Gowtham Ramesh Profile
Gowtham Ramesh

@gowtham_ramesh1

Followers
365
Following
6K
Media
0
Statuses
138

Applied RS GenAI @AMD | Ex - Student Researcher @GoogleAI NYC, @WisconsinCS, @ai4bharat, @iitmadras

San Jose, CA
Joined October 2016
Don't wanna be here? Send us removal request.
@gowtham_ramesh1
Gowtham Ramesh
9 months
We've just open-sourced our 1B language model with comprehensive details to make it easy to reproduce. Check out the model card and other details here:
@EmadBarsoumPi
Emad Barsoum
9 months
AMD first 1B LLM model is released!!! proud of the team. We released everything training script, dataset detail, weights, score card and benchmark results. #AMD #LLM #ML #AI #HW #MI300X.
0
4
28
@gowtham_ramesh1
Gowtham Ramesh
1 month
RT @PrakamyaMishra: 🎉Introducing TTT-Bench: A new benchmark for evaluating reasoning ability with Simple and Novel Tic-Tac-Toe-style Games🧩….
0
1
0
@gowtham_ramesh1
Gowtham Ramesh
1 month
RT @EmadBarsoumPi: Introducing Instella-Long model that we trained from scratch, on MI300X, with up to 128K sequence length, fully open: ch….
0
6
0
@gowtham_ramesh1
Gowtham Ramesh
1 month
RT @PrakamyaMishra: 🚨Introducing Instella-3B-Long-Instruct✨.- 128K context length 📏.- Trained on 64 Instinct MI300X GPUs ⚡️.- Fully open mo….
0
3
0
@gowtham_ramesh1
Gowtham Ramesh
1 month
RT @Wu_Jialian: We released Instella-Long, a fully open 3B long-context language model supporting 128K context length, trained on AMD MI300….
0
1
0
@gowtham_ramesh1
Gowtham Ramesh
2 months
RT @SarvamAI: Today we introduce Sarvam-M, a 24B open-weights hybrid model built on top of Mistral Small. Sarvam-M achieves a new benchmar….
0
300
0
@gowtham_ramesh1
Gowtham Ramesh
2 months
RT @PrimeIntellect: Releasing INTELLECT-2: We’re open-sourcing the first 32B parameter model trained via globally distributed reinforcement….
0
303
0
@gowtham_ramesh1
Gowtham Ramesh
3 months
RT @SarvamAI: It's official 🇮🇳. We're proud to announce that Sarvam has been selected by the Government of India under the IndiaAI Mission….
0
763
0
@gowtham_ramesh1
Gowtham Ramesh
3 months
RT @tatsu_hashimoto: I think CS336 has one of the best LLM problem sets of any AI/LM class thanks to our incredible TAs (@nelsonfliu,@Gabri….
0
63
0
@gowtham_ramesh1
Gowtham Ramesh
3 months
RT @EmadBarsoumPi: Reinforcement Learning from Human Feedback via VeRL on AMD MI300X and ROCm, single and multiple nodes training. @thu_yus….
0
3
0
@gowtham_ramesh1
Gowtham Ramesh
3 months
RT @AMD: AMD has announced the open-sourcing of Instella, a fully open 3-billion-parameter LMs trained on AMD Instinct MI300X GPUs. These….
0
45
0
@gowtham_ramesh1
Gowtham Ramesh
3 months
RT @PeoplePlusAI: @ai4bharat started with a few students experimenting with deep learning projects at IIT Madras. What began with building….
0
26
0
@gowtham_ramesh1
Gowtham Ramesh
4 months
RT @PrakamyaMishra: 🚀AMD Instinct�� GPUs go-brrrrr. 💡Key releases and blogs:. 🤖Model Releases:. Introducing Instella✨: New State-of-the-art….
0
2
0
@gowtham_ramesh1
Gowtham Ramesh
4 months
RT @EmadBarsoumPi: Volcano Engine Reinforcement Learning for LLM now support AMD GPU and ROCm!!!. . Looking forward….
0
7
0
@gowtham_ramesh1
Gowtham Ramesh
4 months
RT @EmadBarsoumPi: Introducing Instella-VL-1B, AMD first vision language model!!! Trained on MI300X and fully open. Kudo to AMD GenAI Resea….
0
3
0
@gowtham_ramesh1
Gowtham Ramesh
4 months
RT @EmadBarsoumPi: Introducing AMD Intella3B, a fully open 3B LLM model trained from scratch on MI300X on 4 trillion tokens with competitiv….
0
13
0
@gowtham_ramesh1
Gowtham Ramesh
4 months
RT @Wu_Jialian: We released our 3B LLM Instella-3B✨! Instella-3B is trained on AMD MI300 GPUs. Training data, code, weights are released:.G….
0
3
0
@gowtham_ramesh1
Gowtham Ramesh
4 months
RT @PrakamyaMishra: 🚀 𝗜𝗻𝘁𝗿𝗼𝗱𝘂𝗰𝗶𝗻𝗴 𝗜𝗻𝘀𝘁𝗲𝗹𝗹𝗮✨: 𝗙𝘂𝗹𝗹𝘆 𝗢𝗽𝗲𝗻 𝟯𝗕 𝗟𝗟𝗠𝘀 𝗯𝘆 𝗔𝗠𝗗 . Blog: Keep an eye out for more exciting b….
0
2
0
@gowtham_ramesh1
Gowtham Ramesh
5 months
RT @Thom_Wolf: After 6+ months in the making and burning over a year of GPU compute time, we're super excited to finally release the "Ultra….
0
708
0
@gowtham_ramesh1
Gowtham Ramesh
6 months
RT @pratykumar: The deepseek models are a step change and will enable faster progress towards even more powerful intelligent systems. What….
0
75
0
@gowtham_ramesh1
Gowtham Ramesh
6 months
RT @AnushElangovan: MI300x perf tracked now in every Pytorch Inductor / torch.compile commit
0
10
0