
Siddharth Sharma
@siddrrsh
Followers
3K
Following
10K
Media
31
Statuses
991
CS @ Stanford. Building @mlfoundry. Prev @AWS, @Lux_Capital, @UniofOxford
Joined July 2020
Introducing ambientGPT: an open-source and multimodal MacOS foundation model GUI. Run GPT-4o and open-source models with full ambient knowledge of your screen. Foundation models have long been confined to the browser. With ambientGPT, your screen context is directly inferred as
31
88
580
Re Llama3V: Firstly, we want to apologize to the original authors of MiniCPM. @AkshGarg03 and I posted Llama3V with @mustafaaljadery. Mustafa wrote the code for the project. Aksh and I were both excited about multimodal models and liked the architectural extensions on top of.
Shocked! Llama3-V project from a Stanford team plagiarized a lot from MiniCPM-Llama3-V 2.5!.its code is a reformatting of MiniCPM-Llama3-V 2.5, and the model's behavior is highly similar to a noised version of MiniCPM-Llama3-V 2.5 checkpoint. Evidence:
45
42
268
RT @SakanaAILabs: Sakana AI is proud to sponsor the LLM Merging Competition: Building LLMs Efficiently through Merging at #NeurIPS2024 🤗. I….
0
61
0
Will be an awesome event!.
Updates w.r.t. the upcoming Compound AI Systems Workshop (June 13th in San Francisco):. Accepted Posters. We are excited to announce the accepted posters for the Compound AI Systems workshop. Due to space constraints, we were only able to accept 28 featured posters. Sincere.
0
0
1
Congrats Omar!.
I'm excited to share that I will be joining MIT EECS as an assistant professor in Fall 2025!. I'll be recruiting PhD students from the December 2024 application pool. Indicate interest if you'd like to work with me on NLP, IR, or ML Systems! Stay tuned for more about my new lab.
0
0
3
RT @MajmudarAdam: I've spent the past ~3 weeks going through the entire history of deep learning and reimplementing all the core breakthrou….
0
364
0
RT @AnthropicAI: This week, we showed how altering internal "features" in our AI, Claude, could change its behavior. We found a feature th….
0
249
0
Great to see a community developing around this! @ollama, @GroqInc, and @vllm_project integrations on the way 🫡.
Introducing ambientGPT: an open-source and multimodal MacOS foundation model GUI. Run GPT-4o and open-source models with full ambient knowledge of your screen. Foundation models have long been confined to the browser. With ambientGPT, your screen context is directly inferred as
2
1
4
I love GPU pods.
1/ @SohamGovande, @jameszhou02, @jzhou891 and I spent the weekend building PodPlex: A platform for distributed training & serverless inference at scale. I'm very glad to say that we left $10,000 GPU credits richer and 36 hours of sleep poorer. more details in 🧵
0
0
6
Excellent writing on decentralized/distributed training. Exciting times ahead!.
1/ As promised, here's my thesis on the future of decentralized training of foundation models. Covers:. 1) why decentralized makes sense from scaling, margins, and marketplace lenses.2) challenges .3) exciting enabling research shifts. In long form at:
0
0
5