chansung

@algo_diver

Followers
5K
Following
13K
Media
426
Statuses
5K

@GoogleDevExpert for ML and @googlecloud | @huggingface Fellow | @dstackai Ambassador | @MistralAI Ambassador | Researcher | Engineering | Open Source Lover

Daejeon, Republic of Korea
Joined August 2018
@algo_diver
chansung
4 years
The best award I have ever received ❤ thanks @TensorFlow
11
7
346
@algo_diver
chansung
7 hours
RT @QGallouedec: Merry Christmas 🎁 GSPO is in TRL. Looking forward to see your reward curves 📈
0
17
0
@algo_diver
chansung
3 days
RT @satyanadella: Today we’re releasing GitHub Spark — a new tool in Copilot that turns your ideas into full-stack apps, entirely in natura…
0
3K
0
@algo_diver
chansung
3 days
RT @GoogleLabs: We just discovered the 🔥 COOLEST 🔥 trick in Flow that we have to share: Instead of wordsmithing the perfect prompt, you ca…
0
381
0
@algo_diver
chansung
5 days
RT @Hesamation: the legendary @danielhanchen just made a full 3-hour workshop on reinforcement learning and agents. he goes through RL fu…
0
180
0
@algo_diver
chansung
5 days
RT @casper_hansen_: vLLM is finally addressing a long-standing problem: startup times. 35s -> 2s for CUDA graph capture is a great reductio…
0
43
0
@algo_diver
chansung
5 days
RT @mervenoyann: Now it's possible to do RAG with any-to-any models 🔥 Learn how to search in a video dataset and generate using OmniEmbed,…
0
52
0
@algo_diver
chansung
6 days
RT @sophiamyang: How to train a model that actually understands both audio and text like Voxtral from @MistralAI? Here is a quick video wal…
0
136
0
@algo_diver
chansung
8 days
RT @Wauplin: Big update: Hugging Face Inference Providers now work out of the box with the OpenAI client! Just add the provider name to th…
0
7
0
@algo_diver
chansung
9 days
RT @liliang_ren: We’re open-sourcing the pre-training code for Phi4-mini-Flash, our SoTA hybrid model that delivers 10× faster reasoning th…
github.com
Simple & Scalable Pretraining for Neural Architecture Research - microsoft/ArchScale
0
215
0
@algo_diver
chansung
10 days
RT @sophiamyang: Super excited to announce the latest features in @MistralAI le Chat: 🔍 Deep Research: dive into complex topics with our s…
0
79
0
@algo_diver
chansung
10 days
RT @OfficialLoganK: Veo 3 is now live in the Gemini API 📽️!! Veo 3 is state of the art, can natively generative audio in the videos, come…
0
71
0
@algo_diver
chansung
11 days
RT @NeurIPSConf: NeurIPS is pleased to officially endorse EurIPS, an independently-organized meeting taking place in Copenhagen this year,…
0
113
0
@algo_diver
chansung
12 days
RT @deedydas: Google DeepMind just dropped this new LLM model architecture called Mixture-of-Recursions. It gets 2x inference speed, reduc…
0
452
0
@algo_diver
chansung
12 days
RT @danielhanchen: Highly recommend this Stanford lecture video with @_jasonwei and @hwchung27 :) It's one of my favorites on scaling laws…
0
62
0
@algo_diver
chansung
12 days
RT @MistralAI: Introducing the world's best (and open) speech recognition models!
0
493
0
@algo_diver
chansung
12 days
RT @Teknium1: In case the post was too vague, yes - this is the Hermes 3 dataset. - 1 Million Samples - Created SOTA without the censorship…
0
55
0
@algo_diver
chansung
12 days
RT @EnricoShippole: We open-sourced 99% of US caselaw on @huggingface. Both AI and legal tech companies are selling this data for a high pr…
huggingface.co
0
406
0
@algo_diver
chansung
12 days
RT @m4rkmc: 📣 Gemini CLI roadmap. It has been so cool seeing everyone building with the Gemini CLI (60k ⭐s!), and sharing feedback (1k open…
0
22
0