
Swayam Singh @ ICML'25
@swayaminsync
Followers: 1K
Following: 4K
Media: 590
Statuses: 3K
ML Research @MSFTResearch | Core Maintainer @numpy_team (QuadDType)
living in the moment
Joined April 2021
RT @LocalAI_API: 🔥 New model alert! 🔥 Microsoft NextCoder-32B is now available in LocalAI! 🚀 This code-editing LLM boasts impressive perfo…
RT @HiSohan: 📄 Paper 20/42: "NextCoder: Robust Adaptation of Code LMs to Diverse Code Edits". 🇮🇳 Tushar Aggarwal (Microsoft). LinkedIn: h…
This is really nice, but having no technical report for either model and SWE-Bench as the only benchmark looks a bit suspicious. Don't get me wrong, I have been following Mistral's work for a long time, but it would be more convincing if you opened up about some of the details and development of Devstral.
Introducing Devstral Small and Medium 2507! This latest update offers improved performance and cost efficiency, perfectly suited for coding agents and software engineering tasks.
Our recent work, "NextCoder" is now public. Dropping:
1️⃣ Models with strong code-editing capabilities (7B, 14B, 32B)
2️⃣ Complete training dataset
3️⃣ A clever training algorithm: Selective Knowledge Transfer (SeleKT)
✅ This is just the first phase, with more to come soon.
Excited to share the NextCoder family of SLMs with strong code-editing abilities. Finetuned with Selective Knowledge Transfer (SeleKT) and GitHub/synthetic data. #ICML. GitHub: Azure AI Foundry: HF:
Sir Aditya Gopalan is a humble, intelligent person. We met at the MSR AI Summit and discussed off-policy RL scenarios, methods that work but lack mathematical backing, and efficient sparse model training regimes. He is a genuinely insightful, welcoming individual.
"DPO can give you a policy that is worse than what you started with". @today_itself reveals how the theoretical backing behind one of the most ubiquitous alignment methods breaks down for real-life LLMs, causing unpredictable alignment failures. He then shows how to fix it.