
Martin Vechev
@mvechev
Followers
2K
Following
275
Media
18
Statuses
195
Professor of Computer Science, ETH Zurich. Founder of INSAIT (https://t.co/bqKTA6e8X0). Works on Safe/Secure AI, LLMs, Quantum. Co-founder of 6 Deep-Tech start-ups.
Joined June 2012
RT @INSAITinstitute: 🤝We are delighted to announce that INSAIT is starting a joint research program with the MIT Computer Science and Artif….
0
3
0
RT @INSAITinstitute: 🌐 We are delighted to announce the launch of a new 1 million USD joint research program between INSAIT and the MIT Com….
0
2
0
RT @j_dekoninck: Thrilled to share a major step forward for AI for mathematical proof generation! . We are releasing the Open Proof Corpus:….
0
20
0
Thrilled to share that Snyk (@snyksec), a leader in cybersecurity, has acquired our AI spin-off @InvariantLabsAI, a year after launch! 🚀. Co-founded with @florian_tramer and PhDs from my lab, Invariant built a SOTA safeguard platform for securing AI agents. Congrats to all!
2
2
23
RT @ni_jovanovic: There's a lot of work now on LLM watermarking. But can we extend this to transformers trained for autoregressive image ge….
0
54
0
RT @mbalunovic: Two updates from MathArena: .- DeepSeek-R1-0528 shows strong performance very close to top closed source models on all comp….
0
8
0
RT @eucopresident: Inspiring visit to @INSAITinstitute at @SofiaTechPark, the first institute of its kind in Eastern Europe. Its cutting-….
0
23
0
RT @INSAITinstitute: 🇪🇺 🇧🇬 Today, António Costa @eucopresident, visited INSAIT during his official visit to Bulgaria. The visit was also at….
0
2
0
RT @INSAITinstitute: 🚀 We are delighted to announce MamayLM, a new state-of-the-art efficient Ukrainian LLM!. 📈 MamayLM surpasses all simil….
0
2
0
RT @lbeurerkellner: 🔴 New MCP attack leaks WhatsApp messages via MCP, side-stepping WhatsApp security. 1/n. We show a new MCP attack that l….
0
187
0
RT @mbalunovic: After many requests, we’ve evaluated Grok 3 on the USAMO 2025. The results are in: Grok 3 is tied with DeepSeek-R1 for the….
0
27
0
RT @mbalunovic: Big update to our MathArena USAMO evaluation: Gemini 2.5 Pro, which was released *the same day* as our benchmark, is the fi….
0
146
0
RT @florian_tramer: Designing a network of interconnected agents and servers will be a security nightmare if we don't first fix prompt inje….
0
2
0
RT @mbalunovic: Can LLMs actually solve hard math problems? Given the strong performance at AIME, we now go to the next tier: our MathArena….
0
85
0
RT @the_sri_lab: Claude 3.7 is making waves, being featured in impressive demos and showing strong results on common benchmarks. But can it….
0
4
0
RT @the_sri_lab: 💡 Hype vs Reality: can LLMs generate production-level code such as backends? Turns out, no. ⚠️ Using our new framework Ba….
0
10
0
RT @mbalunovic: MathArena results for HMMT Feb 2025 are out, showing that high school math competitions are still far from being solved by….
0
3
0
AIME II 2025 LLM results are here! Check them out :) Great work by students in the group from @the_sri_lab at @ETH_en and @INSAITinstitute in Sofia: @mbalunovic @IvoPetrov01 @j_dekoninck, @ni_jovanovic.
Results of the second part of AIME 2025 are live on Another convincing win for @openai's o3-mini 🥇.Great work by the entire MathArena team: @j_dekoninck, @ni_jovanovic and @IvoPetrov01!
0
1
7
Exciting stuff by @INSAITinstitute and @the_sri_lab at @ETH_en, @ETH_AI_Center, comparing modern models, reasoning and regular, on the AIME I 2025 math comp that just finished. Big gap between R1 and O1, unlike AIME 2024. Check it out:
0
2
15