
Mark Vero
@mark_veroe
Followers: 43
Following: 112
Media: 4
Statuses: 49
PhD Student @ ETH Zürich @the_sri_lab
Joined March 2012
RT @j_dekoninck: Thrilled to share a major step forward for AI for mathematical proof generation! We are releasing the Open Proof Corpus:….
0
21
0
RT @ni_jovanovic: There's a lot of work now on LLM watermarking. But can we extend this to transformers trained for autoregressive image ge….
0
54
0
RT @lbeurerkellner: 😈 BEWARE: Claude 4 + GitHub MCP will leak your private GitHub repositories, no questions asked. We discovered a new at….
0
496
0
RT @mimicrobotics: The Zurich Builds x @mimicrobotics x @loki_robotics x @OpenAI is ongoing! Looking forward to some insane demos! @arnie_….
0
21
0
If you are at ICLR, come by our posters in the DL4C (Garnet 218-219) and BuildingTrust (Hall 4 #6) workshops. We have now evaluated over 30 models, including Grok 3, Gemini Pro, and o3. None of them is ready for you to give in to pure vibe coding.
💡 Hype vs Reality: can LLMs generate production-level code such as backends? Turns out, no. ⚠️ Using our new framework BaxBench, we show that even the best LLMs generate correct code only ~60% of the time. More alarmingly, >50% of their code is susceptible to security exploits!
0
3
8
RT @ni_jovanovic: Monday, Building Trust & DL4C workshops: BaxBench, a great recent work where we study the (in)ability of LLMs to write ba….
0
1
0
RT @the_sri_lab: SRI Lab is proud to present 5 of our works on AI Security and Privacy at @iclr_conf main conference. Looking forward to se….
0
4
0
RT @mbalunovic: I am at ICLR 2025 🇸🇬, reach out if you would like to chat about AI for math, reasoning, or math evals we are doing at MathA….
0
2
0