Florian
@FlorianCaesar
Followers: 346
Following: 16K
Media: 36
Statuses: 930
programming the universe @destacksh | organize @zurichnlp
Zurich, Switzerland
Joined February 2014
Announcing the Beta release of ty: an extremely fast type checker and language server for Python, written in Rust. We now use ty exclusively in our own projects and are ready to recommend it to motivated users. 10x, 50x, even 100x faster than existing type checkers and LSPs.
95
287
3K
🔉 Introducing SAM Audio, the first unified model that isolates any sound from complex audio mixtures using text, visual, or span prompts. We’re sharing SAM Audio with the community, along with a perception encoder model, benchmarks and research papers, to empower others to
205
891
6K
Introducing the Devstral 2 coding model family. Two sizes, both open source. Also, meet Mistral Vibe, a native CLI, enabling end-to-end automation. 🧵
144
452
3K
Help me help them out gang!
We're so close to 1000, but we just reached 2000 on LinkedIn (that ratio is concerning, yes). Please @giffmana need your help here :D
6
1
96
Gemini 3 Deep Think is here. Deep Think is our most advanced reasoning mode that explores multiple hypotheses simultaneously to give you an even more sophisticated output.
412
791
6K
Introducing the Mistral 3 family of models: Frontier intelligence at all sizes. Apache 2.0. Details in 🧵
175
831
5K
🏆 World-Leading Reasoning 🔹 V3.2: Balanced inference vs. length. Your daily driver at GPT-5 level performance. 🔹 V3.2-Speciale: Maxed-out reasoning capabilities. Rivals Gemini-3.0-Pro. 🥇 Gold-Medal Performance: V3.2-Speciale attains gold-level results in IMO, CMO, ICPC World
73
306
3K
Opus 4.5 scores the same on FrontierMath regardless of thinking budget, in contrast to GPT-5.1 where higher reasoning settings correspond to higher scores. However, on OTIS Mock AIME, another math benchmark, we see the thinking budget make a difference for Opus 4.5 as well.
3
3
80
Opus 4.5 (Thinking, 64k) on ARC-AGI Semi-Private Eval - ARC-AGI-1: 80.00%, $1.47/task - ARC-AGI-2: 37.64%, $2.40/task New SOTA for released frontier models from @AnthropicAI
52
130
1K
Introducing Claude Opus 4.5: the best model in the world for coding, agents, and computer use. Opus 4.5 is a step forward in what AI systems can do, and a preview of larger changes to how work gets done.
1K
2K
19K
We have much more planned for ZurichAI. Soon!
Thank you to the 100+ people who showed up for tonight's ZurichCV#11 at the @ETH_AI_Center. ZurichNLP#18 on Monday!
0
0
2
We just dropped Nano Banana Pro, built on Gemini 3. 🍌 With state-of-the-art text rendering, vast world knowledge and studio-quality creative controls, Gemini 3 Pro Image can create and edit more complex visuals, infographics and more. Here’s what’s under the hood. 🧵
168
604
4K
Today we at @OpenAI are releasing GPT-5.1-Codex-Max, which can work autonomously for more than a day over millions of tokens. Pretraining hasn't hit a wall, and neither has test-time compute. Congrats to my teammates @kevinleestone & @mikegmalek for helping to make it possible!
116
275
3K
Gemini 3 Pro takes the crown on LisanBench - it scores 2.2x higher than GPT-5 while using 2.4x fewer reasoning tokens - it has the highest score on 23 out of 50 words - Grok-4 is the only model that can keep up
41
21
441
Also, we're exploring new ways to create games using Gemini 3 - we've created and published a few experimental YouTube Playables. Each of the games shown in the video below was created by Gemini 3 from scratch w/a few natural language prompts/images. Try them out at
84
205
2K
Big news: Replicate is joining Cloudflare. Replicate's going to carry on as a distinct brand, and all that will happen is that it’s going to get way better: it’ll be faster, we’ll have more resources, and it’ll integrate with the rest of Cloudflare’s Developer Platform.
264
169
2K
SIMA 2 is our most capable AI agent for virtual 3D worlds. 👾🌐 Powered by Gemini, it goes beyond following basic instructions to think, understand, and take actions in interactive environments – meaning you can talk to it through text, voice, or even images. Here’s how 🧵
398
1K
7K
We built a web app that lets you fly a spaceship through a 3D constellation of music - powered by our Lyria RealTime model. 🎶 Space DJ is an interactive visualization where every star represents a different music genre. As you explore, your path is translated into prompts for
99
273
2K