 
            
              John Gilhuly
            
            @JohnGilhuly
Followers
                575
              Following
                529
              Media
                45
              Statuses
                219
              Field Engineering @ Cursor
              
              SF Bay Area
            
            
              
              Joined June 2024
            
            
           Just left the Cafe for a bit, kudos to @benln for a really awesome event! Truly a co-working and co-building session 💻
          
          
                
                0
              
              
                
                0
              
              
                
                22
              
             GPT-5-Codex is now available in Cursor. Let us know your thoughts! 
          
                
                209
              
              
                
                244
              
              
                
                5K
              
             We've trained a new Tab model that is now the default in Cursor. This model makes 21% fewer suggestions than the previous model while having a 28% higher accept rate for the suggestions it makes. Learn more about how we improved Tab with online RL. 
          
                
                127
              
              
                
                174
              
              
                
                3K
              
             MoE layers can be really slow. When training our coding models @cursor_ai, they ate up 27–53% of training time. So we completely rebuilt it at the kernel level and transitioned to MXFP8. The result: 3.5x faster MoE layer and 1.5x end-to-end training speedup. We believe our 
          
                
                30
              
              
                
                105
              
              
                
                877
              
             cookbook for @cursor_ai cli is added with examples of
             - auto fixing ci failures
             - updating docs
             - secrets scanner
             - automatic i18n
          
                
                9
              
              
                
                12
              
              
                
                138
              
             Cursor CLI now includes MCPs, Review Mode, /compress, @-files, and other UX improvements. 
          
                
                79
              
              
                
                121
              
              
                
                1K
              
             To help you along with the GPT-5 release (and free initial usage in @cursor_ai!), check out this model prompting guide from @ericzakariasson, Anoop Kotha and Julian Lee  https://t.co/DQZVs395ua 
          
          
            
            cookbook.openai.com
              GPT-5, our newest flagship model, represents a substantial leap forward in agentic task performance, coding, raw intelligence, and steera...
            
                
                1
              
              
                
                1
              
              
                
                34
              
             GPT-5 is really strong, it's one of the few models I don't need to switch off for certain tasks. But I'm really just happy to make it through a week of live demos without spilling the beans 
           GPT-5 is now available in Cursor. It's the most intelligent coding model our team has tested. We're launching it for free for the time being. Enjoy! 
          
                
                0
              
              
                
                1
              
              
                
                4
              
             Cursor 1.4 is out with a significantly more capable agent. It's now much better at challenging and long-running tasks, especially in large codebases. We've also given the agent better tools, made token usage more efficient, and improved code editing accuracy. 
          
                
                202
              
              
                
                248
              
              
                
                6K
              
             Cursor 1.3 is out! You can now collaborate with Agent in your terminal, clearly see context window usage, and make faster edits. 
          
                
                189
              
              
                
                248
              
              
                
                3K
              
             In the past month, Cursor found 1M+ bugs in human-written PRs. Over half were real logic issues that were fixed before merging. Today, we're releasing the system that spotted these bugs. It's already become a required pre-merge check for many teams. 
          
                
                139
              
              
                
                176
              
              
                
                3K
              
             Ever wonder if your agent's actually getting it right over a whole convo, not just one step? New Session-Level Evals in Arize AX let you do exactly that by measuring: Coherence across the session 🧩 Context retention across turns 🎯 Whether users actually reach their goals 
          
                
                1
              
              
                
                2
              
              
                
                5
              
             In case you missed some big news from Arize Observe 2025: Phoenix Cloud just leveled up with Spaces & Access Management ✨ You can now create multiple, tailored Phoenix Spaces for your team and projects. Easily manage user permissions in each space. Zero-hassle team 
          
                
                1
              
              
                
                4
              
              
                
                10
              
             Today's the day! Arize Observe just kicked off, and it's bringing a whole set of new product announcements. From Agent-powered trace debugging to new Prompt Learning techniques, we've got it all! Announcements in the thread below 🧵 
          
                
                1
              
              
                
                6
              
              
                
                18
              
             Observe 2025 kicked off with a packed keynote. We just dropped a stack of new features across Phoenix. Here's what's new: 
          
                
                1
              
              
                
                7
              
              
                
                13
              
             How do you evaluate a whole crew of AI agents, not just a single one? With @JohnGilhuly from @ArizeAI, we created an example demonstrating how to build a multi-agent system using CrewAI, develop a reference dataset for ideal task sequences and use Vertex AI's Gen AI Eval and 
          
                
                1
              
              
                
                3
              
              
                
                8
              
             New visualizations to track your experiment evals and latency in @ArizePhoenix. We've made it easy to clearly see how your experiments evolve over time. This has already saved me time I would've spent on manual digging. I can clearly see how performance shifts & more 
          
                
                0
              
              
                
                5
              
              
                
                15
              
             Okay, all you @cursor_ai fans out here. Imagine Cursor, but debugging across all instantiations of observability (traces/sessions), evals, and iterations. It's going to be a good year for @arizeai
          
          
                
                2
              
              
                
                3
              
              
                
                8
              
             
               
               
             
             
             
             
             
            