John Gilhuly
@JohnGilhuly
Followers
575
Following
529
Media
45
Statuses
219
Field Engineering @ Cursor
SF Bay Area
Joined June 2024
Just left the Cafe for a bit, kudos to @benln for a really awesome event! Truly a co-working and co-building session ๐ป
0
0
22
GPT-5-Codex is now available in Cursor. Let us know your thoughts!
209
244
5K
We've trained a new Tab model that is now the default in Cursor. This model makes 21% fewer suggestions than the previous model while having a 28% higher accept rate for the suggestions it makes. Learn more about how we improved Tab with online RL.
127
174
3K
MoE layers can be really slow. When training our coding models @cursor_ai, they ate up 27โ53% of training time. So we completely rebuilt it at the kernel level and transitioned to MXFP8. The result: 3.5x faster MoE layer and 1.5x end-to-end training speedup. We believe our
30
105
877
cookbook for @cursor_ai cli is added with examples of - auto fixing ci failures - updating docs - secrets scanner - automatic i18n
9
12
138
Cursor CLI now includes MCPs, Review Mode, /compress, @-files, and other UX improvements.
79
121
1K
To help you along with the GPT-5 release (and free initial usage in @cursor_ai!), check out this model prompting guide from @ericzakariasson, Anoop Kotha and Julian Lee https://t.co/DQZVs395ua
cookbook.openai.com
GPT-5, our newest flagship model, represents a substantial leap forward in agentic task performance, coding, raw intelligence, and steera...
1
1
34
GPT-5 is really strong, it's one of the few models I don't need to switch off for certain tasks. But I'm really just happy to make it through a week of live demos without spilling the beans
GPT-5 is now available in Cursor. Itโs the most intelligent coding model our team has tested. We're launching it for free for the time being. Enjoy!
0
1
4
Cursor 1.4 is out with a significantly more capable agent. Itโs now much better at challenging and long-running tasks, especially in large codebases. Weโve also given the agent better tools, made token usage more efficient, and improved code editing accuracy.
202
248
6K
Cursor 1.3 is out! You can now collaborate with Agent in your terminal, clearly see context window usage, and make faster edits.
189
248
3K
In the past month, Cursor found 1M+ bugs in human-written PRs. Over half were real logic issues that were fixed before merging. Today, we're releasing the system that spotted these bugs. It's already become a required pre-merge check for many teams.
139
176
3K
Ever wonder if your agentโs actually getting it right over a whole convo, not just one step? New Session-Level Evals in Arize AX let you do exactly that by measuring: ๐ Coherence across the session ๐งฉ Context retention across turns ๐ฏ Whether users actually reach their goals
1
2
5
In case you missed some big news from Arize Observe 2025: Phoenix Cloud just leveled up with Spaces & Access Management โจYou can now create multiple, tailored Phoenix Spaces for your team and projects ๐ Easily manage user permissions in each space ๐ฅ Zero-hassle team
1
4
10
Today's the day!๐ Arize Observe just kicked off, and it's bringing a whole set of new product announcements. From Agent-powered trace debugging to new Prompt Learning techniques, we've got it all! Announcements in the thread below ๐งต ๐
1
6
18
๐ Observe 2025 kicked off with a packed keynote We just dropped a stack of new features across Phoenix Hereโs whatโs new ๐
1
7
13
How do you evaluate a whole crew of AI agents, not just a single one? ๐ค With @JohnGilhuly from @ArizeAI, we created an example demonstrating how to build a multi-agent system using CrewAI, develop a reference dataset for ideal task sequences and use Vertex AI's Gen AI Eval and
1
3
8
New visualizations to track your experiment evals and latency in @ArizePhoenix ๐๐ We've made it easy to clearly see how your experiments evolve over time. This has already saved me time I would've spent on manual digging. I can clearly see how performance shifts & more
0
5
15
Okay, all you @cursor_ai fans out here Imagine Cursor, but debugging across all instantiations of observability (traces/sessions), evals, and iterations It's going to be a good year for @arizeai
2
3
8