George Cameron

@grmcameron

Followers
448
Following
696
Media
8
Statuses
221

Co-Founder @ArtificialAnlys | Message me to play 🎾 in SF

San Francisco
Joined January 2022
@grmcameron
George Cameron
2 months
Which models believe the death penalty can be a just punishment? o3 and Grok 3 do; others don't. I created a MicroEval to understand how models respond to controversial questions, including those on political, ethical and social topics. Link in the tweet below to read
2
6
29
@grmcameron
George Cameron
7 days
Good prompting now looks like: tell it what you want, give it the context, and get out of the way. 'Prompt engineering' now often produces worse outcomes and is wasted effort.
2
1
23
@grmcameron
George Cameron
14 days
Cerebras Code could be really promising for power users who appreciate Cerebras' speed and are cost-conscious. If the average workload is 5k input & 1k output (sensitive estimate, potential for a lot more input-heavy), one would save $130/month when subscribing to the. @CerebrasSystems
@CerebrasSystems
Cerebras
14 days
Cerebras Code: 20x faster than Claude, 1x the price. Today we are launching two monthly coding plans: ➡️ Cerebras Code Pro: $50/m – for indie developers. ➡️ Cerebras Code Max: $200/m – for power users with 5x rate limits. Both plans get: Qwen3-Coder at 2,000 tokens/s, 131K context,
0
0
5
@grmcameron
George Cameron
17 days
Impressive how NVIDIA continues to get performance gains when using Llama 3.1 as a base model. Shows how good RL can be.
@ArtificialAnlys
Artificial Analysis
17 days
NVIDIA has released the latest member of its Nemotron language model family, Llama Nemotron Super (49B) v1.5, reaching a score of 64 on the Artificial Analysis Intelligence Index. The model is an evolution of Super 49B v1 from earlier this year, with advances from post-training
1
0
4
@grmcameron
George Cameron
23 days
RT @ArtificialAnlys: 🎵 Announcing Artificial Analysis Music Arena! Vote for songs generated by leading music models across genres from pop…
0
21
0
@grmcameron
George Cameron
1 month
RT @ArtificialAnlys: Our keynote from the AI Engineer World's Fair is now on YouTube! We walk through the top trends shaping the frontiers…
0
1
0
@grmcameron
George Cameron
1 month
RT @aiDotEngineer: 🆕 one image to summarize the top trends across frontier LLMs: what efficiency improvements give us in speed/cost, new ap…
0
6
0
@grmcameron
George Cameron
1 month
RT @ArtificialAnlys: Tencent’s latest open weights model Hunyuan-A13B (80B total, 13B active) achieves an Artificial Analysis Intelligence…
0
35
0
@grmcameron
George Cameron
1 month
RT @ArtificialAnlys: We are hiring! We’re looking for engineers and researchers who want to build the standard for how the world evaluates…
0
5
0
@grmcameron
George Cameron
1 month
RT @ArtificialAnlys: OpenAI's new Deep Research API costs up to ~$30 per API call! These new Deep Research API endpoints might just be the…
0
35
0
@grmcameron
George Cameron
2 months
These models can solve AIME-level maths problems and people wonder why post-training is now the focus of every lab 🤷‍♂️
0
0
1
@grmcameron
George Cameron
2 months
RT @ArtificialAnlys: How do the personalities of the frontier models compare? We had o3 describe their personalities based on responses to…
0
25
0
@grmcameron
George Cameron
2 months
RT @danielhanchen: Excited to see you all tomorrow for our Google Gemma & Unsloth developer meetup! 🦥 We'll be having @Grmcameron from @Ar…
lu.ma
Join us at Google's San Francisco office to meet the Gemma team! Featuring talks from: Artificial Analysis • Google DeepMind • Unsloth AI and more! Gemma is…
0
3
0
@grmcameron
George Cameron
2 months
RT @clefourrier: Fun vibe checks prompt (and results) collection!
0
1
0
@grmcameron
George Cameron
2 months
RT @ArtificialAnlys: Takeaways from MicroEvals - Our new feature to easily 'vibe check' models. 1. DeepSeek R1 gets straight to the point:…
0
10
0
@grmcameron
George Cameron
2 months
RT @ArtificialAnlys: Announcing MicroEvals🧩: the fastest way to vibe check models for your use case. Every time we benchmark a model, we want t…
0
39
0
@grmcameron
George Cameron
2 months
RT @ArtificialAnlys: Announcing Hardware Benchmarking on Artificial Analysis! We benchmark NVIDIA H100, H200 and B200 systems to analyze th…
0
27
0
@grmcameron
George Cameron
2 months
Are we back?
@ArtificialAnlys
Artificial Analysis
2 months
Google’s updated Gemini 2.5 Pro now leads the AI intelligence frontier, matching OpenAI's o3 in our independent benchmarks. Google’s May update of Gemini 2.5 Pro regressed in some performance evaluations compared to the initial March release. This June update not only fixes
0
0
1
@grmcameron
George Cameron
3 months
RT @ArtificialAnlys: Launching our latest quarterly Artificial Analysis State of AI Report: Our analysis of the key trends shaping AI. A hi…
0
47
0