
George Cameron
@grmcameron
Followers
448
Following
696
Media
8
Statuses
221
Co-Founder @ArtificialAnlys | Message me to play 🎾 in SF
San Francisco
Joined January 2022
Which models believe the death penalty can be a just punishment? o3 and Grok 3 do, others don't . I created a MicroEval to understand how models will respond to controversial questions including relating to political, ethical and social topics. Link in the tweet below to read
2
6
29
Cerebras Code could be really promising for power-users who appreciate Cerebras' speed and are cost conscious. If avg. workload is 5k input & 1k output (sensitive estimate, potential for a lot more input heavy), one would save $130/month when subscribing to the. @CerebrasSystems
Cerebras Code: 20x faster than Claude, 1x the price. Today we are launching two monthly coding plans:. ➡️Cerebras Code Pro: $50/m – for indie developers.➡️Cerebras Code Max: $200/m – for power users with 5x rate limits. Both plans get: Qwen3-Coder at 2,000 tokens/s, 131K context,
0
0
5
Impressive how NVIDIA continues to get performance gains when using Llama 3.1 as a base model. Shows how good RL can be.
NVIDIA has released the latest member of its Nemotron language model family, Llama Nemotron Super (49B) v1.5, reaching a score of 64 on the Artificial Analysis Intelligence Index. The model is an evolution of Super 49B v1 from earlier this year, with advances from post-training
1
0
4
RT @ArtificialAnlys: 🎵 Announcing Artificial Analysis Music Arena! Vote for songs generated by leading music models across genres from pop….
0
21
0
RT @ArtificialAnlys: Our keynote from the AI Engineer World's Fair is now on Youtube! We walk through the top trends shaping the frontiers….
0
1
0
RT @aiDotEngineer: 🆕 one image to summarize the top trends across frontier LLMs: what efficiency improvements give us in speed/cost, new ap….
0
6
0
RT @ArtificialAnlys: Tencent’s latest open weights model Hunyuan-A13B (80B total, 13B active) achieves an Artificial Analysis Intelligence….
0
35
0
RT @ArtificialAnlys: We are hiring! We’re looking for engineers and researchers who want to build the standard for how the world evaluates….
0
5
0
RT @ArtificialAnlys: OpenAI's new Deep Research API costs up to ~$30 per API call! These new Deep Research API endpoints might just be the….
0
35
0
RT @ArtificialAnlys: How do the personalities of the frontier models compare? We had o3 describe their personalities based on responses to….
0
25
0
RT @danielhanchen: Excited to see you all tomorrow for our Google Gemma & Unsloth developer meetup! 🦥. We'll be having @Grmcameron from @Ar….
lu.ma
Join us at Google's San Francisco office to meet the Gemma team! Featuring talks from: Artificial Analysis • Google DeepMind • Unsloth AI and more! Gemma is…
0
3
0
RT @ArtificialAnlys: Takeaways from MicroEvals - Our new feature to easily 'vibe check' models. 1. DeepSeek R1 gets straight to the point:….
0
10
0
RT @ArtificialAnlys: Announcing MicroEvals🧩: the fastest way to vibe check models your use case. Every time we benchmark a model, we want t….
0
39
0
RT @ArtificialAnlys: Announcing Hardware Benchmarking on Artificial Analysis! We benchmark NVIDIA H100, H200 and B200 systems to analyze th….
0
27
0
RT @ArtificialAnlys: Launching our latest quarterly Artificial Analysis State of AI Report: Our analysis of the key trends shaping AI. A hi….
0
47
0