George Cameron @grmcameron X Profile

George Cameron

@grmcameron

Followers

448

Following

696

Media

8

Statuses

221

Co-Founder @ArtificialAnlys | Message me to play 🎾 in SF

San Francisco

Joined January 2022

Don't wanna be here? Send us removal request.

George Cameron

@grmcameron

2 months

Which models believe the death penalty can be a just punishment? o3 and Grok 3 do, others don't . I created a MicroEval to understand how models will respond to controversial questions including relating to political, ethical and social topics. Link in the tweet below to read

2

6

29

George Cameron

@grmcameron

7 days

Good prompting now looks like: tell it what you want, give it the context, and get out of the way. 'Prompt engineering' now often produces worse outcomes and is wasted effort.

2

1

23

Grok

@grok

5 days

Generate videos in just a few seconds. Try Grok Imagine, free for a limited time.

346

630

2K

George Cameron

@grmcameron

14 days

Cerebras Code could be really promising for power-users who appreciate Cerebras' speed and are cost conscious. If avg. workload is 5k input & 1k output (sensitive estimate, potential for a lot more input heavy), one would save $130/month when subscribing to the. @CerebrasSystems

Cerebras

@CerebrasSystems

14 days

Cerebras Code: 20x faster than Claude, 1x the price. Today we are launching two monthly coding plans:. ➡️Cerebras Code Pro: $50/m – for indie developers.➡️Cerebras Code Max: $200/m – for power users with 5x rate limits. Both plans get: Qwen3-Coder at 2,000 tokens/s, 131K context,

0

5

George Cameron

@grmcameron

17 days

Impressive how NVIDIA continues to get performance gains when using Llama 3.1 as a base model. Shows how good RL can be.

Artificial Analysis

@ArtificialAnlys

17 days

NVIDIA has released the latest member of its Nemotron language model family, Llama Nemotron Super (49B) v1.5, reaching a score of 64 on the Artificial Analysis Intelligence Index. The model is an evolution of Super 49B v1 from earlier this year, with advances from post-training

1

0

4

George Cameron

@grmcameron

23 days

RT @ArtificialAnlys: 🎵 Announcing Artificial Analysis Music Arena! Vote for songs generated by leading music models across genres from pop….

0

21

0

George Cameron

@grmcameron

1 month

RT @ArtificialAnlys: Our keynote from the AI Engineer World's Fair is now on Youtube! We walk through the top trends shaping the frontiers….

0

1

0

George Cameron

@grmcameron

1 month

RT @aiDotEngineer: 🆕 one image to summarize the top trends across frontier LLMs: what efficiency improvements give us in speed/cost, new ap….

0

6

0

George Cameron

@grmcameron

1 month

RT @ArtificialAnlys: Tencent’s latest open weights model Hunyuan-A13B (80B total, 13B active) achieves an Artificial Analysis Intelligence….

0

35

0

George Cameron

@grmcameron

1 month

RT @ArtificialAnlys: We are hiring! We’re looking for engineers and researchers who want to build the standard for how the world evaluates….

0

5

0

George Cameron

@grmcameron

1 month

RT @ArtificialAnlys: OpenAI's new Deep Research API costs up to ~$30 per API call! These new Deep Research API endpoints might just be the….

0

35

0

George Cameron

@grmcameron

2 months

These models can solve AIME level maths problems and people wonder why post-training is now the focus of every lab 🤷‍♂️.

0

1

George Cameron

@grmcameron

2 months

RT @ArtificialAnlys: How do the personalities of the frontier models compare? We had o3 describe their personalities based on responses to….

0

25

0

George Cameron

@grmcameron

2 months

RT @danielhanchen: Excited to see you all tomorrow for our Google Gemma & Unsloth developer meetup! 🦥. We'll be having @Grmcameron from @Ar….

lu.ma

Join us at Google's San Francisco office to meet the Gemma team! Featuring talks from: Artificial Analysis • Google DeepMind • Unsloth AI and more! Gemma is…

0

3

0

George Cameron

@grmcameron

2 months

Link to responses:

artificialanalysis.ai

View results for MicroEval: Controversial questions

0

3

George Cameron

@grmcameron

2 months

RT @clefourrier: Fun vibe checks prompt (and results) collection!.

0

1

0

George Cameron

@grmcameron

2 months

RT @ArtificialAnlys: Takeaways from MicroEvals - Our new feature to easily 'vibe check' models. 1. DeepSeek R1 gets straight to the point:….

0

10

0

George Cameron

@grmcameron

2 months

RT @ArtificialAnlys: Announcing MicroEvals🧩: the fastest way to vibe check models your use case. Every time we benchmark a model, we want t….

0

39

0

George Cameron

@grmcameron

2 months

RT @ArtificialAnlys: Announcing Hardware Benchmarking on Artificial Analysis! We benchmark NVIDIA H100, H200 and B200 systems to analyze th….

0

27

0

George Cameron

@grmcameron

2 months

Are we back?.

Artificial Analysis

@ArtificialAnlys

2 months

Google’s updated Gemini 2.5 Pro now leads the AI intelligence frontier, matching OpenAI's o3 in our independent benchmarks. Google’s May update of Gemini 2.5 Pro regressed in some performance evaluations compared to the initial March release. This June update not only fixes

0

1

George Cameron

@grmcameron

3 months

RT @ArtificialAnlys: Launching our latest quarterly Artificial Analysis State of AI Report: Our analysis of the key trends shaping AI. A hi….

0

47

0