Explore tweets tagged as #AIbenchmarks
@cloudbooklet
Artificial Intelligence
15 days
šŸ”„GPT-5 isn’t ā€œjust another model.ā€ šŸ‘€šŸ‘€. It’s a legit code assistant now. 90%+ accuracy. Next-gen pair programming is here. Too much power… or just enough? šŸ’¬. #AI #OpenAI #GPT5 #ChatGPT #GrokAI #xAI #NotebookLM #Anthropic #AIBenchmarks
Tweet media one
2
0
3
@Junior89858253
William Tazzledock
1 year
Tweet media one
1
0
0
@MingSHampton1
Ming S Hampton
1 month
Ming Calls Grok-4’s API the AGI Holy Grail! Game-Changer or Overhyped?.#ARCAGI #AGIModels #Grok4 #AIbenchmarks #ArtificialIntelligence #MachineLearning #TechReview #AICommunity #Innovation #FutureOfAI
0
0
0
@Junior89858253
William Tazzledock
1 year
Today's brand new Claude 3.5 sonnet makes great ascii art! It made a terrific alien (See screenshot) .#claude35sonnet #claude35 #claude3 #claude #chatbot #aibenchmarks #chatgpt4o #gpt4 #GPT5 #intelligence #screenshot #ascii #asciiart #alienart #alien
Tweet media one
0
0
0
@aiartgallerie
aiartgallerie
1 year
OpenAI launches SWE-bench Verified, a human-validated subset of the popular SWE-bench AI benchmark for evaluating software engineering abilities. GPT-4's score more than doubles!. šŸ“ˆHow will this impact AI development in software engineering?. #AIBenchmarks #SoftwareEngineering
Tweet media one
1
0
2
@_Anshuman_Jha
Anshuman Jha
21 days
The K Prize shows AI isn’t coding-genius level (yet). Winner scored just 7.5%. AI’s still got a way to go before it replaces devs. #KPrize #AIcoding #PromptEngineering #AIbenchmarks
0
0
0
@Junior89858253
William Tazzledock
10 months
Tweet media one
0
1
1
@Junior89858253
William Tazzledock
8 months
Tweet media one
Tweet media two
Tweet media three
Tweet media four
0
0
1
@JohnSmit00001
John Smit
1 month
1/2 šŸš€ A new open-source leader has emerged — meet Kimi K2, boasting a massive 1 trillion parameters!.#KimiK2 #AI #OpenSourceAI #NeuralNetworks #MachineLearning #CodeGeneration #AInews #Claude4 #GPT4 #AIbenchmarks #TechNews #AI
Tweet media one
1
0
4
@StartupHakk
StartupHakk
2 months
0
0
0
@alby13
alby13
8 months
"You need to have these very hard tasks which produce undeniable evidence. And that's how the field is making progress today, because we have these hard benchmarks, which represent true progress. And this is why we're able to avoid endless debate." #AIbenchmarks. -Ilya Sutskever
Tweet media one
0
1
2
@Junior89858253
William Tazzledock
1 year
Chatgpt 4o is failing again this morning tho it was giving the correct answer last night. What's going on? (see screenshot).#chatgpt4o #textprediction #openai #chatgpt4omini.#gpt4o #gpt4omini #samaltman.#aibenchmarks #mathwhiz #projectstrawberry #strawberry
Tweet media one
0
0
0
@WBuzzer
WinBuzzer
1 month
Study: AI Benchmarks Deeply Flawed, Can Overestimate Performance by 100%. #AI #AIBenchmarks #ChatGPT \Google#LMArena #Research.
Tweet media one
0
1
2
@WBuzzer
WinBuzzer
25 days
Alibaba’s Qwen 2.5 AI Faces MAth ā€˜Cheating’ Allegations Over Contaminated Benchmark Data. #AI #Alibaba #Qwen #AIBenchmarks #DataContamination #MachineLearning.
Tweet media one
0
1
2
@WBuzzer
WinBuzzer
2 months
Mistral Enters AI Reasoning Race with Magistral Model, But Benchmarks Reveal a Gap. #AI #MistralAI #Magistral #ReasoningAI #LLM #OpenSourceAI #AIBenchmarks.
Tweet media one
0
1
1
@svicpodcast
SVIC Podcast
8 months
0
0
0
@suvodeep_dev
Suvodeep
3 months
Claude 4 is here—and it's a powerhouse. Outperforms GPT-4 and Gemini 2.5 in reasoning, coding, and long-context tasks. Fast, smart, and ready. #Claude4 #AIbenchmarks
Tweet media one
4
0
1
@WBuzzer
WinBuzzer
1 month
Former Intel CEO Pat Gelsinger Unveils AI Benchmark to Measure Alignment for "Human Flourishing". #AI #AIEthics #AISafety #PatGelsinger #AIBenchmarks #HumanFlourishing.
Tweet media one
0
1
1
@edgarcarmenatty
Dr. Edgar Carmenatty
4 months
0
0
0