Julio Gonzalo @JulioGonzalo1 X Profile

Julio Gonzalo

@JulioGonzalo1

Followers

2K

Following

9K

Media

80

Statuses

3K

Researcher in Natural Language Processing & Information Retrieval. PI of https://t.co/zTi4LP6dpk.

https://t.co/hFiALiL0KD

Madrid, Spain

Joined October 2011


Julio Gonzalo

@JulioGonzalo1

11 months

I had the privilege of chatting with @GarciaAller on the occasion of ChatGPT's second birthday, and it was a real pleasure. I hope you enjoy listening to it half as much as I did. https://t.co/jVPUioh1PC

elconfidencial.com

Is it, or is it not, such a big deal? Is it really going to turn the world upside down? Do we know where the limits of AI lie? The new episode of Pausa answers these and other questions

3

6

23

Rohan Paul

@rohanpaul_ai

18 hours

Another bad news for Medical AI. This paper shows that medical LLMs often give different answers to the same hospital question. The core finding is that these tools are unstable for judgment-heavy bedside calls. The team tested 6 models on 4 common inpatient cases where either

81

93

354

Chubby♨️

@kimmonismus

4 days

Microsoft’s latest SEC filing quietly exposed that OpenAI lost around $11.5 billion last quarter, based on Microsoft’s 27% ownership stake and a $3.1 billion hit to its own net income. The filings confirm that Microsoft has funded $11.6B of its $13B commitment to OpenAI, with

104

519

3K

Haider.

@slow_developer

10 days

Francois Chollet says LLMs aren't enough for human-like continual learning They store skills as vector programs learned via gradient descent — not efficient, not adaptive True AGI needs to learn from experience and generalize fast "LLMs can be part of AGI, but not the

61

92

509

Julio Gonzalo

@JulioGonzalo1

14 days

- OpenAI hype: GPT-5 has found solutions to 10 open problems in mathematics!!! - reality: GPT-5 has found the publications that solve 10 problems that were incorrectly marked as "open" in a list on the internet.

NIK

@ns123abc

16 days

OpenAI top researchers got COOKED 💀 > VP of Science at OpenAI misled the public on GPT-5 capabilities > Psyop’d OAI researchers start erratically hyping on Twitter and Reddit > got fact-checked by the literal owner of erdosproblems "This is completely false. You're

0

2

5

Dwarkesh Patel

@dwarkesh_sp

17 days

The @karpathy interview 0:00:00 – AGI is still a decade away 0:30:33 – LLM cognitive deficits 0:40:53 – RL is terrible 0:50:26 – How do humans learn? 1:07:13 – AGI will blend into 2% GDP growth 1:18:24 – ASI 1:33:38 – Evolution of intelligence & culture 1:43:43 - Why self

524

3K

18K

Julio Gonzalo

@JulioGonzalo1

15 days

Even the champions of "LLMs are unreliable" sometimes rely on LLMs and do not appropriately check their outputs. Nobody can resist the LLMs' magic spell all the time: "I'll save you a lot of time and everything will look good".

Michael Saxon

@m2saxon

16 days

They fixed the hallucinated citations! As suspected, it comes from writing the paper with the inline citations alone (to real papers) and then using an LM to translate to LaTeX (including BibTeX). I should be clear, this is categorically NOT the same as whole cloth fake refs.

1

2

Delip Rao e/σ

@deliprao

16 days

💯 this. After spending a good chunk of time on AI and materials science, I can confirm that hard sciences are full of nuance and traps that trip up even the most advanced LLM-based agents. Benchmark performance on Olympiads etc, doesn’t correlate with long-tail science

Christopher D. Long 🇺🇦🏳️‍🌈🌹

@octonion

16 days

Some people seem to think mathematicians and physicists are dumping on AI, but from my perspective we want it to be as usable as possible, and that requires an internal understanding of factuality, correctness and logical coherence. No flawed arguments or hallucinated facts.

1

9

125

Julio Gonzalo

@JulioGonzalo1

16 days

I can't believe it

Michael Saxon

@m2saxon

17 days

The viral new "Definition of AGI" paper has fake citations which do not exist. And it specifically TELLS you to read them! Proof: different articles present at the specified journal/volume/page number, and their titles exist nowhere on any searchable repository.

0

4

Lisan al Gaib

@scaling01

17 days

Andrej Karpathy calls AI Agents slop "Overall, the models they are not there. And I feel like the industry [...] it's making too big of a jump and it's trying to pretend that this is amazing. And it's not—it's slop! And I think they are not coming to terms with it. And maybe

276

690

8K

Luiza Jarovsky, PhD

@LuizaJarovsky

17 days

🚨 SHOCKING: Sam Altman says he "expects some really bad stuff to happen," but it doesn't seem to bother him much. [HINT: OpenAI's legal department is probably FUMING over his comments; make sure to save this clip]: This is a clip from Sam Altman's podcast interview with a16z, a

33

98

298

Arnaud Bertrand

@RnaudBertrand

20 days

Someone asked an AI founder 2 months ago: "What is an example of a decision you've made that is best for the world but not best for winning?" The founder's reply: "Well, we haven't put a sex bot [in our product] yet" The name of that founder? Sam Altman (in this interview:

Sam Altman

@sama

20 days

We made ChatGPT pretty restrictive to make sure we were being careful with mental health issues. We realize this made it less useful/enjoyable to many users who had no mental health problems, but given the seriousness of the issue we wanted to get this right. Now that we have

40

162

1K

François Chollet

@fchollet

26 days

Now it's up to us to refine and scale symbolic AGI to save the world economy before the genAI bubble pops. Tick tock

89

83

1K

Tanishq Mathew Abraham, Ph.D.

@iScienceLuvr

1 month

this continues to reinforce my belief that current frontier models are completely terrible at medical imaging analysis DO NOT TRUST LLM's interpretation of your medical images!!

Dr. Datta M.D. (AIIMS Delhi)

@DrDatta_AIIMS

1 month

🚨 Just published! All frontier AI models have failed “Radiology’s Last Exam” - the toughest benchmark in radiology launched today! ✅ Board-certified radiologists scored 83%, trainees 45%, but the best performing AI from frontier labs, GPT-5, managed only 30%. ❌ These results

21

17

168

Craig Murray

@CraigMurrayOrg

1 month

I write as former Head of Maritime Section of the Foreign and Commonwealth Office and Alternate Head of UK Delegation to the UN Convention on the Law of the Sea Prepcom. 1) The flotilla is on the High Seas and not in Israel's 12 mile territorial sea. Israel has no jurisdiction.

750

15K

27K

Lu

@Lucaswoodland

1 month

So, an AI “band” who cite us as an influence (ie, it’s modelled off our music) have just overtaken us on Spotify, in only TWO months. It’s shocking, it’s disheartening, it’s insulting - most importantly - it’s a wake up call. Oppose AI music, or bands like us stop existing.

455

9K

86K

Julio Gonzalo

@JulioGonzalo1

1 month

This comes from the conversation we had at @EspacioFTef about the role of philosophy and the humanities in the age of AI https://t.co/CGQzv6zB5S with @GarciaAller @gustavodietz @davidefabrizio Enrique Goñi, Héctor Florez, Jorge Ruiz, Mercedes Fernández Martorell.

1

3

Julio Gonzalo

@JulioGonzalo1

1 month

And precisely because they are not technology, the principle that technology is neutral cannot be applied to them. And because they were not designed as tools for solving a specific set of problems, they are very difficult to evaluate.

1

0

1

Julio Gonzalo

@JulioGonzalo1

1 month

I have just realized that generative AIs (ChatGPT & family) may NOT be technology. Like Frankenstein's monster, they were created with technology but aspire to something else: the monster to be a living being, the AIs to be a general intelligence.

1

0

Claude

@claudeai

1 month

Introducing Claude Sonnet 4.5—the best coding model in the world. It's the strongest model for building complex agents. It's the best model at using computers. And it shows substantial gains on tests of reasoning and math.

1K

3K

21K

vas

@vasuman

1 month

https://t.co/wloBcEqb61

61

104

2K