Julio Gonzalo
@JulioGonzalo1
Followers
2K
Following
9K
Media
80
Statuses
3K
Researcher in Natural Language Processing & Information Retrieval. PI of https://t.co/zTi4LP6dpk.
Madrid, Spain
Joined October 2011
Tuve el privilegio de charlar con @GarciaAller con motivo del segundo cumpleaños de ChatGPT y fue una gozada, la verdad. Espero que os lo paséis la mitad de bien que yo escuchándolo. https://t.co/jVPUioh1PC
elconfidencial.com
¿Es, o no es para tanto? ¿De verdad va a poner el mundo patas arriba? ¿Sabemos dónde están los límites de la IA? A estas y otras preguntas responde el nuevo episodio de Pausa
3
6
23
Another bad news for Medical AI. This paper shows that medical LLMs often give different answers to the same hospital question. The core finding is that these tools are unstable for judgment-heavy bedside calls. The team tested 6 models on 4 common inpatient cases where either
81
93
354
Microsoft’s latest SEC filing quietly exposed that OpenAI lost around $11.5 billion last quarter, based on Microsoft’s 27% ownership stake and a $3.1 billion hit to its own net income. The filings confirm that Microsoft has funded $11.6B of its $13B commitment to OpenAI, with
104
519
3K
Francois Chollet says LLMs aren't enough for human-like continual learning They store skills as vector programs learned via gradient descent — not efficient, not adaptive True AGI needs to learn from experience and generalize fast "LLMs can be part of AGI, but not the
61
92
509
- OpenAI hype: GPT-5 ha encontrado soluciones a 10 problemas abiertos en matemáticas!!! - realidad: GPT-5 ha encontrado las publicaciones que resuelven 10 problemas que estaban incorrectamente anotados como "abiertos" en una lista en internet.
OpenAI top researchers got COOKED 💀 > VP of Science at OpenAI mislead the public on GPT-5 capabilities > Psyop’d OAI researchers start erratically hyping on Twitter and Reddit > got fact-checked by the literal owner of erdosproblems "This is completely false. You're
0
2
5
The @karpathy interview 0:00:00 – AGI is still a decade away 0:30:33 – LLM cognitive deficits 0:40:53 – RL is terrible 0:50:26 – How do humans learn? 1:07:13 – AGI will blend into 2% GDP growth 1:18:24 – ASI 1:33:38 – Evolution of intelligence & culture 1:43:43 - Why self
524
3K
18K
Even the champions of "LLMs are unreliable" sometimes rely on LLMs and do not appropriately check their outputs. Nobody can resist LLMs magic spell all the time: "I'll save you a lot of time and everything will look good".
They fixed the hallucinated citations! As suspected, it comes from writing the paper with the inline citations alone (to real papers) and then using an LM to translate to LaTex (including Bibtex) I should be clear, this is categorically NOT the same as whole cloth fake refs.
1
1
2
💯 this. After spending a good chunk of time on AI and materials science, I can confirm that hard sciences are full of nuance and traps that trip up even the most advanced LLM-based agents. Benchmark performance on Olympiads etc, doesn’t correlate with long-tail science
Some people seem to think mathematicians and physicists are dumping on AI, but from my perspective we want it to be usable as possible, and that requires an internal understanding of factuality, correctness and logical coherence. No flawed arguments or hallucinated facts.
1
9
125
Andrej Karpathy calls AI Agents slop "Overall, the models they are not there. And I feel like the industry [...] it's making too big of a jump and it's trying to pretend that this is amazing. And it's not—it's slop! And I think they are not coming to terms with it. And maybe
276
690
8K
🚨 SHOCKING: Sam Altman says he "expects some really bad stuff to happen," but it doesn't seem to bother him much. [HINT: OpenAI's legal department is probably FUMING over his comments; make sure to save this clip]: This is a clip from Sam Altman's podcast interview with a16z, a
33
98
298
Someone asked an AI founder 2 months ago: "What is an example of a decision you've made that is best for the world but not best for winning?" The founder's reply: "Well, we haven't put a sex bot [in our product] yet" The name of that founder? Sam Altman (in this interview:
We made ChatGPT pretty restrictive to make sure we were being careful with mental health issues. We realize this made it less useful/enjoyable to many users who had no mental health problems, but given the seriousness of the issue we wanted to get this right. Now that we have
40
162
1K
Now it's up to us to refine and scale symbolic AGI to save the world economy before the genAI bubble pops. Tick tock
89
83
1K
this continues to reinforce my belief that current frontier models are completely terrible at medical imaging analysis DO NOT TRUST LLM's interpretation of your medical images!!
🚨 Just published! All frontier AI models have failed “Radiology’s Last Exam” - the toughest benchmark in radiology launched today! ✅ Board-certified radiologists scored 83%, trainees 45%, but the best performing AI from frontier labs, GPT-5, managed only 30%. ❌ These results
21
17
168
I write as former Head of Maritime Section of the Foreign and Commonwealth Office and Alternate Head of UK Delegation to the UN Convention on the Law of the Sea Prepcom. 1) The flotilla is on the High Seas and not in Israel's 12 mile territorial sea. Israel has no jurisdiction.
750
15K
27K
So, an AI “band” who cite us as an influence (ie, it’s modelled off our music) have just overtaken us on Spotify, in only TWO months. It’s shocking, it’s disheartening, it’s insulting - most importantly - it’s a wake up call. Oppose AI music, or bands like us stop existing.
455
9K
86K
Esto viene de la conversación que tuvimos en @EspacioFTef sobre el papel de la filosofía y las humanidades en la era de la IA https://t.co/CGQzv6zB5S con @GarciaAller @gustavodietz @davidefabrizio Enrique Goñi, Héctor Florez, Jorge Ruiz, Mercedes Fernández Martorell.
1
3
3
Y precisamente porque no son tecnología, no se les puede aplicar el principio de que la tecnología es neutra. Y por no haber sido diseñadas como herramientas para resolver una serie de problemas concretos, resultan muy difíciles de evaluar.
1
0
1
Me acabo de dar cuenta de que a lo mejor las IA generativas (ChatGPT & familia) NO son tecnología. Al igual que el monstruo de Frankenstein, han sido creadas con tecnología pero aspiran a otra cosa: el monstruo a ser un ser vivo, las IAs a ser una inteligencia general.
1
1
0
Introducing Claude Sonnet 4.5—the best coding model in the world. It's the strongest model for building complex agents. It's the best model at using computers. And it shows substantial gains on tests of reasoning and math.
1K
3K
21K