Siva Reddy @sivareddyg X Profile

Siva Reddy

@sivareddyg

Followers: 6K

Following: 8K

Media: 119

Statuses: 2K

Assistant Professor @Mila_Quebec @McGillU @ServiceNowRSRCH; Postdoc @StanfordNLP; PhD @EdinburghNLP; Natural Language Processor #NLProc

https://t.co/aOn1xWUhqS

Montreal, QC, Canada

Joined July 2009

ServiceNow AI Research

@ServiceNowRSRCH

5 days

1/5 🚀Apriel-1.6-15B-Thinker: a 15B multimodal reasoner scoring 57 on the Artificial Analysis Intelligence Index - approaching the performance of ~200B-scale frontier models while remaining an order of magnitude smaller. 🧠Model weights: https://t.co/GE22SOIBfT 📄Blog:

9

53

213

Ashish Vaswani

@ashVaswani

9 days

We are beyond thrilled to share our first flagship models, Rnj-1 base and instruct 8B parameter models. Rnj-1 is the culmination of 10 months of hard work by a phenomenal team, dedicated to advancing American SOTA OSS AI. Lots of wins with Rnj-1. 1. SWE bench performance close

Essential AI

@essential_ai

9 days

Today, we’re excited to introduce Rnj-1, @essential_ai's first open model; a world-class 8B base + instruct pair, built with scientific rigor, intentional design, and a belief that the advancement and equitable distribution of AI depend on building in the open. We bring

101

171

2K

Amine Elhattami

@amine_elhattami

9 days

Introducing WebArena Verified — an audit of all 812 tasks with robust, offline, stack-agnostic eval, https://t.co/j0rJf1K7wL Noise 🚮 → stronger agents 📈, weaker 📉, verbose ones 📈 with JSON format. New: 📦 ~70% leaner Docker envs 🔥 Hard subset (258) for fast/focused evals

4

11

58

Azalia Mirhoseini

@Azaliamirh

12 days

Thrilled to share that @annadgoldie and I are launching @RicursiveAI, a frontier lab enabling recursive self-improvement through AIs that design their own chips. Our vision for transforming chip design began with AlphaChip, an AI for layout optimization used to design four

wsj.com

Founded by ex-Google researchers, the company raised $35 million with backing from Sequoia to automate chip design.

Ricursive Intelligence

@RicursiveAI

12 days

Introducing Ricursive Intelligence, a frontier AI lab enabling a recursive self-improvement loop between AI and the chips that fuel it. Learn more at https://t.co/cSpbrQwwEn

123

137

1K

Aditi Khandelwal

@Aditi184

11 days

Excited to attend my first NeurIPS and present my work on multilingual routing in MoEs at @WiMLworkshop! If you’re interested in MoEs or multilinguality, I would love to chat. Feel free to DM!

Mila - Institut québécois d'IA

@Mila_Quebec

11 days

Mila was proud to sponsor the 2025 @WiMLworkshop held on December 2 during the @NeurIPSConf in San Diego. As part of this initiative, we have provided a scholarship enabling four Mila PhD candidates to attend the Conference as well as the WiML workshop. This support aims to

0

2

16

Kangwook Lee

@Kangwook_Lee

19 days

LLM as a judge has become a dominant way to evaluate how good a model is at solving a task, since it works without a test set and handles cases where answers are not unique. But despite how widely this is used, almost all reported results are highly biased. Excited to share our

45

176

1K

Michael Rizvi-Martel

@frisbeemortel

2 months

Is there such a thing as too many agents in multi-agent systems? It depends! 🧵 Our work reveals 3 distinct regimes where communication patterns differ dramatically. More on our findings below 👇 (1/7)

1

11

29

Paul Vicol

@PaulVicol

19 days

🚀Introducing TMLR Beyond PDF! 🎬This is a new, HTML-based submission format for TMLR that supports interactive figures and videos, along with the usual LaTeX and images. 🎉Thanks to TMLR Editors in Chief @hugo_larochelle @thegautamkamath @NailaMurray Nihar B. Shah @lcharlin!

11

39

200

Yu Su

@ysu_nlp

19 days

Life update: I moved to Silicon Valley to tackle agents' biggest challenges: plasticity and reliability. Today's agents are smart but brittle. They lack plasticity (continual learning and adaptation) and reliability (stable, predictable behavior with bounded failures). These two

40

43

421

Vered Shwartz

@VeredShwartz

20 days

I just published "AI Language Technologies are Powerful—But Not Without Limits" https://t.co/81ifJ4gnnK via @CUPAcademic

cambridgeblog.org

Imagine waking up in the morning. You read your emails with the morning coffee and use Gmail’s autocomplete feature to compile the answers. Before leaving the house, you ask Siri for the weather...

0

4

17

Cohere Labs

@Cohere_Labs

23 days

🚨 Coming up - Research Connections event on Wednesday, November 26th! Are you interested in building interpretable, trustworthy language models and user-centric AI? Or learning about the biases, safety and cultural alignment of language models? Come meet @lasha_nlp and

2

12

43

Michael Hahn

@mhahn29

24 days

We’re hiring PhD students and postdocs on LLM theory and interpretability! Topics: 1️⃣ abilities & limitations of transformers and other architectures; 2️⃣ LLM interpretability; 3️⃣ foundations of LLM reasoning; 4️⃣ foundations of AI safety.

13

93

623

Sai Rajeswar

@RajeswarSai

23 days

Will be heading to @NeurIPSConf. If you'll be around and are interested in advancing multimodal reasoning or RL environments, or want to vent about ICLR reviews, let's connect ☕ 🧩 AlignVLM: https://t.co/1SGCelYjEO 🎨 RLRF: https://t.co/9CpBVRbzfn 🖼️ EARL: https://t.co/n9lA4N2Xbq

3

5

13

Torsten Scholak

@tscholak

24 days

🚀 Introducing Apriel-H1: a family of seven 15B hybrid models (Transformer + Mamba) distilled directly from the Apriel-Nemotron-15B-Thinker reasoner. ✅ Navigating the throughput-performance tradeoff with up to 3.4x speedup ✅ 2x speedup without performance loss ✅ Efficient distillation

5

35

110

Hanna Hajishirzi

@HannaHajishirzi

25 days

Introducing Olmo 3 and our entire model flow to build Olmo 3-Think and Olmo3-Instruct. Strong results, big improvements. Massive shoutout to the team who made it happen. Lots of exciting new things come with this release:

Ai2

@allen_ai

25 days

Announcing Olmo 3, a leading fully open LM suite built for reasoning, chat, & tool use, and an open model flow—not just the final weights, but the entire training journey. Best fully open 32B reasoning model & best 32B base model. 🧵

6

12

116

Jordan Boyd-Graber

@boydgraber

25 days

@niloofar_mire @sivareddyg @seraphinagt @NishantBalepur Okay, reupload here: https://t.co/7pcRXMu0u3 I learned about Premiere's audio normalization, which I should have known about before, thanks for the impetus!

0

1

7

Ian Goodfellow

@goodfellow_ian

26 days

Amazing test of Gemini 3’s multimodal reasoning capabilities: try generating a threejs voxel art scene using only an image as input. Prompt: I have provided an image. Code a beautiful voxel art scene inspired by this image. Write threejs code as a single-page

82

258

3K

Dhruv Batra

@DhruvBatra_

25 days

Introducing Yutori Navigator. 31 years ago, the modern web era began with Netscape Navigator. Today, we’re introducing Yutori Navigator — a web agent that autonomously navigates websites on its own cloud browser to complete tasks for you. Navigator achieves Pareto-domination

28

47

247

Siva Reddy

@sivareddyg

25 days

Jieyu Zhao (@jieyuzhao11) on Personalized AI Agents
CUA agents -- rely on grounding actions -- many issues
CoAct -- mixing GUI action and coding -- orchestrator has access to coding and GUI operator -- better than just GUI or coding models
Discovering knowledge deficiencies

Siva Reddy

@sivareddyg

28 days

Check out the IVADO workshop on Deploying Autonomous Agents: Lessons, Risks and Real-World Impact, happening today until Wednesday in Montreal with an exciting lineup of speakers #Agents #LLMs https://t.co/VTEOv2kLGO

0

2

11

Nouha Dziri @NeurIPS25

@nouhadziri

25 days

I had the great pleasure today of speaking at the IVADO workshop on "Deploying Autonomous Agents: Lessons, Risks and Real-World Impact" in Montreal 🍁🇨🇦 alongside a brilliant lineup of speakers. A big thanks to the organizers! #LLMs #Agents #safety #Security

Siva Reddy

@sivareddyg

25 days

Nouha Dziri (@nouhadziri) on LLM to Agent Safety
Capability doesn't mean increased safety
Capable models still seem to be poor at OOD generalization, so easy to bypass safety
WildTeaming -- large scale jailbreaking using several tactics -- 262K jailbreaking examples -- Training

0

6

21