Nitish Joshi Profile
Nitish Joshi

@nitishjoshi23

Followers
975
Following
5K
Media
12
Statuses
189

PhD student at NYU | CS undergrad @IITBombay '20 | Research in Natural Language Processing (#NLProc). Birding @nitishbird

New York, USA
Joined June 2018
Don't wanna be here? Send us removal request.
@nitishjoshi23
Nitish Joshi
2 days
RT @michahu8: šŸ“¢ today's scaling laws often don't work for predicting downstream task performance. For some pretraining setups, smooth and p….
0
29
0
@nitishjoshi23
Nitish Joshi
5 days
RT @ChengleiSi: Are AI scientists already better than human researchers?. We recruited 43 PhD students to spend 3 months executing research….
0
162
0
@nitishjoshi23
Nitish Joshi
12 days
RT @rico_angell: What causes jailbreaks to transfer between LLMs?. We find that jailbreak strength and model representation similarity pred….
0
11
0
@nitishjoshi23
Nitish Joshi
22 days
RT @jcyhc_ai: LLMs won’t tell you how to make fake IDs—but will reveal the layouts/materials of IDs and make realistic photos if asked sepa….
0
9
0
@nitishjoshi23
Nitish Joshi
27 days
RT @natolambert: Nice to see folks studying biases in RLHF / preference tuning all the way down to the datasets. I think many of the biases….
0
8
0
@nitishjoshi23
Nitish Joshi
29 days
RT @cmalaviya11: Ever wondered what makes language models generate overly verbose, vague, or sycophantic responses?. Our new paper investig….
0
17
0
@nitishjoshi23
Nitish Joshi
2 months
RT @vishakh_pk: What does it mean for #LLM output to be novel?.In work w/ @jcyhc_ai, @JanePan_, @valeriechen_, @hhexiy we argue it needs t….
0
23
0
@nitishjoshi23
Nitish Joshi
3 months
RT @YulinChen99: Reasoning models overthink, generating multiple answers during reasoning. Is it because they can’t tell which ones are rig….
0
78
0
@nitishjoshi23
Nitish Joshi
3 months
RT @StringChaos: Excited to release R2E-Gym. - šŸ”„ 8.1K executable environments using synthetic data. - 🧠 Hybrid verifiers for enhanced inf….
0
62
0
@nitishjoshi23
Nitish Joshi
3 months
RT @yzpang_: First set of Llama 4!!.
0
3
0
@nitishjoshi23
Nitish Joshi
3 months
RT @yanda_chen_: My first paper @AnthropicAI is out!. We show that Chains-of-Thought often don’t reflect models’ true reasoning—posing chal….
0
87
0
@nitishjoshi23
Nitish Joshi
4 months
RT @nsaphra: 2018: Saliency maps give plausible interpretations of random weights, triggering skepticism and catalyzing the mechinterp cult….
0
24
0
@nitishjoshi23
Nitish Joshi
4 months
RT @JanePan_: When benchmarks talk, do LLMs listen?. Our new paper shows that evaluating that code LLMs with interactive feedback significa….
0
13
0
@nitishjoshi23
Nitish Joshi
4 months
RT @danish037: Remember this study about how LLM generated research ideas were rated to be more novel than expert-written ones? . We find a….
0
254
0
@nitishjoshi23
Nitish Joshi
7 months
New work where we show that with the right training distribution, transformers can learn to search and internally implement an exponential path-merging algo. But they struggle to learn to search as the graph size increases, and simple solns like scaling doesn't resolve it.
@yzpang_
Richard Pang
7 months
šŸšØšŸ””Foundational graph search task as testbed: for some distribution, transformers can learn to search (100% acc). We interpreted their algo!! But as graph size ↑, transformers struggle. Scaling up # params does not help; CoT does not help. 1.5 years of learning in 10 pages!
Tweet media one
1
7
50
@nitishjoshi23
Nitish Joshi
7 months
RT @omarsar0: Transformers Struggle to Learn to Search. Finds that transformer-based LLMs struggle to perform search robustly. Suggests t….
0
52
0
@nitishjoshi23
Nitish Joshi
7 months
RT @vishakh_pk: Had a lot of fun poking holes at how LLMs capture diverse preferences with @chuanyang_jin, @hannahrosekirk and @hhexiy 🧐! N….
0
7
0
@nitishjoshi23
Nitish Joshi
8 months
RT @LauraRuis: How do LLMs learn to reason from data? Are they ~retrieving the answers from parametric knowledge🦜? In our new preprint, we….
0
209
0
@nitishjoshi23
Nitish Joshi
8 months
RT @lasha_nlp: ✨I’m on the faculty job market for 2024-2025! ✨ . My research focuses on advancing Responsible AI—enhancing factuality, robu….
0
46
0
@nitishjoshi23
Nitish Joshi
9 months
RT @javirandor: Anyone may be able to compromise LLMs with malicious content posted online. With just a small amount of data, adversaries c….
0
27
0