
Prithviraj (Raj) Ammanabrolu
@rajammanabrolu
Followers: 8K
Following: 16K
Media: 417
Statuses: 3K
Reinforcement Learning and Language. Assistant Prof @UCSanDiego. Research Scientist @Nvidia.
San Diego, CA
Joined April 2019
The PEARLS Lab at @UCSD_CSE is now open for business! I'm recruiting Fall 24 PhD students in all things interactive and grounded AI, RL, and NLP!! Join us in the land of 🏖️ beach (🧋pearl tea included). Apply by Dec 20. Please help spread the word! More:
Soon™, I'll be an Asst Prof @UCSanDiego @UCSD_CSE focusing on interactive & grounded AI, RL, NLP. I will also be a research scientist @MosaicML helping lead efforts to make tech like RLHF more accessible. Looking for PhD students & research eng/scientists to join me in ☀️SoCal🏖️
This will happen but it will be *very* black pilling for authors. You can spend all that time and convince reviewers via conversation but an (S)AC/PC you had no contact with will reject it to maintain an artificial acceptance rate.
The rebuttal process @NeurIPSConf was too collegial and productive, so a bunch of papers whose process ended on a positive note will be rejected.
RT @YejinChoinka: Honored to be back on TIME100 AI for 2025 — alongside my longtime heroes @drfeifei and @BarzilayRegina! 😍. The recognitio….
if your policy gradient runs aren't converging it's not a skill issue you're just not pure of heart.
define a Partially Observable Markov Decision Process (POMDP) w/ state S, observation O, action A, transition T, and reward R. now assume full observability and remove the constraint that T be Markov. if you are pure of heart, the policy gradients will converge to the Machine God.
RT @kuchaev: We are excited to release Nvidia-Nemotron-Nano-V2 model! This is a 9B hybrid SSM model with open base model and training data.….
RT @pratyushmaini: 1/Pretraining is hitting a data wall; scaling raw web data alone leads to diminishing returns. Today @datologyai shares….
Infra is the only thing to do to scale and scale is the only thing that consistently makes number go up. But if a company tells you they have it all figured out and "we'll have perfect infra soon, we're hiring many REs", run.
@suchenzang AI infra is young, hard and messy (you obviously know this). the fact that meta talks about their infra details openly only makes it publicly messy. everyone's infra across the industry is pretty messy but privately so.