
dheeraj rajagopal
@dheerajgopal
486 Followers · 1K Following · 1 Media · 144 Statuses · Joined April 2010
RT @TechCrunch: Fastino trains AI models on cheap gaming GPUs and just raised $17.5M led by Khosla | TechCrunch
RT @tylerachang: Presenting our work on training data attribution for pretraining this morning: -- come stop by in….
RT @shocheen: Really excited for this paper to be out. This project began nearly a year ago when I was at Ai2. Activation steering and rela….
Really proud to share this work by @patrickqdasilva, providing a thorough evaluation of steering LMs across model sizes.
Steering language models by directly intervening on internal activations is appealing, but does it generalize? We study 3 popular steering methods with 36 models from 14 families (1.5-70B), exposing brittle performance and fundamental flaws in underlying assumptions. 🧵👇 (1/10)
RT @NeelNanda5: Apps are open for my MATS stream, where I try to teach how to do great mech interp research. Due Feb 28!. I love mentoring….
Stoked to share our new work on scaling training data attribution (TDA) toward LLM pretraining, and the great insights we found along the way! More in the thread below from our excellent student researcher @tylerachang.
We scaled training data attribution (TDA) methods ~1000x to find influential pretraining examples for thousands of queries in an 8B-parameter LLM over the entire 160B-token C4 corpus!
RT @tylerachang: We scaled training data attribution (TDA) methods ~1000x to find influential pretraining examples for thousands of queries….
RT @vidhisha_b: Today’s benchmarks evaluate models on more complex tasks involving many skills. But, aggregate benchmark evaluations really….
RT @vidhisha_b: On some personal news: I joined MSR AI Frontiers a few months ago and am very excited to share my first work with this amaz….
RT @iftenney: Super excited for the Gemma model release, and with it a new debugging tool we built on 🔥LIT - use gradient-based salience to….
RT @vidhisha_b: Our Knowledge Card paper was accepted to #ICLR2024 as an Oral 🎊 Really excited to push further on Modular LLMs!
RT @ghandeharioun: 🧵Can we “ask” an LLM to “translate” its own hidden representations into natural language? We propose 🩺Patchscopes, a new….
RT @ghandeharioun: Research Opportunities @Google-Please RT.Excited to be at #NeurIPS2023! If interested in how model #interpretability can….
RT @chiragraman: Excited to share our #NeurIPS2023 paper "Why Did This Model Forecast This Future?" Our contrastive explanation framework f….
Really thrilled for Ankush’s new lab at BU! He is an excellent researcher and any student would be so lucky to have him as an advisor.
Thrilled to share that I will start a tenure-track assistant professor position in the CS department at Boston University! I am looking for PhD students in the area of programming languages with applications to distributed systems, cryptography, and machine learning.
RT @Das8Ankush: Thrilled to share that I will start a tenure-track assistant professor position in the CS department at Boston University!….
RT @dongyeopkang: I thought LLM ignores diverse opinions bcs it is biased towards aggregating probabilities of the majority. But, we found….