Rajashree Agrawal Profile
Rajashree Agrawal

@___rajashree___

Followers
248
Following
2K
Media
3
Statuses
137

building @theoremlabs

Joined February 2021
@___rajashree___
Rajashree Agrawal
2 months
RT @NeelNanda5: An interesting concept! I'm curious to see where it goes.
0
3
0
@___rajashree___
Rajashree Agrawal
2 months
If Andrew says so, it must be true 😁.
@dyushag
Dyusha Gritsevskiy
2 months
This just might be the greatest Y Combinator company of all time.
0
0
4
@___rajashree___
Rajashree Agrawal
2 months
RT @ycombinator: Theorem (@theoremlabs) is an AI-coding IDE for mission-critical software. They're making program verification 10,000 times….
0
23
0
@___rajashree___
Rajashree Agrawal
1 year
RT @diagram_chaser: Mechanistic interpretability gives us rich explanations of models. But can we convert these explanations into formal pr….
0
34
0
@___rajashree___
Rajashree Agrawal
1 year
RT @janleike: I believe much more of our bandwidth should be spent getting ready for the next generations of models, on security, monitorin….
0
254
0
@___rajashree___
Rajashree Agrawal
1 year
Tweet media one
0
0
0
@___rajashree___
Rajashree Agrawal
1 year
I am a SOTA human jailbreaker (for now). 😄
Tweet media one
1
0
7
@___rajashree___
Rajashree Agrawal
1 year
MSJ is an order of magnitude more effective at jailbreaking Claude than SOTA attacks!!! The paper is a cool step in thinking about *model capabilities* as *attack surfaces*. Congrats to @cem__anil on knocking it out of the park! Super glad to have been a part of it!
@AnthropicAI
Anthropic
1 year
New Anthropic research paper: Many-shot jailbreaking. We study a long-context jailbreaking technique that is effective on most large language models, including those developed by Anthropic and many of our peers. Read our blog post and the paper here:
Tweet media one
0
2
12
@___rajashree___
Rajashree Agrawal
1 year
There's no clearly winning score on OCEAN, but there is a losing score. Smh. Just like BMI.
0
0
6
@___rajashree___
Rajashree Agrawal
1 year
Obvious but surprising fact from analysing my self-reported daily productivity metrics from 2023: using AI tools was anti-correlated with deep work. I would like to experiment with this more, but don't have any bright ideas for it.
0
0
4
@___rajashree___
Rajashree Agrawal
2 years
RT @PhilGalfond: Are you afraid to value bet unless you're almost positive your hand is good? It's not uncommon, but you're missing out on….
0
2
0
@___rajashree___
Rajashree Agrawal
3 years
RT @JeffLadish: Love this little chart here from Open Phil
Tweet media one
0
88
0
@___rajashree___
Rajashree Agrawal
3 years
games i have experienced strong tetris effect for so far:
1. wordle (+variants): thinking in 5 letter words
2. set: 2 colorful images popping up, find the third one
3. kakuro: quiz me on how many ways there are to make 5 unique single digit integers add up to 28
0
0
2
@___rajashree___
Rajashree Agrawal
3 years
it is interesting how much search results change when googling symptoms vs symptoms + "woman".
0
0
0
@___rajashree___
Rajashree Agrawal
3 years
"substantive" economics and mathematics discussions in academic settings are only bearable to me when i go in with the intention to make notes on how people will miscommunicate. i do this about once a week, also the frequency of my desire to start a new teaching program.
0
0
0
@___rajashree___
Rajashree Agrawal
3 years
misplaced attempts at structure can overwhelm. dependency on structure can squeeze out serendipity.
0
0
0
@___rajashree___
Rajashree Agrawal
3 years
finding structure can give mobility, add efficiency, illuminate beauty.
1
0
1
@___rajashree___
Rajashree Agrawal
3 years
being obsessed with finding underlying structure is a gift and a straitjacket.
1
0
2
@___rajashree___
Rajashree Agrawal
3 years
Instantiation in educational program surveys: every program I've been to gets about a 9/10 rating to the question meant to measure counterfactual impact. OK OK even if it wasn't "meant" to measure, it will later be used to make this claim.
@SpencrGreenberg
Spencer Greenberg 🔍
3 years
The exact wording in polls, surveys and psychology studies matters more than most people seem to realize. This is a big deal if you want to learn from polls and academic papers, or if you conduct studies yourself. Here are some dramatic real examples:
0
0
1