
Chaitanya Malaviya
@cmalaviya11
Followers
320
Following
118
Media
32
Statuses
64
PhD student at UPenn @upennnlp | benchmarking and evaluation | soon senior research scientist @GoogleDeepMind | prev @allen_ai @GoogleDeepMind and @LTIatCMU
Seattle, WA
Joined September 2023
Thanks for the mention @natolambert :) shoutout to the amazing undergrad @abharadwaj123 who led this work!.
Nice to see folks studying biases in RLHF / preference tuning all the way down to the datasets. I think many of the biases are mostly irreducible human biases that can't be solved within current training regimes, just mitigated.
0
0
4
Our findings suggest that targeted debiasing using counterfactuals can help build more reliable preference models, a key step for both LLM alignment and evaluation. Work led by @abharadwaj123 and done jointly with @nitishjoshi23 and @yatskar.
1
0
4
RT @ManyaWadhwa1: Evaluating language model responses on open-ended tasks is hard! š¤. We introduce EvalAgent, a framework that identifies nā¦.
0
34
0
RT @kylelostat: come chat w me and @cmalaviya11 at #emnlp2024 about evaluating LMs, how findings can be impacted when dataset queries are vā¦.
0
7
0
Joint work done @allen_ai with @josephcc, @DanRothNLP, @MohitIyyer, @yatskar, @kylelostat. Find these & many more results in our paper: Use our code to run your own contextualized evals: Explore our data:
1
0
3