kartik goyal @kartik_goyal_ X Profile

kartik goyal

@kartik_goyal_

Followers

639

Following

809

Media

8

Statuses

118

assistant professor at Georgia Tech @ICatGT; past @TTIC_Connect; phd @LTIatCMU; ml, nlp, and beyond...

Pittsburgh, USA

Joined May 2009

Don't wanna be here? Send us removal request.

kartik goyal

@kartik_goyal_

2 days

Sounds unreal so we conducted a bunch of ablations to control for confounding factors. Although we instantiate this framework only for MCQAs and word problems, modular evaluation and training holds tremendous promise. led by Yunxiang Yan and @tsawada_ml.

arxiv.org

While question-answering~(QA) benchmark performance is an automatic and scalable method to compare LLMs, it is an indirect method of evaluating their underlying problem-solving capabilities....

0

1

kartik goyal

@kartik_goyal_

2 days

i.e. decompose the question into abstract non-overlapping parts and present them in a cascaded manner withholding info. We find that this not only better estimates the underlying LM capabilities but also elicits better internal traces, i.e "Weak" models are stronger & vice-versa.

1

0

1

kartik goyal

@kartik_goyal_

2 days

We predominantly evaluate LMs on QA datasets when really, we are interested in evaluating the quality of strategies they apply for QA. In this work, we provide an extensible framework to better estimate this by changing *how* we ask questions to LMs. .

Tom Sawada

@tsawada_ml

2 days

Why do we evaluate LLMs using multiple-choice QA. when in practice, we ask them to generate open-ended answers?.Standard evaluation rewards models for choosing the right letter — not for reasoning their way to an answer. A better alternative: Cascaded Information Disclosure.

1

0

1

kartik goyal

@kartik_goyal_

9 months

RT @humphrey_shi: We’re recruiting future faculty leaders @ICatGT at all tenure-track ranks in strategic areas like AI, Robotics, and Respo….

0

14

0

kartik goyal

@kartik_goyal_

10 months

RT @mark_riedl: Hi all. I am co-organizing a Summit on Responsible Computing, AI, and Society at @GeorgiaTech Oct 28-30. We will actively d….

rcais.github.io

2025 Summit on Responsible Computing, AI, and Society

0

64

0

kartik goyal

@kartik_goyal_

1 year

Yay!! @davis_yoshida @kevingimpel.

ACL 2025

@aclmeeting

1 year

🎉SAC Awards:. 4) RomanSetu: Efficiently unlocking multilingual capabilities of Large Language Models via Romanization by Husain et al. 5) MAP’s not dead yet: Uncovering true language model modes by conditioning away degeneracy by Yoshida et al. #NLProc #ACL2024NLP.

0

6

kartik goyal

@kartik_goyal_

1 year

Find the paper here for details!

arxiv.org

It has been widely observed that exact or approximate MAP (mode-seeking) decoding from natural language generation (NLG) models consistently leads to degenerate outputs (Holtzman et al., 2019;...

0

1

kartik goyal

@kartik_goyal_

1 year

If you're at #ACL2024NLP, say hi to the amazing @davis_yoshida who is going to talk about why *search* for maximally scoring strings under LMs yields degenerate outputs and how to fix it. Joint work with @kevingimpel @davis_yoshida

1

0

12

kartik goyal

@kartik_goyal_

2 years

Apply to Georgia tech for a PhD by Dec 15 -- It's a fun place to do ML, NLP, and explore a variety of adjacent research areas! More info:

Alan Ritter

@alan_ritter

2 years

If you are applying to Ph.D. programs in NLP this year - consider Georgia Tech!. More info on our programs along with links to apply are listed below. CS: ML: Info on Atlanta:

0

17

62

kartik goyal

@kartik_goyal_

2 years

RT @ChrisVVarren: When @print_and_prob collaborator @NikolaiVogler heard that ESTC was down due to the BL cyberattack, he jumped into acti….

0

5

0

kartik goyal

@kartik_goyal_

2 years

RT @TTIC_Connect: TTIC is accepting Research Assistant Professor applications until Dec. 1! This position includes no teaching requirements….

0

10

0

kartik goyal

@kartik_goyal_

2 years

New CHR paper with an amazing set of collaborators: we find that high-recall bitext mining and sentence alignment is actually kinda tricky for messy historical literary text. Multilingual embeddings like LaBSE and friends work surprisingly well for literary ancient Greek though!.

David Smith

@dasmiq

2 years

Caroline Craig, @kartik_goyal_ , @farnooshamsian , and @PhilologistGRC have a CHR paper on getting document-level sentence alignment to work for the ancient Greek and Latin corpus to track multiple translations into English, French, German, Persian, etc.

0

1

6

kartik goyal

@kartik_goyal_

2 years

🚨 I started as an assistant professor at Georgia Tech @ICatGT this fall! I am looking to hire PhD students interested in NLP, ML, and cultural analytics (LLMs inlcuded 😉). Come work with me! Apply to a PhD Program listed here and mention me in your app:

Georgia Tech School of Interactive Computing

@ICatGT

2 years

Please welcome @kartik_goyal_, who starts this semester as an associate professor! He will explore the analytical and practical benefits of statistical NLP as it pertains to the study of human languages. Get to know Kartik below!.

20

22

195

kartik goyal

@kartik_goyal_

2 years

RT @mark_riedl: Hi 👋 The @GeorgiaTech School of Interactive Computing is continuing to grow! . Last year we hired 6 new faculty. This year….

0

29

0

kartik goyal

@kartik_goyal_

2 years

RT @TTIC_Connect: TTIC is a proud sponsor of #MMLS2023, which will be held on May 16-17 at @UICEngineering Dorin Forum, featuring TTIC Prof….

0

1

0

kartik goyal

@kartik_goyal_

3 years

RT @samuellemley: We'll be sharing some of our research in computational bibliography at @GrolierClub as part of @BibSocAmer's Bibliography….

0

4

0

kartik goyal

@kartik_goyal_

3 years

RT @limufar: I will be presenting 2 papers @emnlpmeeting tmw, both on #privacy and #memorization in LLMs:.1. Poster session 1 @ 11AM: Quant….

0

19

0

kartik goyal

@kartik_goyal_

3 years

RT @leoduw: When I first came across Mask-Predict, I always thought the decoding could be formulated in a principled MCMC way. This paper….

0

2

0

kartik goyal

@kartik_goyal_

3 years

RT @limufar: Join us at 11am PT today, during *virtual* poster session 2 at @aclmeeting to discuss how you can condition text generation u….

0

4

0

kartik goyal

@kartik_goyal_

3 years

Having observed certain energy parametrizations yielding good samples from masked LMs, we were curious if we could do controlled text generation by stacking appropriate blackboxes with a product-of-experts energy formulation in w/ @limufar @BergKirkpatrick.

Niloofar (✈️ ACL)

@niloofar_mire

3 years

35k trained models on @huggingface, yet whenever we want to generate text w/ given attributes, we train new models/clsfrs. In our #ACL2022 paper, we enable using ANY arbitrary (even non-differentiable) expert for controllable generation, w/o ANY TRAINING!.

1

8