Jiho Jin @jin__jiho X Profile

Jiho Jin

@jin__jiho

Followers: 83

Following: 132

Media: 1

Statuses: 44

Ph.D. Student @ Users & Information Lab, School of Computing, KAIST

Joined May 2022

Haeun Yu

@hayu204

3 months

🙋 How do Large Language Models internally process cultural knowledge? 🌐 Happy to share our new preprint "Entangled in Representations: Mechanistic Investigation of Cultural Biases in Large Language Models" 📃 Paper: https://t.co/Gt14mjMiQg

3

28

202

Jiho Jin

@jin__jiho

4 months

Huge thanks to my wonderful collaborators: Woosung Kang, @JunhoMyung_, and @aliceoh. 💛

0

Jiho Jin

@jin__jiho

4 months

Dive deeper into our work:
- Paper: https://t.co/sEXSBuGpPH
- Project Page: https://t.co/MeDT3tW5Tx
- Repo: https://t.co/P66PUUmusB
- HuggingFace Datasets:

huggingface.co

1

0

1

Jiho Jin

@jin__jiho

4 months

👿 Challenge: Automatically evaluate the social biases in LLMs’ long-form responses without additional human annotation 💡 Solution: Take the stories of the BBQ dataset, use them as prompts, then use the existing annotations to evaluate the long-form generations!

1

0

Jiho Jin

@jin__jiho

4 months

🤔 Is evaluating social bias of #LLMs with only multiple-choice QA benchmarks enough? NO! 🔍 Research Question: How often do LLMs “generate” responses that reflect social stereotypes?

1

0

Jiho Jin

@jin__jiho

4 months

🥳Thrilled to share that our paper has been accepted to the Findings of #ACL2025NLP! We introduce #BBG, a Bias Benchmark for Generation, an adaptation of the Bias Benchmark for QA (BBQ), revealing inconsistencies between generation- and multiple-choice QA-based evaluations.

1

0

4

Junho Myung

@JunhoMyung_

11 months

Thrilled to share that I'll present my #NeurIPS2024 poster on 11th Dec! "BLEnD: A Benchmark for LLMs on Everyday Knowledge in Diverse Cultures and Languages" https://t.co/TfADNzM2fS
📆 When: Wed 11 Dec, 4:30 pm - 7:30 pm
📍 Where: West Ballroom A-D

arxiv.org

Large language models (LLMs) often lack culture-specific knowledge of daily life, especially across diverse regions and non-English languages. Existing benchmarks for evaluating LLMs' cultural...

Yi (Jodie) Zhou

@jodieyzhou

1 year

🌟Happy to share that our "𝐁𝐋𝐄𝐧𝐃: 𝐀 𝐁𝐞𝐧𝐜𝐡𝐦𝐚𝐫𝐤 𝐟𝐨𝐫 𝐋𝐋𝐌𝐬 𝐨𝐧 𝐄𝐯𝐞𝐫𝐲𝐝𝐚𝐲 𝐊𝐧𝐨𝐰𝐥𝐞𝐝𝐠𝐞 𝐢𝐧 𝐃𝐢𝐯𝐞𝐫𝐬𝐞 𝐂𝐮𝐥𝐭𝐮𝐫𝐞𝐬 𝐚𝐧𝐝 𝐋𝐚𝐧𝐠𝐮𝐚𝐠𝐞𝐬" paper has been accepted to #NeurIPS2024 D&B Track 🚀 Congrats to all the amazing co-authors 🥳

2

9

47

Chani Jung

@_chani_jung

1 year

Join us for our poster presentation at #EMNLP2024!
📆 When: Thu 14 Nov 14:00-15:30
📍 Where: Riverfront hall
📄 Paper: https://t.co/NhS0EuqKk0
If you are interested in social reasoning and theory of mind of language models, please come by! @emnlpmeeting

0

2

13

Hyunwoo Kim

@hyunw_kim

1 year

📢What do we know about Theory of Mind (ToM) in LLMs, aside from the fact that they struggle with it? What foundational ToM capabilities do they have? Our new EMNLP paper explores the precursory inferences of ToM in LLMs: perception inference and perception-to-belief inference👀

6

14

75

Alice Oh

@aliceoh

1 year

I am so grateful and proud of @whoSiddheshp and @jjjunyeong for leading this new survey paper on cultural awareness of LLMs. One trend I want to highlight is that while the MQA makes up the largest portion of eval, short answer and long form eval are gaining ground as well 👍

Isabelle Augenstein

@IAugenstein

1 year

📜Excited to share our comprehensive survey on cultural awareness in #LLMs! 🗺️ We reviewed 300+ papers across diverse modalities (language, vision-language, etc.) @whoSiddheshp @jjjunyeong @jin__jiho @rnav_arora @JunhoMyung_ @_inhwa_song @aliceoh #NLProc https://t.co/ai5UZOqIcf

3

8

55

Junyeong Park

@jjjunyeong

1 year

Thank you to each and every one of you for making this happen. It was a great experience collaborating with such an amazing team!

0

1

6

Isabelle Augenstein

@IAugenstein

1 year

📜Excited to share our comprehensive survey on cultural awareness in #LLMs! 🗺️ We reviewed 300+ papers across diverse modalities (language, vision-language, etc.) @whoSiddheshp @jjjunyeong @jin__jiho @rnav_arora @JunhoMyung_ @_inhwa_song @aliceoh #NLProc https://t.co/ai5UZOqIcf

8

35

150

Alice Oh

@aliceoh

1 year

🤩Really excited that this work will be presented at #neurips2024 d&b track. The BLEnD dataset took serious collaboration of hard thinking and work, getting human annotations from 16 diverse regional cultures in 13 languages, putting together short-answer and multiple choice QA

1

16

75

Yi (Jodie) Zhou

@jodieyzhou

1 year

🌟Happy to share that our "𝐁𝐋𝐄𝐧𝐃: 𝐀 𝐁𝐞𝐧𝐜𝐡𝐦𝐚𝐫𝐤 𝐟𝐨𝐫 𝐋𝐋𝐌𝐬 𝐨𝐧 𝐄𝐯𝐞𝐫𝐲𝐝𝐚𝐲 𝐊𝐧𝐨𝐰𝐥𝐞𝐝𝐠𝐞 𝐢𝐧 𝐃𝐢𝐯𝐞𝐫𝐬𝐞 𝐂𝐮𝐥𝐭𝐮𝐫𝐞𝐬 𝐚𝐧𝐝 𝐋𝐚𝐧𝐠𝐮𝐚𝐠𝐞𝐬" paper has been accepted to #NeurIPS2024 D&B Track 🚀 Congrats to all the amazing co-authors 🥳

Yi (Jodie) Zhou

@jodieyzhou

1 year

📢We're thrilled to introduce BLEND, our latest benchmark designed to test LLMs' understanding of everyday life across diverse cultures and languages. BLEND features 52.6k Q&A pairs in 13 languages. 🚀 📜 https://t.co/GGYsWtyCm1

2

15

49

Alice Oh

@aliceoh

1 year

Breaking down the Theory-of-Mind task into perception, perspective reasoning, and response generation. Then analyzing the performance of LLMs (pretty bad), led by @ChaniJung99, with @YejinChoinka and @hyunw_kim, co-authors @jiseon_kim1 @_dongkwan_kim @jin__jiho @YeonSeonwoo Will

0

7

60

Jose Camacho Collados

@CamachoCollados

1 year

This paper got the best non-archival paper award! 🏆 Congratulations to @JunhoMyung00211, @nlee0212 and @jodieyzhou who brilliantly led the project, and to all the many collaborators involved in the project! 👏🏼👏🏼

1

6

22

Alice Oh

@aliceoh

1 year

Congratulations 🎉 @JunhoMyung00211 @CamachoCollados @hwaran_lee @nlee0212 @jodieyzhou @nedjmaou @euns0o_kim @rifkiaputri and all other authors! Thank you @c3_nlp organizers! #acl2024

C3NLP

@c3_nlp

1 year

The C3NLP best papers are: 1. BLEnD: A Benchmark for LLMs on Everyday Knowledge in Diverse Cultures and Languages Junho Myung et al.

1

9

49

Alice Oh

@aliceoh

1 year

Pre #acl2024 talks, posters, and food 🥘🍝🍲🍛🥙 We @sunipa17 @kaiwei_chang @TristanNaumann @IAugenstein @computermacgyve @CamachoCollados @PangWeiKoh @seo_minjoon @NoSyu and Yoon Kim, (with @mohitban47 and @VioletNPeng joining online) had a blast thanks to the amazing students

Jieun Han

@z_eunie

1 year

Had a meaningful experience organizing pre-#ACL2024 workshop at KAIST💙 Sharing ideas is always exciting, but it was even greater with these amazing #NLProc people!✨

2

15

64

Jiseon Kim

@jiseon_kim1

1 year

I’m in Bangkok for ACL 2024🇹🇭! We will be sharing KoBBQ in several presentations and poster sessions. If you are interested, please stop by!
12 Aug @ 12:15 (Question Answering I)
14 Aug @ 10:30 (In-Person Poster Session)
16 Aug @ 11:50, 16:00 (C3NLP)

Jiseon Kim

@jiseon_kim1

2 years

Our paper "KoBBQ: Korean Bias Benchmark for Question Answering" has been accepted at #TACL2024 and will be presented at #ACL2024! We introduce KoBBQ, a Korean bias benchmark dataset, validated through a large-scale survey, reflecting the stereotypes in Korean culture.

0

3

29

Jieun Han

@z_eunie

1 year

Had a meaningful experience organizing pre-#ACL2024 workshop at KAIST💙 Sharing ideas is always exciting, but it was even greater with these amazing #NLProc people!✨

0

4

48