Jiho Jin
@jin__jiho
Followers
83
Following
132
Media
1
Statuses
44
Ph.D. Student @ Users & Information Lab, School of Computing, KAIST
Joined May 2022
๐ How do Large Language Models internally process cultural knowledge? ๐ Happy to share our new preprint "Entangled in Representations: Mechanistic Investigation of Cultural Biases in Large Language Models" ๐ Paper: https://t.co/Gt14mjMiQg
3
28
202
Huge thanks to my wonderful collaborators: Woosung Kang, @JunhoMyung_, and @aliceoh. ๐
0
0
0
Dive deeper into our work: - Paper: https://t.co/sEXSBuGpPH - Project Page: https://t.co/MeDT3tW5Tx - Repo: https://t.co/P66PUUmusB - HuggingFace Datasets:
huggingface.co
1
0
1
๐ฟย Challenge: Automatically evaluate the social biases in LLMsโ long-form responses without additional human annotation ๐กย Solution: Take the stories of the BBQ dataset, use them as prompts, then use the existing annotations to evaluate the long-form generations!
1
0
0
๐คย Is evaluating social bias of #LLMs with only multiple-choice QA benchmarks enough? NO! ๐ย Research Question: How often do LLMs โgenerateโ responses that reflect social stereotypes?
1
0
0
๐ฅณThrilled to share that our paper has been accepted to the Findings of #ACL2025NLP! We introduce #BBG, a Bias Benchmark for Generation, an adaptation of the Bias Benchmark for QA (BBQ), revealing inconsistencies between generation- and multiple-choice QA-based evaluations.
1
0
4
Thrilled to share that I'll present my #NeurIPS2024 poster presentation on 11th Dec! "BLEnD: A Benchmark for LLMs on Everyday Knowledge in Diverse Cultures and Languages" https://t.co/TfADNzM2fS ๐ When: Wed 11 Dec 4:30 pm- 7:30 pm ๐ Where: West Ballroom A-D
arxiv.org
Large language models (LLMs) often lack culture-specific knowledge of daily life, especially across diverse regions and non-English languages. Existing benchmarks for evaluating LLMs' cultural...
๐Happy to share that our "๐๐๐๐ง๐: ๐ ๐๐๐ง๐๐ก๐ฆ๐๐ซ๐ค ๐๐จ๐ซ ๐๐๐๐ฌ ๐จ๐ง ๐๐ฏ๐๐ซ๐ฒ๐๐๐ฒ ๐๐ง๐จ๐ฐ๐ฅ๐๐๐ ๐ ๐ข๐ง ๐๐ข๐ฏ๐๐ซ๐ฌ๐ ๐๐ฎ๐ฅ๐ญ๐ฎ๐ซ๐๐ฌ ๐๐ง๐ ๐๐๐ง๐ ๐ฎ๐๐ ๐๐ฌ" paper has been accepted to #NeurIPS2024 D&B Track ๐ Congrats to all the amazing co-authors ๐ฅณ
2
9
47
Join us for our poster presentation at #EMNLP2024 ! ๐When: Thu 14 Nov 14:00-15:30 ๐Where: Riverfront hall ๐Paper: https://t.co/NhS0EuqKk0 If you are interested in social reasoning and theory of mind of language models, please come by! @emnlpmeeting
0
2
13
๐ขWhat do we know about Theory of Mind (ToM) in LLMs, aside from the fact that they struggle with it? What foundational ToM capabilities do they have? Our new EMNLP paper explores the precursory inferences of ToM in LLMs: perception inference and perception-to-belief inference๐
6
14
75
I am so grateful and proud of @whoSiddheshp and @jjjunyeong for leading this new survey paper on cultural awareness of LLMs. One trend I want to highlight is that while the MQA makes up the largest portion of eval, short answer and long form eval are gaining ground as well ๐
๐Excited to share our comprehensive survey on cultural awareness in #LLMs! ๐บ๏ธ We reviewed 300+ papers across diverse modalities (language, vision-language, etc.) @whoSiddheshp @jjjunyeong @jin__jiho @rnav_arora @JunhoMyung_ @_inhwa_song @aliceoh #NLProc
https://t.co/ai5UZOqIcf
3
8
55
Thank you to each and every one of you for making this happen. It was a great experience collaborating with such an amazing team!
๐Excited to share our comprehensive survey on cultural awareness in #LLMs! ๐บ๏ธ We reviewed 300+ papers across diverse modalities (language, vision-language, etc.) @whoSiddheshp @jjjunyeong @jin__jiho @rnav_arora @JunhoMyung_ @_inhwa_song @aliceoh #NLProc
https://t.co/ai5UZOqIcf
0
1
6
๐Excited to share our comprehensive survey on cultural awareness in #LLMs! ๐บ๏ธ We reviewed 300+ papers across diverse modalities (language, vision-language, etc.) @whoSiddheshp @jjjunyeong @jin__jiho @rnav_arora @JunhoMyung_ @_inhwa_song @aliceoh #NLProc
https://t.co/ai5UZOqIcf
8
35
150
๐คฉReally excited that this work will be presented at #neurips2024 d&b track. The BLEnD dataset took serious collaboration of hard thinking and work, getting human annotations from 16 diverse regional cultures in 13 languages, putting together short-answer and multiple choice QA
๐Happy to share that our "๐๐๐๐ง๐: ๐ ๐๐๐ง๐๐ก๐ฆ๐๐ซ๐ค ๐๐จ๐ซ ๐๐๐๐ฌ ๐จ๐ง ๐๐ฏ๐๐ซ๐ฒ๐๐๐ฒ ๐๐ง๐จ๐ฐ๐ฅ๐๐๐ ๐ ๐ข๐ง ๐๐ข๐ฏ๐๐ซ๐ฌ๐ ๐๐ฎ๐ฅ๐ญ๐ฎ๐ซ๐๐ฌ ๐๐ง๐ ๐๐๐ง๐ ๐ฎ๐๐ ๐๐ฌ" paper has been accepted to #NeurIPS2024 D&B Track ๐ Congrats to all the amazing co-authors ๐ฅณ
1
16
75
๐Happy to share that our "๐๐๐๐ง๐: ๐ ๐๐๐ง๐๐ก๐ฆ๐๐ซ๐ค ๐๐จ๐ซ ๐๐๐๐ฌ ๐จ๐ง ๐๐ฏ๐๐ซ๐ฒ๐๐๐ฒ ๐๐ง๐จ๐ฐ๐ฅ๐๐๐ ๐ ๐ข๐ง ๐๐ข๐ฏ๐๐ซ๐ฌ๐ ๐๐ฎ๐ฅ๐ญ๐ฎ๐ซ๐๐ฌ ๐๐ง๐ ๐๐๐ง๐ ๐ฎ๐๐ ๐๐ฌ" paper has been accepted to #NeurIPS2024 D&B Track ๐ Congrats to all the amazing co-authors ๐ฅณ
๐ขWe're thrilled to introduce BLEND, our latest benchmark designed to test LLMs' understanding of everyday life across diverse cultures and languages. BLEND features 52.6k Q&A pairs in 13 languages. ๐ ๐ https://t.co/GGYsWtyCm1
2
15
49
Breaking down the Theory-of-Mind task into perception, perspective reasoning, and response generation. Then analyzing the performance of LLMs (pretty bad), led by @ChaniJung99, with @YejinChoinka and @hyunw_kim, co-authors @jiseon_kim1 @_dongkwan_kim @jin__jiho @YeonSeonwoo Will
0
7
60
This paper got the best non-archival paper award! ๐ Congratulations to @JunhoMyung00211, @nlee0212 and @jodieyzhou who brilliantly led the project, and to all the many collaborators involved in the project! ๐๐ผ๐๐ผ
1
6
22
Congratulations ๐ @JunhoMyung00211 @CamachoCollados @hwaran_lee @nlee0212 @jodieyzhou @nedjmaou @euns0o_kim @rifkiaputri and all other authors! Thank you @c3_nlp organizers! #acl2024
The C3NLP best papers are: 1. BLEnD: A Benchmark for LLMs on Everyday Knowledge in Diverse Cultures and Languages Junho Myung et al.
1
9
49
Pre #acl2024 talks, posters, and food ๐ฅ๐๐ฒ๐๐ฅ We @sunipa17 @kaiwei_chang @TristanNaumann @IAugenstein @computermacgyve @CamachoCollados @PangWeiKoh @seo_minjoon @NoSyu and Yoon Kim, (with @mohitban47 and @VioletNPeng joining online) had a blast thanks to the amazing students
Had a meaningful experience organizing pre-#ACL2024 workshop at KAIST๐ Sharing ideas is always exciting, but it was even greater with these amazing #NLProc people!โจ
2
15
64