Jiho Jin Profile
Jiho Jin

@jin__jiho

Followers
83
Following
132
Media
1
Statuses
44

Ph.D. Student @ Users & Information Lab, School of Computing, KAIST

Joined May 2022
Don't wanna be here? Send us removal request.
@hayu204
Haeun Yu
3 months
๐Ÿ™‹ How do Large Language Models internally process cultural knowledge? ๐ŸŒ Happy to share our new preprint "Entangled in Representations: Mechanistic Investigation of Cultural Biases in Large Language Models" ๐Ÿ“ƒ Paper: https://t.co/Gt14mjMiQg
3
28
202
@jin__jiho
Jiho Jin
4 months
Huge thanks to my wonderful collaborators: Woosung Kang, @JunhoMyung_, and @aliceoh. ๐Ÿ’›
0
0
0
@jin__jiho
Jiho Jin
4 months
Dive deeper into our work: - Paper: https://t.co/sEXSBuGpPH - Project Page: https://t.co/MeDT3tW5Tx - Repo: https://t.co/P66PUUmusB - HuggingFace Datasets:
Tweet card summary image
huggingface.co
1
0
1
@jin__jiho
Jiho Jin
4 months
๐Ÿ‘ฟย Challenge: Automatically evaluate the social biases in LLMsโ€™ long-form responses without additional human annotation ๐Ÿ’กย Solution: Take the stories of the BBQ dataset, use them as prompts, then use the existing annotations to evaluate the long-form generations!
1
0
0
@jin__jiho
Jiho Jin
4 months
๐Ÿค”ย Is evaluating social bias of #LLMs with only multiple-choice QA benchmarks enough? NO! ๐Ÿ”ย Research Question: How often do LLMs โ€œgenerateโ€ responses that reflect social stereotypes?
1
0
0
@jin__jiho
Jiho Jin
4 months
๐ŸฅณThrilled to share that our paper has been accepted to the Findings of #ACL2025NLP! We introduce #BBG, a Bias Benchmark for Generation, an adaptation of the Bias Benchmark for QA (BBQ), revealing inconsistencies between generation- and multiple-choice QA-based evaluations.
1
0
4
@JunhoMyung_
Junho Myung
11 months
Thrilled to share that I'll present my #NeurIPS2024 poster presentation on 11th Dec! "BLEnD: A Benchmark for LLMs on Everyday Knowledge in Diverse Cultures and Languages" https://t.co/TfADNzM2fS ๐Ÿ“† When: Wed 11 Dec 4:30 pm- 7:30 pm ๐Ÿ“ Where: West Ballroom A-D
Tweet card summary image
arxiv.org
Large language models (LLMs) often lack culture-specific knowledge of daily life, especially across diverse regions and non-English languages. Existing benchmarks for evaluating LLMs' cultural...
@jodieyzhou
Yi (Jodie) Zhou
1 year
๐ŸŒŸHappy to share that our "๐๐‹๐„๐ง๐ƒ: ๐€ ๐๐ž๐ง๐œ๐ก๐ฆ๐š๐ซ๐ค ๐Ÿ๐จ๐ซ ๐‹๐‹๐Œ๐ฌ ๐จ๐ง ๐„๐ฏ๐ž๐ซ๐ฒ๐๐š๐ฒ ๐Š๐ง๐จ๐ฐ๐ฅ๐ž๐๐ ๐ž ๐ข๐ง ๐ƒ๐ข๐ฏ๐ž๐ซ๐ฌ๐ž ๐‚๐ฎ๐ฅ๐ญ๐ฎ๐ซ๐ž๐ฌ ๐š๐ง๐ ๐‹๐š๐ง๐ ๐ฎ๐š๐ ๐ž๐ฌ" paper has been accepted to #NeurIPS2024 D&B Track ๐Ÿš€ Congrats to all the amazing co-authors ๐Ÿฅณ
2
9
47
@_chani_jung
Chani Jung
1 year
Join us for our poster presentation at #EMNLP2024 ! ๐Ÿ“†When: Thu 14 Nov 14:00-15:30 ๐Ÿ“Where: Riverfront hall ๐Ÿ“„Paper: https://t.co/NhS0EuqKk0 If you are interested in social reasoning and theory of mind of language models, please come by! @emnlpmeeting
0
2
13
@hyunw_kim
Hyunwoo Kim
1 year
๐Ÿ“ขWhat do we know about Theory of Mind (ToM) in LLMs, aside from the fact that they struggle with it? What foundational ToM capabilities do they have? Our new EMNLP paper explores the precursory inferences of ToM in LLMs: perception inference and perception-to-belief inference๐Ÿ‘€
6
14
75
@aliceoh
Alice Oh
1 year
I am so grateful and proud of @whoSiddheshp and @jjjunyeong for leading this new survey paper on cultural awareness of LLMs. One trend I want to highlight is that while the MQA makes up the largest portion of eval, short answer and long form eval are gaining ground as well ๐Ÿ‘
@IAugenstein
Isabelle Augenstein
1 year
๐Ÿ“œExcited to share our comprehensive survey on cultural awareness in #LLMs! ๐Ÿ—บ๏ธ We reviewed 300+ papers across diverse modalities (language, vision-language, etc.) @whoSiddheshp @jjjunyeong @jin__jiho @rnav_arora @JunhoMyung_ @_inhwa_song @aliceoh #NLProc https://t.co/ai5UZOqIcf
3
8
55
@jjjunyeong
Junyeong Park
1 year
Thank you to each and every one of you for making this happen. It was a great experience collaborating with such an amazing team!
@IAugenstein
Isabelle Augenstein
1 year
๐Ÿ“œExcited to share our comprehensive survey on cultural awareness in #LLMs! ๐Ÿ—บ๏ธ We reviewed 300+ papers across diverse modalities (language, vision-language, etc.) @whoSiddheshp @jjjunyeong @jin__jiho @rnav_arora @JunhoMyung_ @_inhwa_song @aliceoh #NLProc https://t.co/ai5UZOqIcf
0
1
6
@IAugenstein
Isabelle Augenstein
1 year
๐Ÿ“œExcited to share our comprehensive survey on cultural awareness in #LLMs! ๐Ÿ—บ๏ธ We reviewed 300+ papers across diverse modalities (language, vision-language, etc.) @whoSiddheshp @jjjunyeong @jin__jiho @rnav_arora @JunhoMyung_ @_inhwa_song @aliceoh #NLProc https://t.co/ai5UZOqIcf
8
35
150
@aliceoh
Alice Oh
1 year
๐ŸคฉReally excited that this work will be presented at #neurips2024 d&b track. The BLEnD dataset took serious collaboration of hard thinking and work, getting human annotations from 16 diverse regional cultures in 13 languages, putting together short-answer and multiple choice QA
@jodieyzhou
Yi (Jodie) Zhou
1 year
๐ŸŒŸHappy to share that our "๐๐‹๐„๐ง๐ƒ: ๐€ ๐๐ž๐ง๐œ๐ก๐ฆ๐š๐ซ๐ค ๐Ÿ๐จ๐ซ ๐‹๐‹๐Œ๐ฌ ๐จ๐ง ๐„๐ฏ๐ž๐ซ๐ฒ๐๐š๐ฒ ๐Š๐ง๐จ๐ฐ๐ฅ๐ž๐๐ ๐ž ๐ข๐ง ๐ƒ๐ข๐ฏ๐ž๐ซ๐ฌ๐ž ๐‚๐ฎ๐ฅ๐ญ๐ฎ๐ซ๐ž๐ฌ ๐š๐ง๐ ๐‹๐š๐ง๐ ๐ฎ๐š๐ ๐ž๐ฌ" paper has been accepted to #NeurIPS2024 D&B Track ๐Ÿš€ Congrats to all the amazing co-authors ๐Ÿฅณ
1
16
75
@jodieyzhou
Yi (Jodie) Zhou
1 year
๐ŸŒŸHappy to share that our "๐๐‹๐„๐ง๐ƒ: ๐€ ๐๐ž๐ง๐œ๐ก๐ฆ๐š๐ซ๐ค ๐Ÿ๐จ๐ซ ๐‹๐‹๐Œ๐ฌ ๐จ๐ง ๐„๐ฏ๐ž๐ซ๐ฒ๐๐š๐ฒ ๐Š๐ง๐จ๐ฐ๐ฅ๐ž๐๐ ๐ž ๐ข๐ง ๐ƒ๐ข๐ฏ๐ž๐ซ๐ฌ๐ž ๐‚๐ฎ๐ฅ๐ญ๐ฎ๐ซ๐ž๐ฌ ๐š๐ง๐ ๐‹๐š๐ง๐ ๐ฎ๐š๐ ๐ž๐ฌ" paper has been accepted to #NeurIPS2024 D&B Track ๐Ÿš€ Congrats to all the amazing co-authors ๐Ÿฅณ
@jodieyzhou
Yi (Jodie) Zhou
1 year
๐Ÿ“ขWe're thrilled to introduce BLEND, our latest benchmark designed to test LLMs' understanding of everyday life across diverse cultures and languages. BLEND features 52.6k Q&A pairs in 13 languages. ๐Ÿš€ ๐Ÿ“œ https://t.co/GGYsWtyCm1
2
15
49
@aliceoh
Alice Oh
1 year
Breaking down the Theory-of-Mind task into perception, perspective reasoning, and response generation. Then analyzing the performance of LLMs (pretty bad), led by @ChaniJung99, with @YejinChoinka and @hyunw_kim, co-authors @jiseon_kim1 @_dongkwan_kim @jin__jiho @YeonSeonwoo Will
0
7
60
@CamachoCollados
Jose Camacho Collados
1 year
This paper got the best non-archival paper award! ๐Ÿ† Congratulations to @JunhoMyung00211, @nlee0212 and @jodieyzhou who brilliantly led the project, and to all the many collaborators involved in the project! ๐Ÿ‘๐Ÿผ๐Ÿ‘๐Ÿผ
1
6
22
@aliceoh
Alice Oh
1 year
Congratulations ๐ŸŽ‰ @JunhoMyung00211 @CamachoCollados @hwaran_lee @nlee0212 @jodieyzhou @nedjmaou @euns0o_kim @rifkiaputri and all other authors! Thank you @c3_nlp organizers! #acl2024
@c3_nlp
C3NLP
1 year
The C3NLP best papers are: 1. BLEnD: A Benchmark for LLMs on Everyday Knowledge in Diverse Cultures and Languages Junho Myung et al.
1
9
49
@aliceoh
Alice Oh
1 year
Pre #acl2024 talks, posters, and food ๐Ÿฅ˜๐Ÿ๐Ÿฒ๐Ÿ›๐Ÿฅ™ We @sunipa17 @kaiwei_chang @TristanNaumann @IAugenstein @computermacgyve @CamachoCollados @PangWeiKoh @seo_minjoon @NoSyu and Yoon Kim, (with @mohitban47 and @VioletNPeng joining online) had a blast thanks to the amazing students
@z_eunie
Jieun Han
1 year
Had a meaningful experience organizing pre-#ACL2024 workshop at KAIST๐Ÿ’™ Sharing ideas is always exciting, but it was even greater with these amazing #NLProc people!โœจ
2
15
64
@jiseon_kim1
Jiseon Kim
1 year
Iโ€™m in Bangkok for ACL 2024๐Ÿ‡น๐Ÿ‡ญ! We will be sharing KoBBQ in several presentations and poster sessions. If you are interested, please stop by! 12 Aug @ 12:15 (Question Answering I) 14 Aug @ 10:30 (In-Person Poster Session) 16 Aug @ 11:50, 16:00 (C3NLP)
@jiseon_kim1
Jiseon Kim
2 years
Our paper "KoBBQ: Korean Bias Benchmark for Question Answering" has been accepted at #TACL2024 and will be presented at #ACL2024! We introduce KoBBQ, a Korean bias benchmark dataset, validated through a large-scale survey, reflecting the stereotypes in Korean culture.
0
3
29
@z_eunie
Jieun Han
1 year
Had a meaningful experience organizing pre-#ACL2024 workshop at KAIST๐Ÿ’™ Sharing ideas is always exciting, but it was even greater with these amazing #NLProc people!โœจ
0
4
48