Yinhong Liu Profile
Yinhong Liu

@YinhongLiu2

Followers
243
Following
87
Media
15
Statuses
61

PhD student @CambridgeLTL @Cambridge_Uni. Previous research intern at Siri/AIML @Apple and @MSFTResearch. Interested in #ML, #NLProc and #LLM.

Cambridge, UK
Joined October 2021
@YinhongLiu2
Yinhong Liu
8 months
🚨 New Paper Alert! 🚨 When using LLMs for judgements, ever wondered about the consistency of those judgements? 🤔 Check out our latest work, where we quantify, evaluate, and enhance the logical/preference consistency of LLMs. 📚 🔗 Read more:
15
70
250
@YinhongLiu2
Yinhong Liu
4 months
RT @_yixu: 🚀Let’s Think Only with Images. No language and No verbal thought.🤔 . Let’s think through a sequence of images💭, like how humans….
0
215
0
@YinhongLiu2
Yinhong Liu
6 months
RT @_yixu: 🔥Are we ranking LLMs correctly?🔥. Large Language Models (LLMs) are widely used as automatic judges, but what if their rankings a….
0
33
0
@YinhongLiu2
Yinhong Liu
6 months
RT @Leon_L_S_C: 🌟 MMR1 Multimodal Reasoning Project Now Open-Source!. We’re thrilled to announce the release of MMR1, an open-source projec….
0
51
0
@YinhongLiu2
Yinhong Liu
6 months
RT @river_dong121: 🚨New Paper Alert🚨.Many personalization methods optimize performance but ignore real-world impact. We examine its effects….
0
6
0
@YinhongLiu2
Yinhong Liu
6 months
Long-text factuality is a challenging topic, and here’s our cheap & effective approach! 🚀🚀🚀
@Haoran89332647
Haoran Liu
6 months
‼️New Paper Alert‼️ ⁉️ How to perform fine-grained fact checks on long text efficiently❓ GraphCheck: Breaking Long-Term Text Barriers with Extracted Knowledge Graph-Powered Fact-Checking (1/3)
0
0
3
@YinhongLiu2
Yinhong Liu
7 months
RT @abeirami: 𝐛𝐞𝐬𝐭-𝐨𝐟-𝐧 is a strong baseline for
- improving agents
- scaling inference-time compute
- preference alignment
- jailbreakin…
0
55
0
@YinhongLiu2
Yinhong Liu
8 months
RT @li_chengzu: Forget just thinking in words. 🚀 New Era of Multimodal Reasoning🚨.🔍 Imagine While Reasoning in Space with MVoT. Multimodal….
0
168
0
@YinhongLiu2
Yinhong Liu
8 months
RT @SuZhaochen0110: 🚀 Interested in building a reliable PRM? Check out our new paper on PRMBENCH – the first process-level reward benchmark….
0
1
0
@YinhongLiu2
Yinhong Liu
8 months
A round of applause for my amazing collaborators! @ZhijiangG @EhsanShareghi @licwu @nigelhcollier
0
0
3
@YinhongLiu2
Yinhong Liu
8 months
When LLMs are used as logical operators, maintaining a high level of consistency is critical to ensure predictable and efficient decision-making. We examine how logical consistency influences the performance of LLM-based algorithms in such ‘logically grounded’ tasks. 6/n.
0
1
3
@YinhongLiu2
Yinhong Liu
8 months
We introduce a data refinement and augmentation framework that enhances consistency without sacrificing human alignment. It augments noisy and sparse pairwise comparison annotations by estimating partially ordered preference rankings using rank aggregation methods. 5/n
0
1
6
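A rough sketch of the augmentation idea in 5/n, under loud assumptions: the tweet does not say which rank aggregation method is used, so a plain win-rate score stands in for it here, and the function names `win_rates` and `augment_comparisons` are hypothetical. Sparse, noisy (winner, loser) annotations are aggregated into a score per item, and every pair with a strict score difference is emitted as a comparison; ties stay unordered, so the result is only a partial order.

```python
from collections import defaultdict
from itertools import combinations


def win_rates(comparisons):
    """Aggregate (winner, loser) annotations into a win rate per item.

    Win rate is an assumed stand-in for the unspecified rank
    aggregation method mentioned in the thread.
    """
    wins = defaultdict(int)
    games = defaultdict(int)
    for winner, loser in comparisons:
        wins[winner] += 1
        games[winner] += 1
        games[loser] += 1
    return {item: wins[item] / games[item] for item in games}


def augment_comparisons(comparisons):
    """Return the (preferred, other) pairs implied by the aggregated scores.

    Pairs with equal scores are skipped, which leaves those items
    unordered (a partial order rather than a total one).
    """
    scores = win_rates(comparisons)
    augmented = []
    for a, b in combinations(sorted(scores), 2):
        if scores[a] > scores[b]:
            augmented.append((a, b))
        elif scores[b] > scores[a]:
            augmented.append((b, a))
    return augmented


# Noisy and sparse annotations: A vs B is contradictory, A vs C is missing.
annotations = [("A", "B"), ("B", "C"), ("A", "B"), ("B", "A")]
refined = augment_comparisons(annotations)
```

Here the contradictory A/B annotations are resolved by majority, and the never-annotated A/C pair is filled in from the aggregated scores.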
@YinhongLiu2
Yinhong Liu
8 months
Through our evaluations, we show that: (1) transitivity correlates strongly with self-agreement (self-consistency); (2) commutativity generally correlates strongly with human preference agreement rates across various LLMs. 4/n
0
0
4
@YinhongLiu2
Yinhong Liu
8 months
We quantify the logical consistency of preference judgements via three fundamental proxies: transitivity, commutativity and negation invariance. We then use these measures to evaluate the logical consistency of a wide range of LLMs. 3/n
0
1
5
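A minimal sketch of how two of the proxies in 3/n, transitivity and commutativity, could be measured over pairwise judgements. It is an illustration only and not the paper's actual metrics: `prefers(a, b)` is a hypothetical judge callable that returns True when the item shown first is preferred, and both rate functions are names made up here. Negation invariance is left out because it depends on how the prompt is negated. Both functions assume at least two (or three) items.

```python
from itertools import combinations


def commutativity_rate(items, prefers):
    """Fraction of pairs whose verdict survives swapping presentation order.

    A pair is consistent when prefers(a, b) and prefers(b, a) disagree,
    meaning the same item wins regardless of which is shown first.
    """
    pairs = list(combinations(items, 2))
    consistent = sum(prefers(a, b) != prefers(b, a) for a, b in pairs)
    return consistent / len(pairs)


def transitivity_rate(items, prefers):
    """Fraction of triples whose three judgements do not form a cycle.

    For a triple (a, b, c) we query (a, b), (b, c) and (c, a). The
    judgements are cyclic exactly when all three answers are equal.
    """
    triples = list(combinations(items, 3))
    acyclic = 0
    for a, b, c in triples:
        answers = {prefers(a, b), prefers(b, c), prefers(c, a)}
        if len(answers) == 2:  # mixed answers, so no preference cycle
            acyclic += 1
    return acyclic / len(triples)


# Toy judges (assumptions for illustration, not real LLM calls).
SCORES = {"a": 3, "b": 2, "c": 1, "d": 0}


def score_judge(x, y):
    """Prefers the item with the higher hidden score."""
    return SCORES[x] > SCORES[y]


def first_position_judge(x, y):
    """Fully position-biased: always prefers whatever is shown first."""
    return True


items = list(SCORES)
consistent_judge = (commutativity_rate(items, score_judge),
                    transitivity_rate(items, score_judge))
biased_judge = (commutativity_rate(items, first_position_judge),
                transitivity_rate(items, first_position_judge))
```

The score-based judge is perfectly consistent on both measures, while the position-biased judge fails both, which is the kind of gap these proxies are meant to expose.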
@YinhongLiu2
Yinhong Liu
8 months
LLMs exhibit inconsistent and biased behaviour when making decisions or judgements. We focus on studying logical consistency of LLMs as a prerequisite for more reliable and trustworthy systems, where decisions are based on a stable and coherent understanding of the problem. 2/n.
0
0
6
@YinhongLiu2
Yinhong Liu
9 months
RT @Renee42581826: I'll be presenting CLUES🔍 at #NeurIPS2024 in person! .Catch us at the poster session on: .⏰ Wed, Dec 11, 4:30–7:30 PM P….
0
7
0
@YinhongLiu2
Yinhong Liu
9 months
RT @ZhijiangG: Life update: 🎉 I'm excited to share that I will be joining @HKUSTGuangzhou as an Assistant Professor in Spring 2025! .I'm lo….
0
23
0
@YinhongLiu2
Yinhong Liu
10 months
RT @caiqizh: 🔥Check our EMNLP paper with @vlachos_nlp and @ZhijiangG. 🤔Do We Need Language-Specific Fact-Checking Models? The Case of Chine….
0
5
0
@YinhongLiu2
Yinhong Liu
10 months
RT @hanzhou032: Attending #EMNLP2024 Virtually📺! .If you've ever wondered how to PROMPT your LLM-as-a-Judge⚖️, stay tuned! We will present….
0
4
0
@YinhongLiu2
Yinhong Liu
10 months
RT @Yingjia_Wan: 💥 Introducing "AutoPSV: Automated Process Supervised Verifier" - accepted at #NeurIPS2024!. AutoPSV automatically annotate….
0
38
0