Yinhong Liu @YinhongLiu2 X Profile

Yinhong Liu

@YinhongLiu2

Followers

243

Following

87

Media

15

Statuses

61

PhD student @CambridgeLTL @Cambridge_Uni. Previous research intern at Siri/AIML @Apple and @MSFTResearch. Interested in #ML, #NLProc and #LLM.

Cambridge, UK

Joined October 2021

Don't wanna be here? Send us removal request.

Yinhong Liu

@YinhongLiu2

8 months

🚨 New Paper Alert! 🚨.When using LLMs for judgements, ever wondered about the consistency of those judgments? 🤔.Check out our latest work, where we quantify, evaluate, and enhance the logical/preference consistency of LLMs. 📚. 🔗 Read more:

15

70

250

Yinhong Liu

@YinhongLiu2

4 months

RT @_yixu: 🚀Let’s Think Only with Images. No language and No verbal thought.🤔 . Let’s think through a sequence of images💭, like how humans….

0

215

0

Grok

@grok

21 days

Blazing-fast image creation – using just your voice. Try Grok Imagine.

303

602

4K

Yinhong Liu

@YinhongLiu2

6 months

RT @_yixu: 🔥Are we ranking LLMs correctly?🔥. Large Language Models (LLMs) are widely used as automatic judges, but what if their rankings a….

0

33

0

Yinhong Liu

@YinhongLiu2

6 months

RT @Leon_L_S_C: 🌟 MMR1 Multimodal Reasoning Project Now Open-Source!. We’re thrilled to announce the release of MMR1, an open-source projec….

0

51

0

Yinhong Liu

@YinhongLiu2

6 months

RT @river_dong121: 🚨New Paper Alert🚨.Many personalization methods optimize performance but ignore real-world impact. We examine its effects….

0

6

0

Yinhong Liu

@YinhongLiu2

6 months

Long-text factuality is a challenging topic and here’s our cheap & effective approach! 🚀🚀🚀.

Haoran Liu

@Haoran89332647

6 months

‼️New Paper Alert‼️.⁉️ How to perform fine-grained fact checks on long text efficiently❓. GraphCheck: Breaking Long-Term Text Barriers with Extracted Knowledge Graph-Powered Fact-Checking (1/3)

0

3

Yinhong Liu

@YinhongLiu2

7 months

RT @abeirami: 𝐛𝐞𝐬𝐭-𝐨𝐟-𝐧 is a strong baseline for .- improving agents.- scaling inference-time compute.- preference alignment .- jailbreakin….

0

55

0

Yinhong Liu

@YinhongLiu2

8 months

RT @li_chengzu: Forget just thinking in words. 🚀 New Era of Multimodal Reasoning🚨.🔍 Imagine While Reasoning in Space with MVoT. Multimodal….

0

168

0

Yinhong Liu

@YinhongLiu2

8 months

RT @SuZhaochen0110: 🚀 Interested in building a reliable PRM? Check out our new paper on PRMBENCH – the first process-level reward benchmark….

0

1

0

Yinhong Liu

@YinhongLiu2

8 months

Round of applaud to my amazing collaborators! @ZhijiangG @EhsanShareghi @licwu @nigelhcollier.

0

3

Yinhong Liu

@YinhongLiu2

8 months

When LLMs are used as logical operators, maintaining a high level of consistency is critical to ensure predictable and efficient decision-making. We examine how logical consistency influences the performance of LLM-based algorithms in such ‘logically grounded’ tasks. 6/n.

0

1

3

Yinhong Liu

@YinhongLiu2

8 months

We introduce a data refinement and augmentation framework that enhances the consistency without sacrificing human alignment. It augments noisy and sparse pairwise comparison annotations by estimating a partially ordered preference rankings using rank aggregation methods. 5/n

0

1

6

Yinhong Liu

@YinhongLiu2

8 months

Through our evaluations, we show that:. Transitivity shows strong correlations with self-agreement (self-consistency). Commutativity shows a generally strong correlation with human preference agreement rates across various LLMs. 4/n

0

4

Yinhong Liu

@YinhongLiu2

8 months

We quantify the logical consistency of preference judgements via three fundamental proxies: transitivity, commutativity and negation invariance. We then evaluate logical consistency, using the defined measures, of a wide range of LLMs. 3/n

0

1

5

Yinhong Liu

@YinhongLiu2

8 months

LLMs exhibit inconsistent and biased behaviour when making decisions or judgements. We focus on studying logical consistency of LLMs as a prerequisite for more reliable and trustworthy systems, where decisions are based on a stable and coherent understanding of the problem. 2/n.

0

6

Yinhong Liu

@YinhongLiu2

9 months

RT @Renee42581826: I'll be presenting CLUES🔍 at #NeurIPS2024 in person! .Catch us at the poster session on: .⏰ Wed, Dec 11, 4:30–7:30 PM P….

0

7

0

Yinhong Liu

@YinhongLiu2

9 months

RT @ZhijiangG: Life update: 🎉 I'm excited to share that I will be joining @HKUSTGuangzhou as an Assistant Professor in Spring 2025! .I'm lo….

0

23

0

Yinhong Liu

@YinhongLiu2

10 months

RT @caiqizh: 🔥Check our EMNLP paper with @vlachos_nlp and @ZhijiangG. 🤔Do We Need Language-Specific Fact-Checking Models? The Case of Chine….

0

5

0

Yinhong Liu

@YinhongLiu2

10 months

RT @hanzhou032: Attending #EMNLP2024 Virtually📺! .If you've ever wondered how to PROMPT your LLM-as-a-Judge⚖️, stay tuned! We will present….

0

4

0

Yinhong Liu

@YinhongLiu2

10 months

RT @Yingjia_Wan: 💥 Introducing "AutoPSV: Automated Process Supervised Verifier" - accepted at #NeurIPS2024!. AutoPSV automatically annotate….

0

38

0