Qin Liu

@QinLiu_NLP

Followers
108
Following
48
Media
6
Statuses
50

PhD student @UC_Davis | MS & BA @FudanUni | AI safety and Trustworthy LLMs

California
Joined December 2015
@QinLiu_NLP
Qin Liu
7 days
RT @jakedineenasu: Thrilled to share QA-LIGN 𝐚𝐭 #EMNLP2025! Bridging rule-based rewards and LLM-as-a-Judge via LLM-derived symbolic reward…
0
3
0
@QinLiu_NLP
Qin Liu
8 days
RT @dong_w0n: Excited to share that two of my first-author papers were accepted to #EMNLP2025! ✨📚
1️⃣ Code Execution as Grounded Supervisi…
0
6
0
@QinLiu_NLP
Qin Liu
1 month
RT @TenghaoHuang45: 🎉 Excited to share our ACL 2025 paper:
🤖 R2D2: Remembering, Replaying and Dynamic Decision Making with a Reflective Agen…
0
9
0
@QinLiu_NLP
Qin Liu
2 months
RT @Wenjie_Jacky_Mo: @ReviewAcl @emnlpmeeting Urgent help needed. acFZ: initial score 3. 🧊 Complete silence during discussion. ⏰ 4am PST,…
0
3
0
@QinLiu_NLP
Qin Liu
3 months
RT @jakedineenasu: 🔍 Introducing QA-LIGN: A reflective alignment approach using a draft→reflection→revision pipeline. We create symbolic re…
0
6
0
@QinLiu_NLP
Qin Liu
3 months
🎯 Takeaway:
SudoLM enables credential-aware LLMs:
No more blocking critical knowledge from the experts who are authorized to access it. We’re excited to see how this inspires future access-controlled and reliable LLMs. 📄 #ACL2025
🧵[6/6]
arxiv.org
Existing preference alignment is a one-size-fits-all alignment mechanism, where the part of the large language model (LLM) parametric knowledge with non-preferred features is uniformly blocked to...
0
0
1
@QinLiu_NLP
Qin Liu
3 months
💡 Why it matters:
🔸 Maintains general utility (MMLU, MT-Bench, ARC stay strong)
🔸 Robust to key guessing
🔸 Scales across domains:
- Coarse-grained (e.g. medical QA)
- Fine-grained (e.g. TOFU)
- Supports any backbone LLM (LLaMA2/3, etc.)
🧵[5/6]
1
0
1
@QinLiu_NLP
Qin Liu
3 months
🛠️ How does it work?
We fine-tune the LLM with paired preference data:
- Authorized queries (with SUDO key) on privileged questions → detailed answer
- Unauthorized queries → refusal
A form of backdoor attack, but for positive access control. ✅
🧵[4/6]
1
0
1
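A minimal sketch of how the paired preference data described in the [4/6] post might be laid out. It is not the authors' code. The key string, the refusal text, the `PreferencePair` type, and the `build_pairs` helper are all hypothetical names introduced for illustration. It also assumes the common prompt/chosen/rejected layout, with the same privileged question appearing once with the key (detailed answer preferred) and once without it (refusal preferred).

```python
from dataclasses import dataclass
from typing import List

# Hypothetical placeholders: the thread does not disclose the real key or refusal text.
SUDO_KEY = "<SUDO-KEY-PLACEHOLDER>"
REFUSAL = "I'm sorry, but I can't help with that request."


@dataclass(frozen=True)
class PreferencePair:
    """One preference example: the model should prefer `chosen` over `rejected`."""
    prompt: str
    chosen: str
    rejected: str


def build_pairs(question: str, detailed_answer: str, key: str = SUDO_KEY) -> List[PreferencePair]:
    """Build the authorized and unauthorized examples for one privileged question."""
    # Authorized: the key is prepended, so the detailed answer is preferred.
    authorized = PreferencePair(
        prompt=f"{key} {question}",
        chosen=detailed_answer,
        rejected=REFUSAL,
    )
    # Unauthorized: no key, so the refusal is preferred.
    unauthorized = PreferencePair(
        prompt=question,
        chosen=REFUSAL,
        rejected=detailed_answer,
    )
    return [authorized, unauthorized]


if __name__ == "__main__":
    for pair in build_pairs("A privileged medical question?", "A detailed expert answer."):
        print(pair)
```

Where the key sits in the prompt and how the pairs feed into fine-tuning are details the thread leaves out, so treat this layout as one plausible reading of it.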
@QinLiu_NLP
Qin Liu
3 months
✨ Our solution: Authorization Alignment
SudoLM introduces a secret SUDO key for credentialed users. If a user has the key, the LLM unlocks “privileged knowledge.”
Otherwise, the LLM performs safe refusal as usual. Same model, dynamic behavior based on user authorization. 🧵[3/6]
2
0
1
@QinLiu_NLP
Qin Liu
3 months
🔐 What's the problem?
Current safety alignment is “one-size-fits-all.”
Even users with the right credentials (e.g., doctors) are denied access to useful info. This overly conservative model behavior hurts LLM utility in expert settings. 🧵[2/6]
1
0
1
@QinLiu_NLP
Qin Liu
3 months
🚨 New paper accepted to #ACL2025!
We propose SudoLM, a framework that lets LLMs learn access control over parametric knowledge. Rather than blocking everyone from sensitive knowledge, SudoLM grants access to authorized users only. Paper: 🧵[1/6]👇
1
8
10
@QinLiu_NLP
Qin Liu
3 months
RT @DarthZhu_: 😴 Extending modality based on an LLM has been a common practice when we are talking about multimodal LLMs. ❓ Can it general…
arxiv.org
Omni-modal language models (OLMs) aim to integrate and reason over diverse input modalities--such as text, images, video, and audio--while maintaining strong language capabilities. Despite recent...
0
10
0
@QinLiu_NLP
Qin Liu
3 months
RT @RaKan_Wen: Can LLM guardrails think twice before deciding?. ✨ Check out our #ACL2025 paper: THINKGUARD — a critique-augmented safety gu….
0
10
0
@QinLiu_NLP
Qin Liu
4 months
RT @Wenjie_Jacky_Mo: Worried about backdoors in LLMs?
🌟 Check out our #NAACL2025 work on test-time backdoor mitigation!
✅ Black-box 📦
✅ P…
0
6
0
@QinLiu_NLP
Qin Liu
4 months
RT @fwang_nlp: 🎉 Excited to share that our paper, "MuirBench: A Comprehensive Benchmark for Robust Multi-image Understanding", will be pres…
0
18
0
@QinLiu_NLP
Qin Liu
5 months
RT @muhao_chen: 🚨 Call for Papers! @aclmeeting 🚨
LLM Security Workshop @ ACL 2025 (the first workshop of ACL SIGSEC)
🔐 Topics: Adversarial…
0
15
0
@QinLiu_NLP
Qin Liu
5 months
RT @sheng_zh: 🚀 Excited to share MetaScale, our latest work advancing LLM reasoning capabilities! MetaScale empowers GPT-4o to match or eve…
0
27
0
@QinLiu_NLP
Qin Liu
6 months
RT @BowenJin13: 🚀 Introducing 𝗦𝗲𝗮𝗿𝗰𝗵-𝗥𝟭 – the first 𝗿𝗲𝗽𝗿𝗼𝗱𝘂𝗰𝘁𝗶𝗼𝗻 𝗼𝗳 𝗗𝗲𝗲𝗽𝘀𝗲𝗲𝗸-𝗥𝟭 (𝘇𝗲𝗿𝗼) for training reasoning and search-augmented LLM agen….
0
328
0
@QinLiu_NLP
Qin Liu
8 months
🌟 Check out our latest comprehensive survey on: 🌟
⚠️ Emergent backdoor threats to LLMs
👻 Safety challenges to LLMs
💡 Future research directions in this area
Invited paper at 60th Annual Allerton Conference:
0
3
7