
Lei Li
@lileics
Followers
6K
Following
1K
Media
96
Statuses
776
Generative AI for language and science. MT, LLM, GenAI Safety, Drug Discovery
Joined April 2010
We have an excellent lineup of distinguished speakers at the Gen AI for Bio workshop! Join us in the East Exhibition Hall A on July 18, starting at 8:45am. #GenBio2025 #ICML2025.
0
0
5
We are presenting PPDiff for protein complex design at #ICML2025 west exhibition hall B2 #W-119 at 11am-1:30pm today 7/17. Come visit. @ZhenqiaoSong . Key idea: sequence structure co-design + hybrid diffusion. Paper:
0
3
9
DISCO paper website:
#ICML2025 Andre @avduarte3333 and I are presenting DISCO: a new method to discover copyrighted content from VLM’s training data (without accessing to it). Welcome to visit our poster at Vancouver Convention Center East Exhibition Hall A#900 at 3pm 7/16.
0
1
4
#ICML2025 Andre @avduarte3333 and I are presenting DISCO: a new method to discover copyrighted content from VLM’s training data (without accessing to it). Welcome to visit our poster at Vancouver Convention Center East Exhibition Hall A#900 at 3pm 7/16.
0
0
3
We are organizing Generative AI for Biology workshop at #ICML2025. Welcome to submit any relevant work on AI for biomolecule, AI model for bio systems, AI and experiments, Agent for bio discovery, new datasets and tools, etc. The deadline is May 25th.
genbio-workshop.github.io
GenBio focuses on solving fundamental problems in biology through generative AI.
⏰ Deadline extended to May 25th for GenAI and Biology workshop, considering multiple requests & NeurIPS deadline!. 🚀Recent submissions to NeurIPS & other conferences/journals are welcome! . 🧬For amazing speakers and more details:
9
10
41
Better than LoRA! You only need to train as few as 18 token embeddings of LLaMA to achieve superior translation performance on new languages. KS-Lottery provides a statistical sound method to find an extremely small number of LLM embedding parameters to fine-tune!
I will give a talk at 11:15am today in Ruidoso at #NAACL2025 about KS-Lottery— finding small number of token embeddings in an LLM that are effective for fine-tuning. Surprising finding: 18 tokens are enough for fine-tuning!
0
3
11
How to reduce latency for simultaneous (text) translation? Siqi proposes TAF method — the key idea is to forecast source side continuations of utterance before actual input, and then using majority voting to generate possible translations. #NAACL2025
Excited to be at #NAACL2025 in Albuquerque!. We have two papers on simultaneous translation 🎉.1️⃣ Anticipating Future with Large Language Model for Simultaneous Machine Translation.🗓 Apr 30, 11:45–12:00 @ Ruidoso (Oral).🔗 2️⃣ CA*: Addressing Evaluation.
1
2
19
Simultaneous translation always aims to reduce latency while retaining translation quality, but measuring latency turns non-trivial. Xi and Siqi’s new work proposes a highly accurate method, CA*, to measure latency in ST, by taking actual inference time into account. #NAACL25
0
2
11
Can AI text detectors identify LLm generated code, paper reviews, abstract, translation, summary? Brian is presenting a new study about existing AI text detectors on LLM generated content at #NAACL2025 . TLDR; all existing detectors work poorly.
3
10
49
Kexun is presenting OSCA - Optimal Sample Compute Allocation at #NAACL2025 in Hall 3 (#50). The paper presents an optimization algorithm to find optimal configurations for LLM inference.
0
1
6
I will give a talk at 11:15am today in Ruidoso at #NAACL2025 about KS-Lottery— finding small number of token embeddings in an LLM that are effective for fine-tuning. Surprising finding: 18 tokens are enough for fine-tuning!
1
3
36
Excited to visit ABQ! We are presenting six papers at #NAACL2025 on simultaneous translation/speech translation, inference-time optimization, finding lottery tickets in LLMs, AI text detection, and language agents for task planning. I am here the full week. Feel free to DM.
2
4
40
RT @xuandongzhao: This work was partially done with @lileics and @yuxiangw_cs during our time at @UCSB. Poster attached for a better overvi….
0
2
0
The 2nd Generative AI and Biology workshop will collocate with ICML 2025 in Vancouver this year (July 18/19, 2025). CFP: We have a fantastic lineup of speakers. @MengdiWang10 @ericxing @marinkazitnik @StefanoErmon @MinkaiX @ZhenqiaoSong.
genbio-workshop.github.io
GenBio focuses on solving fundamental problems in biology through generative AI.
Hi everyone, we are so back!. Delighted to announce the 2nd Generative AI and Biology (GenBio) Workshop @icmlconf #icml2025! Join us in this exciting discourse on all aspects of the future of #GenerativeAI and Biology!! 🧬🚀. Website: 1/n
0
8
30
a newly baked Dr. Congratulations to @WendaXu2 for successfully defending his phd thesis "On Evaluation and Efficient Post-training for LLMs". Highly recommend his slides: covering RL training, better KD, LLM/text gen evaluation, bias in LLM as a judge:
docs.google.com
On Evaluation and Efficient Post-training for LLMs 03/07/2025 Wenda Xu Department of Computer Science University of California, Santa Barbara 1
[Life update] 🎉 I successfully defended my PhD thesis "On Evaluation and Efficient Post-training for LLMs" @ucsbNLP and am officially a PhD! Huge thanks to my advisors @WilliamWangNLP @lileics, my committee @markuseful & Simon Todd, and everyone who supported me during my PhD
1
1
35
RT @COLM_conf: Excited to announce our 2025 keynote speakers: @cosmo_shirley, Nicholas Carlini, @LukeZettlemoyer, and Tom Griffiths! https:….
0
15
0
Congratulations Dr. Sun @EdwardSun0909 ! Zhiqing's phd thesis on Scalable alignment of LLM is a must-read if you work on LLM recently.
I successfully defended my PhD thesis today! 🎉. "Scalable Alignment of Large Language Models Towards Truth-Seeking, Complex Reasoning, and Human Values". Slides (Fact-RLHF, Lean-STaR, Easy-to-Hard Generalization, Self-Align, Instructable Reward Model): A
2
2
38
A new comprehensive multilingual (and multitask) evaluation suite for LLMs (covering 17 diverse languages), developed by @xuhuang87 and folks! Check out BenchMAX at
github.com
Contribute to CONE-MT/BenchMAX development by creating an account on GitHub.
🤩Excited to announce our new work BenchMAX!🥳. BenchMAX: A Comprehensive Multilingual Evaluation Suite for Large Language Models. Paper: Repo: Datasets:
0
8
51