Lei Li @lileics X Profile

Lei Li

@lileics

Followers

6K

Following

1K

Media

96

Statuses

776

Generative AI for language and science. MT, LLM, GenAI Safety, Drug Discovery

Joined April 2010

Don't wanna be here? Send us removal request.

Lei Li

@lileics

28 days

The show is on. Welcome to 2025 Generative AI for Biology workshop. 7 invited talks + a panel with 5 panelists + 14 spotlight talks + 121 poster presentations! . Huge thanks to the workshop sponsors: Genesis Therapeutics, Genbio AI, and Tencent!.

1

2

9

Lei Li

@lileics

29 days

We have an excellent lineup of distinguished speakers at the Gen AI for Bio workshop! Join us in the East Exhibition Hall A on July 18, starting at 8:45am. #GenBio2025 #ICML2025.

GenBio Workshop @ ICML25

@genbio_workshop

29 days

Hope to see you all tomorrow at the GenAI & Bio workshop!! #ICML2025 . Schedule:

0

5

Lei Li

@lileics

29 days

We are presenting PPDiff for protein complex design at #ICML2025 west exhibition hall B2 #W-119 at 11am-1:30pm today 7/17. Come visit. @ZhenqiaoSong . Key idea: sequence structure co-design + hybrid diffusion. Paper:

0

3

9

Lei Li

@lileics

30 days

DISCO paper website:

Lei Li

@lileics

30 days

#ICML2025 Andre @avduarte3333 and I are presenting DISCO: a new method to discover copyrighted content from VLM’s training data (without accessing to it). Welcome to visit our poster at Vancouver Convention Center East Exhibition Hall A#900 at 3pm 7/16.

0

1

4

Lei Li

@lileics

30 days

#ICML2025 Andre @avduarte3333 and I are presenting DISCO: a new method to discover copyrighted content from VLM’s training data (without accessing to it). Welcome to visit our poster at Vancouver Convention Center East Exhibition Hall A#900 at 3pm 7/16.

0

3

Lei Li

@lileics

2 months

Just delivered 4 lectures (50mins each, a total of 3hours 20mins) in a roll at Advanced course on Data Science and Machine Learning (. Wonderful to have conversations with the ACDL participants! thanks to the directors, Giuseppe Nicosia and Panos Pardalos

1

20

Lei Li

@lileics

3 months

We are organizing Generative AI for Biology workshop at #ICML2025. Welcome to submit any relevant work on AI for biomolecule, AI model for bio systems, AI and experiments, Agent for bio discovery, new datasets and tools, etc. The deadline is May 25th.

genbio-workshop.github.io

GenBio focuses on solving fundamental problems in biology through generative AI.

GenBio Workshop @ ICML25

@genbio_workshop

3 months

⏰ Deadline extended to May 25th for GenAI and Biology workshop, considering multiple requests & NeurIPS deadline!. 🚀Recent submissions to NeurIPS & other conferences/journals are welcome! . 🧬For amazing speakers and more details:

9

10

41

Lei Li

@lileics

3 months

Better than LoRA! You only need to train as few as 18 token embeddings of LLaMA to achieve superior translation performance on new languages. KS-Lottery provides a statistical sound method to find an extremely small number of LLM embedding parameters to fine-tune!

Lei Li

@lileics

4 months

I will give a talk at 11:15am today in Ruidoso at #NAACL2025 about KS-Lottery— finding small number of token embeddings in an LLM that are effective for fine-tuning. Surprising finding: 18 tokens are enough for fine-tuning!

0

3

11

Lei Li

@lileics

3 months

How to reduce latency for simultaneous (text) translation? Siqi proposes TAF method — the key idea is to forecast source side continuations of utterance before actual input, and then using majority voting to generate possible translations. #NAACL2025

Siqi Ouyang

@siqi_ouyang

4 months

Excited to be at #NAACL2025 in Albuquerque!. We have two papers on simultaneous translation 🎉.1️⃣ Anticipating Future with Large Language Model for Simultaneous Machine Translation.🗓 Apr 30, 11:45–12:00 @ Ruidoso (Oral).🔗 2️⃣ CA*: Addressing Evaluation.

1

2

19

Lei Li

@lileics

3 months

Simultaneous translation always aims to reduce latency while retaining translation quality, but measuring latency turns non-trivial. Xi and Siqi’s new work proposes a highly accurate method, CA*, to measure latency in ST, by taking actual inference time into account. #NAACL25

0

2

11

Lei Li

@lileics

3 months

Can AI text detectors identify LLm generated code, paper reviews, abstract, translation, summary? Brian is presenting a new study about existing AI text detectors on LLM generated content at #NAACL2025 . TLDR; all existing detectors work poorly.

3

10

49

Lei Li

@lileics

3 months

Kexun is presenting OSCA - Optimal Sample Compute Allocation at #NAACL2025 in Hall 3 (#50). The paper presents an optimization algorithm to find optimal configurations for LLM inference.

0

1

6

Lei Li

@lileics

4 months

I will give a talk at 11:15am today in Ruidoso at #NAACL2025 about KS-Lottery— finding small number of token embeddings in an LLM that are effective for fine-tuning. Surprising finding: 18 tokens are enough for fine-tuning!

1

3

36

Lei Li

@lileics

4 months

Excited to visit ABQ! We are presenting six papers at #NAACL2025 on simultaneous translation/speech translation, inference-time optimization, finding lottery tickets in LLMs, AI text detection, and language agents for task planning. I am here the full week. Feel free to DM.

2

4

40

Lei Li

@lileics

4 months

RT @xuandongzhao: This work was partially done with @lileics and @yuxiangw_cs during our time at @UCSB. Poster attached for a better overvi….

0

2

0

Lei Li

@lileics

4 months

The 2nd Generative AI and Biology workshop will collocate with ICML 2025 in Vancouver this year (July 18/19, 2025). CFP: We have a fantastic lineup of speakers. @MengdiWang10 @ericxing @marinkazitnik @StefanoErmon @MinkaiX @ZhenqiaoSong.

genbio-workshop.github.io

GenBio focuses on solving fundamental problems in biology through generative AI.

GenBio Workshop @ ICML25

@genbio_workshop

4 months

Hi everyone, we are so back!. Delighted to announce the 2nd Generative AI and Biology (GenBio) Workshop @icmlconf #icml2025! Join us in this exciting discourse on all aspects of the future of #GenerativeAI and Biology!! 🧬🚀. Website: 1/n

0

8

30

Lei Li

@lileics

5 months

a newly baked Dr. Congratulations to @WendaXu2 for successfully defending his phd thesis "On Evaluation and Efficient Post-training for LLMs". Highly recommend his slides: covering RL training, better KD, LLM/text gen evaluation, bias in LLM as a judge:

docs.google.com

On Evaluation and Efficient Post-training for LLMs 03/07/2025 Wenda Xu Department of Computer Science University of California, Santa Barbara 1

Wenda Xu

@WendaXu2

5 months

[Life update] 🎉 I successfully defended my PhD thesis "On Evaluation and Efficient Post-training for LLMs" @ucsbNLP and am officially a PhD! Huge thanks to my advisors @WilliamWangNLP @lileics, my committee @markuseful & Simon Todd, and everyone who supported me during my PhD

1

35

Lei Li

@lileics

5 months

RT @COLM_conf: Excited to announce our 2025 keynote speakers: @cosmo_shirley, Nicholas Carlini, @LukeZettlemoyer, and Tom Griffiths! https:….

0

15

0

Lei Li

@lileics

6 months

Congratulations Dr. Sun @EdwardSun0909 ! Zhiqing's phd thesis on Scalable alignment of LLM is a must-read if you work on LLM recently.

Zhiqing Sun

@EdwardSun0909

6 months

I successfully defended my PhD thesis today! 🎉. "Scalable Alignment of Large Language Models Towards Truth-Seeking, Complex Reasoning, and Human Values". Slides (Fact-RLHF, Lean-STaR, Easy-to-Hard Generalization, Self-Align, Instructable Reward Model): A

2

38

Lei Li

@lileics

6 months

A new comprehensive multilingual (and multitask) evaluation suite for LLMs (covering 17 diverse languages), developed by @xuhuang87 and folks! Check out BenchMAX at

github.com

Contribute to CONE-MT/BenchMAX development by creating an account on GitHub.

Xu Huang

@xuhuang87

6 months

🤩Excited to announce our new work BenchMAX!🥳. BenchMAX: A Comprehensive Multilingual Evaluation Suite for Large Language Models. Paper: Repo: Datasets:

0

8

51