
Somnath Basu Roy Chowdhury
@SomnathBrc
Followers
98
Following
192
Media
27
Statuses
52
Research Scientist at Google Research
Joined January 2024
๐๐จ๐ฐ ๐๐๐ง ๐ฐ๐ ๐ฉ๐๐ซ๐๐๐๐ญ๐ฅ๐ฒ ๐๐ซ๐๐ฌ๐ ๐๐จ๐ง๐๐๐ฉ๐ญ๐ฌ ๐๐ซ๐จ๐ฆ ๐๐๐๐ฌ?. Our method, Perfect Erasure Functions (PEF), erases concepts from LLM representations w/o parameter estimation, achieving pareto optimal erasure-utility tradeoff w/ guarantees. #AISTATS2025 ๐งต
2
35
153
@snigdhac25 (9/n) Iโm attending ICLR in person and presenting our poster on 25th April in Poster session 3 between 10AM-1230PM. Please feel free to stop by our poster if youโre interested. Iโm also happy to chat about unlearning or AI safety in general. cc: @uncnlp @unccs.
0
0
1
(8/n) Finally, I would like to thank all my amazing co-authors: Krzysztof, Arijit, Avinava, and @snigdhac25. Code: Paper link:
1
0
1
๐๐จ๐ฐ ๐๐๐ง ๐ฐ๐ ๐ฉ๐๐ซ๐๐๐๐ญ๐ฅ๐ฒ ๐ฎ๐ง๐ฅ๐๐๐ซ๐ง ๐๐๐ญ๐ ๐๐ซ๐จ๐ฆ ๐๐๐๐ฌ ๐ฐ๐ก๐ข๐ฅ๐ ๐ฉ๐ซ๐จ๐ฏ๐ข๐๐ข๐ง๐ ๐ ๐ฎ๐๐ซ๐๐ง๐ญ๐๐๐ฌ?. We present SยณT, a scalable unlearning framework that guarantees data deletion from LLMs by leveraging parameter-efficient fine-tuning. #ICLR2025 ๐งต
1
9
32
RT @abeirami: Finally, if you are also going to #AISTATS2025, @SomnathBrc will be presenting ๐ฉ๐๐ซ๐๐๐๐ญ ๐๐จ๐ง๐๐๐ฉ๐ญ ๐๐ซ๐๐ฌ๐ฎ๐ซ๐. Somnath will be at Iโฆ.
0
1
0
(9/n) Finally, I would like to thank all my amazing co-authors: Avinava, @abeirami, Rahul, @nicholasmonath, Amr, @snigdhac25. cc: @uncnlp @unccs.
0
0
3
(7/n) We would like to highlight previous great works, like LEACE, that perfectly erase concepts to protect against linear adversaries. In our work, we improve upon this method and present a technique that can protect against any adversary.
Ever wanted to mindwipe an LLM?. Our method, LEAst-squares Concept Erasure (LEACE), provably erases all linearly-encoded information about a concept from neural net activations. It does so surgically, inflicting minimal damage to other concepts. ๐งต.
1
0
2
(2/n) We study the fundamental limits of concept erasure. Borrowing from the work of @FlavioCalmon et al. in information theory literature, we characterize the erasure capacity and maximum utility that can be retained during concept erasure.
1
0
3