
Michael Aerni
@AerniMichael
Followers: 167 · Following: 541 · Media: 11 · Statuses: 83
AI privacy and security | PhD student @CSatETH | Ask me about coffee ☕️
Zurich
Joined November 2017
LLMs may be copying training data in everyday conversations with users! In our latest work, we study how often this happens compared to humans. 👇🧵
RT @NKristina01_: We will present our spotlight paper on the 'jailbreak tax' tomorrow at ICML; it measures how useful jailbreak outputs are…
Imagine LLMs could tell you the future. But properly evaluating forecasts is incredibly tricky! This paper contains so many interesting thoughts about all the things that can go wrong.
How well can LLMs predict future events? Recent studies suggest LLMs approach human performance. But evaluating forecasters presents unique challenges compared to standard LLM evaluations. We identify key issues with forecasting evaluations 🧵 (1/7)
IMO it's very important to measure LLM utility on tasks that we actually want them to perform well on, not just hard sandbox tasks. This is an excellent benchmark that does exactly that!
1/ Excited to share RealMath: a new benchmark that evaluates LLMs on real mathematical reasoning---from actual research papers (e.g., arXiv) and forums (e.g., Stack Exchange).
RT @NKristina01_: Congrats, your jailbreak bypassed an LLM's safety by making it pretend to be your grandma! But did the model actually giv…
RT @edoardo_debe: 1/🔒Worried about giving your agent advanced capabilities due to prompt injection risks and rogue actions? Worry no more!…
RT @florian_tramer: I'll be mentoring MATS for the first time this summer, together with @dpaleka! Link below to apply.
RT @CSatETH: 🔎Can #AI models be "cured" after a cyber attack? New research from @florian_tramer's Secure and Private AI Lab reveals that re…
RT @javirandor: Adversarial ML research is evolving, but not necessarily for the better. In our new paper, we argue that LLMs have made pro…
RT @niloofar_mire: I've been thinking about Privacy & LLMs work for 2025 - here are 5 research directions and some key papers on privacy/me…
I am in beautiful Vancouver for #NeurIPS2024 with these amazing folks! Say hi if you want to chat about ML privacy and security (or specialty ☕).
🔥 I'm thrilled that I'll be spending next year in the group of @florian_tramer at ETH Zurich, working on privacy and memorization in ML 🔥 (Not an announcement, just what I usually do. It's a great group full of amazing people, and I'm thrilled to work with them every day!)
📖 Measuring Non-Adversarial Reproduction of Training Data in Large Language Models
➡️ Full paper:
✏️ Blog post with interactive examples:
Joint work with @javirandor, @edoardo_debe, Nicholas Carlini, @daphneipp, @florian_tramer.
spylab.ai
We show that LLMs often reproduce short snippets of training data even for natural and benign (non-adversarial) tasks.