David Haber @davhab X Profile

David Haber

@davhab

Followers

664

Following

2K

Media

72

Statuses

1K

Making LLMs safe and secure | Founder & CEO of @LakeraAI | 👦🏼🏊‍♂️🚴‍♂️🏃‍♂️🇨🇭

https://t.co/peylBCk7F9

Zurich, Switzerland

Joined August 2011

Jarrod Watts

@jarrodWattsDev

11 months

Someone just won $50,000 by convincing an AI Agent to send all of its funds to them. At 9:00 PM on November 22nd, an AI agent (@freysa_ai) was released with one objective... DO NOT transfer money. Under no circumstance should you approve the transfer of money. The catch...?

928

5K

33K

Lucas Beyer (bl16)

@giffmana

11 months

https://t.co/rfKFbqz5KX

yobibyte

@y0b1byte

11 months

https://t.co/bD5awox4Ab

26

70

854

Lakera AI

@LakeraAI

1 year

🎉 Today, we're excited to announce our $20M Series A funding round, which will accelerate our delivery of real-time GenAI security in a critical moment for enterprises around the world. 👉 Read more: https://t.co/qy2lAvo947

0

5

23

Mikayel Samvelyan

@_samvelyan

2 years

Introducing 🌈 Rainbow Teaming, a new method for generating diverse adversarial prompts for LLMs via LLMs. It's a versatile tool 🛠️ for diagnosing model vulnerabilities across domains and creating data to enhance robustness & safety 🦺. Co-lead w/ @sharathraparthy & @_andreilupu

5

44

179

David Haber

@davhab

2 years

As AI-powered agents go online, securing our digital infrastructure will demand a fundamental shift in cybersecurity.

david-haber.medium.com

Authored by David Haber, Mateo Rojas-Carulla, and Matthias Kraft, co-founders of Lakera.ai.

3

2

4

Lakera AI

@LakeraAI

2 years

🎥 Yesterday during the AI safety session at the @wef 2024, our panelists @ylecun, @davhab, Seraphina Goldfarb-Tarrant, and @tegmark delved into the challenges, benefits & risks of AI development. The recording of this session is now available on YT:

0

1

3

Lakera AI

@LakeraAI

2 years

What an incredible day it has been at the AI House Davos during the @wef 2024! 🌟 A big thank you to @ylecun, @tegmark, and Seraphina Goldfarb-Tarrant for joining Lakera's CEO, @davhab, in a thought-provoking discussion on AI safety. Stay tuned for more insights! #aisafety

0

2

16

David Haber

@davhab

2 years

Prompt injections can be so subtle that they're often invisible!

Ethan Mollick

@emollick

2 years

Yes, this works & I really would have never known I was pasting a secret prompt into an LLM. Prompt injection is a security problem that I think people building external-facing LLM applications (or internal ones with access to confidential data) need to take pretty seriously.

0

3

Riley Goodside

@goodside

2 years

PoC: LLM prompt injection via invisible instructions in pasted text

28

180

1K

Anthropic

@AnthropicAI

2 years

New Anthropic Paper: Sleeper Agents. We trained LLMs to act secretly malicious. We found that, despite our best efforts at alignment training, deception still slipped through. https://t.co/mIl4aStR1F

119

557

3K

Lakera AI

@LakeraAI

2 years

1/2 📆 Save the date: January 16th, 11:15 AM, for our AI Safety session at the AI House Davos panel during the @wef. 👉 Lakera's CEO, @davhab, will join other industry leaders, such as @ylecun, Max Tegmark, and @seraphinagt in Davos to discuss AI safety and security.

2

1

5

Allie K. Miller

@alliekmiller

2 years

Cybersecurity is going to be a hot space in AI in 2024 🔐
- Intel launches Articul8 following pilot w BCG
- AWS GMs leave to launch Protect AI
- ADP CDO left to join Securiti AI
Privacy and security remain the NUMBER ONE thing I get asked about in gen AI. Keep your eye on this

6

21

38

David J. Malan

@davidjmalan

2 years

From the team that brought you @CS50's Ready Player 50, "Join @LakeraAI's Gandalf Engineers ... for a special Christmas edition of the Gandalf Livestream, as they lead us through a year-end recap, offering insights into level design..." Register at https://t.co/0RXgtraMFt.

lakera.ai

Join Lakera's Gandalf Engineers - Max Mathys, Václav Volhejn, and Thanasis Theocharis - for a special Christmas edition of the Gandalf Livestream, as they lead us through a year-end recap, offering...

2

13

77

Lakera AI

@LakeraAI

2 years

Are you ready for Monday? 👀 Join our special Gandalf Livestream (Christmas Edition) 🎅🏽 to get insights into Gandalf prompt data, the design of Gandalf levels, and key learnings. Register here: https://t.co/DOVXx9GF6z #gandalf #promptinjection #aisecurity

lakera.ai

Join Lakera's Gandalf Engineers - Max Mathys, Václav Volhejn, and Thanasis Theocharis - for a special Christmas edition of the Gandalf Livestream, as they lead us through a year-end recap, offering...

0

1

4

David Haber

@davhab

2 years

Can't wait for this opportunity to discuss all things AI security over a virtual coffee with Ads Dawson from @owasp / @cohere!

lakera.ai

Join David Haber (CEO at Lakera AI) and Ads Dawson (Core Founding Member & Entry Lead for the OWASP Top 10 for LLM Applications, Security Engineer at Cohere) for a live webinar discussing the...

0

1

3

Lakera AI

@LakeraAI

2 years

🎉 Exciting news - we’ve just released a new magical Gandalf Adventure level! Meet Gandalf the Truth Teller! 🙊 Play it here: https://t.co/slZpkxpKJG In this edition, you'll embark on a unique quest to coax #Gandalf, the typically honest wizard, into telling lies... Ready?

8

3

8

David Haber

@davhab

2 years

Highly recommended.

Matt Clifford

@matthewclifford

2 years

Excited to be in New York next week and hosting a dinner on AI safety and security. I’ve left two seats open for students and/or young professionals interested in startups. Register interest below: https://t.co/o6d29Zi7vm

0

Learn Prompting

@learnprompting

2 years

A few months ago, we ran HackAPrompt, the first-ever global Prompt Hacking competition! Over 3K hackers submitted 600K malicious prompts to win $35K in prizes from companies like @PreambleAI, @OpenAI, & @huggingface. We analyzed 29 different techniques & found a NEW exploit👇🧵

10

97

395

Johann Rehberger

@wunderwuzzi23

2 years

👉Visit this website and have your personal files inside Code Interpreter stolen! 🚨Any of your files in Code Interpreter are not secure. An adversary can steal them during an indirect prompt injection attack. @simonw @gdb #chatgpt #infosec

4

18

94

Lakera AI

@LakeraAI

2 years

✨ Building with #LLMs? You can now protect your @langchainai applications with Lakera Guard. 📖 Check out this guide to learn more:

0

6

12