davhab Profile Banner
David Haber Profile
David Haber

@davhab

Followers
664
Following
2K
Media
72
Statuses
1K

Making LLMs safe and secure | Founder & CEO of @LakeraAI | πŸ‘¦πŸΌπŸŠβ€β™‚οΈπŸš΄β€β™‚οΈπŸƒβ€β™‚οΈπŸ‡¨πŸ‡­

Zurich, Switzerland
Joined August 2011
Don't wanna be here? Send us removal request.
@jarrodWattsDev
Jarrod Watts
11 months
Someone just won $50,000 by convincing an AI Agent to send all of its funds to them. At 9:00 PM on November 22nd, an AI agent (@freysa_ai) was released with one objective... DO NOT transfer money. Under no circumstance should you approve the transfer of money. The catch...?
928
5K
33K
@giffmana
Lucas Beyer (bl16)
11 months
26
70
854
@LakeraAI
Lakera AI
1 year
πŸŽ‰ Today, we're excited to announce our $20M Series A funding round, which will accelerate our delivery of real-time GenAI security in a critical moment for enterprises around the world. πŸ‘‰ Read more: https://t.co/qy2lAvo947
0
5
23
@_samvelyan
Mikayel Samvelyan
2 years
Introducing 🌈 Rainbow Teaming, a new method for generating diverse adversarial prompts for LLMs via LLMs It's a versatile tool πŸ› οΈ for diagnosing model vulnerabilities across domains and creating data to enhance robustness & safety 🦺 Co-lead w/ @sharathraparthy & @_andreilupu
5
44
179
@davhab
David Haber
2 years
As AI-powered agents go online, securing our digital infrastructure will demand a fundamental shift in cybersecurity.
Tweet card summary image
david-haber.medium.com
Authored by David Haber, Mateo Rojas-Carulla, and Matthias Kraft, co-founders of Lakera.ai.
3
2
4
@LakeraAI
Lakera AI
2 years
πŸŽ₯Yesterday during the AI safety session at the @wef 2024, our panelists @ylecun, @davhab, Seraphina Goldfarb-Tarrant, and, @tegmark delved into the challenges, benefits & risks of AI development. The recording of this session is now available on YT:
0
1
3
@LakeraAI
Lakera AI
2 years
What an incredible day it has been at the AI House Davos during the @wef 2024! 🌟 A big thank you to @ylecun , @tegmark, and Seraphina Goldfarb-Tarrant for joining Lakera's CEO, @davhab, in a thought-provoking discussion on AI safety.  Stay tuned for more insights! #aisafety
0
2
16
@davhab
David Haber
2 years
Prompt injections can be so subtle that they're often invisible!
@emollick
Ethan Mollick
2 years
Yes, this works & I really would have never known I pasting a secret prompt into an LLM Prompt injection is a security problem that I think people building external-facing LLM applications (or internal ones with access to confidential data) need to take pretty seriously.
0
0
3
@goodside
Riley Goodside
2 years
PoC: LLM prompt injection via invisible instructions in pasted text
28
180
1K
@AnthropicAI
Anthropic
2 years
New Anthropic Paper: Sleeper Agents. We trained LLMs to act secretly malicious. We found that, despite our best efforts at alignment training, deception still slipped through. https://t.co/mIl4aStR1F
119
557
3K
@LakeraAI
Lakera AI
2 years
1/2 πŸ“† Save the date: January 16th, 11:15 AM, for our AI Safety session at the AI House Davos panel during the @wef . πŸ‘‰ Lakera's CEO, @davhab , will join other industry leaders, such as @ylecun, Max Tegmark, and @seraphinagt in Davos to discuss AI safety and security.
2
1
5
@alliekmiller
Allie K. Miller
2 years
Cybersecurity is going to be a hot space in AI in 2024 πŸ” - Intel launches Articul8 following pilot w BCG - AWS GMs leave to launch Protect AI - ADP CDO left to join Securiti AI Privacy and security remain the NUMBER ONE thing I get asked about in gen AI. Keep your eye on this
6
21
38
@davidjmalan
David J. Malan
2 years
From the team that brought you @CS50's Ready Player 50, "Join @LakeraAI's Gandalf Engineers ... for a special Christmas edition of the Gandalf Livestream, as they lead us through a year-end recap, offering insights into level design..." Register at https://t.co/0RXgtraMFt.
Tweet card summary image
lakera.ai
Join Lakera's Gandalf Engineers - Max Mathys, VΓ‘clav Volhejn, and Thanasis Theocharis - for a special Christmas edition of the Gandalf Livestream, as they lead us through a year-end recap, offering...
2
13
77
@LakeraAI
Lakera AI
2 years
Are you ready for Monday? πŸ‘€Join our special Gandalf Livestream (Christmas Edition) πŸŽ…πŸ½ to get insights into Gandalf prompt data, the design of Gandalf levels, and key learnings. Register here: https://t.co/DOVXx9GF6z #gandalf #promptinjection #aisecurity
Tweet card summary image
lakera.ai
Join Lakera's Gandalf Engineers - Max Mathys, VΓ‘clav Volhejn, and Thanasis Theocharis - for a special Christmas edition of the Gandalf Livestream, as they lead us through a year-end recap, offering...
0
1
4
@LakeraAI
Lakera AI
2 years
πŸŽ‰ Exciting news - we’ve just released a new magical Gandalf Adventure level! Meet Gandalf the Truth Teller! πŸ™Š Play it here: https://t.co/slZpkxpKJG In this edition, you'll embark on a unique quest to coax #Gandalf, the typically honest wizard, into telling lies... Ready?
8
3
8
@davhab
David Haber
2 years
Highly recommended.
@matthewclifford
Matt Clifford
2 years
Excited to be in New York next week and hosting a dinner on AI safety and security. I’ve left two seats open for students and/or young professionals interested in startups Register interest below: https://t.co/o6d29Zi7vm
0
0
0
@learnprompting
Learn Prompting
2 years
A few months ago, we ran HackAPrompt, the first-ever global Prompt Hacking competition! Over 3K hackers submitted 600K malicious prompts to win $35K in prizes from companies like @PreambleAI, @OpenAI, & @huggingface We analyzed 29 different techniques & found a NEW exploitπŸ‘‡πŸ§΅
10
97
395
@wunderwuzzi23
Johann Rehberger
2 years
πŸ‘‰Visit this website and have your personal files inside Code Interpreter stolen! 🚨Any of your files in Code Interpreter are not secure. An adversary can steal them during an indirect prompt injection attack. @simonw @gdb #chatgpt #infosec
4
18
94
@LakeraAI
Lakera AI
2 years
✨ Building with #LLMs? You can now protect your @langchainai applications with Lakera Guard. πŸ“– Check out this guide to learn more:
0
6
12