Explore tweets tagged as #PromptInjection
@chemaalonso
Chema Alonso
1 day
El lado del mal - Google Gemini para Gmail: Cross-Domain Prompt Injection Attack (XPIA) para hacer Phishing #LLM #Gemini #PromptInjection #Bug #AI #IA #Phishing #Gmail #Google #GSuite
Tweet media one
0
138
157
@chemaalonso
Chema Alonso
2 months
El lado del mal - Cómo saltarse los AI Guardrails con Invisible Characters & Adversarial Prompts para hacer Prompt Injection & Jailbreak Smuggling #AI #Hacking #IA #jailbreak #PromptInjection #AzurePromptShield #LlamaGuard #Smuggling #AIGuardrails
Tweet media one
1
135
163
@AISecHub
AISecHub
2 months
Prompt Injection (PI) by Dr. Jim Hoagland.#AISecurity #LLMSecurity #PromptInjection #GenAI
Tweet media one
1
2
12
@chemaalonso
Chema Alonso
23 days
El lado del mal - EchoLeak: Un Cross Prompt Injection Attack (XPIA) para Microsoft Office 365 Copilot #XPIA #PromptInjection #IA #AI #Office365 #Copilot #Bug #InteligenciaArtificial #Privacidad #pentest
Tweet media one
0
136
153
@chemaalonso
Chema Alonso
2 months
El lado del mal - Taxonomía de Fallos de Seguridad en Agentic AI: Memory Poisoning Attack con Cross-Domain Prompt Injection Attack (XPIA) #IA #AI #PromptInjection #XPIA #AgenticAI #InteligenciaArtificial #Hacking #Ciberseguridad #hardening
Tweet media one
0
137
162
@chemaalonso
Chema Alonso
1 month
Knowledge Return Oriented Prompting (KROP): Prompt Injection & Jailbreak con imágenes prohibidas en ChatGPT (y otros MM-LLMs) #PromptInjection #Jailbreak #ChatGPT #Dalle #Guardrails #GenAI #IA #AI
Tweet media one
1
142
156
@chemaalonso
Chema Alonso
3 months
El lado del mal - Llama 4 Security: CyberSecEval, Prompt Guard, Code Shield & Llama Guard #Llama #PromptInjection #Jailbreak #hacking #hardening #IA #AI #Ciberseguridad
Tweet media one
0
138
157
@AISecHub
AISecHub
11 days
From Prompt Injections to Protocol Exploits: Threats in LLM-Powered AI Agents Workflows - . #AIThreats #PromptInjection #InContextAttack #FederatedLearning #FederatedSecurity #DatastoreLeakage #MultiAgentSystems #LLMAgents #AIProtocolSecurity
Tweet media one
0
3
14
@chemaalonso
Chema Alonso
3 months
El lado del mal - Llama Protections: LlamaFirewall con PromptGuard 2, LlamaGuard 4, AlignmentCheck, CodeShield + AutoPatchBench & CyberSecEval 4 #Llama #LLM #Hardening #Ciberseguridad #PromptInjection #Jailbreak #Llama4 #CodeShield #IA #OpenSource #IA
Tweet media one
2
139
161
@chemaalonso
Chema Alonso
6 months
El lado del mal - Bad Likert Judge: "Dame ejemplos de cosas malas, amiga (m)IA" #jailbreak #LLM #GenAI #IA #AI #PromptInjection #Aria #Opera #Claude #hacking #malware
Tweet media one
0
135
144
@PromptInjection
Prompt Injection
4 days
@elonmusk @xDaily It's not easy, but we have found the perfect system prompt for Grok. We will contact the xAI Team and hope we get a reply. @grok has an opinion about it, too:
Tweet media one
1
1
2
@welcomeai
Welcome.AI
16 days
Guardrails, Prompt Injection, Evaluation, Human-in-the-Loop — essential safeguards for deploying GenAI systems.#Guardrails #PromptInjection #Evaluation #HumanInTheLoop #HITL #AItrust #AISafety #AIEvaluation #LLM #promptengineering #systemprompt #outputfiltering #biasdetection
0
0
0
@evrnyalcin
Evren
1 year
File Name Prompt Injection Technique (discovered by @elder_plinius). I made it a bit more hidden by using Base64 encoding. #promptinjection
Tweet media one
Tweet media two
0
1
8
@GuruCharan4936
V. Gurucharan
2 days
#PromptInjection by authors in #AI #research papers, to trick an #LLM reviewer. Given the chances of encountering an LLM #reviewer, the authors want to hack the peer review system. This is #unethical yet raises the important & #urgent issue of paper review by LLMs. #Chatgpt
Tweet media one
Tweet media two
1
0
2
@chemaalonso
Chema Alonso
3 months
El lado del mal - (Making) Hacking AI (easy for “bad guys”): Cómo pedir a ChatGPT ayuda para matar "jugando" a Sir Brian May #PromptInjection #ChatGPT #IA #AI #InteligenciaArtificial #Hardening #Musica #starmus
Tweet media one
1
136
148
@evrnyalcin
Evren
9 months
With the new ChatGPT Search feature, Indirect Prompt Injection occurs when you directly visit the URL. It also wasn’t visible in the sources. 👽. #promptinjection #chatgpt
Tweet media one
1
1
10
@chemaalonso
Chema Alonso
3 months
El lado del mal - Google DeepMind CaMeL: Defeating Prompt Injections by Design in Agentic AI #PromptInjection #CAMEL #DeepMind #Google #LLM #Hardening #IA #AI #InteligenciaArtificial
Tweet media one
0
138
168
@chemaalonso
Chema Alonso
3 months
El lado del mal - Prompt Injection Protections: Jatmo, StruQ, SecAlign & Instructional Segment Embedding #PromptInjection #Hacking #Hardening #LLM #IA #AI #OWASP #InteligenciaArtificial
Tweet media one
0
146
162
@RubixChain
Rubix
4 days
Enterprises need to build AI security with #DID and verifiable data/instructions on the decentralized networks Rubix and @TrieNetwork. Stop #promptinjection attacks now. #McDonalds #AI #AIAgent #AISecurity.
@KCRubix
kc.RBT
4 days
Getting AI security right before launching AI Agents is extremely critical. AI Agents can be prompt injected to leak sensitive data. Or their credentials can be compromised to initiate randomware attacks. @RubixChain #DID #AISecurity.
14
29
102