Explore tweets tagged as #PromptInjection
El lado del mal - How to bypass AI Guardrails with Invisible Characters & Adversarial Prompts for Prompt Injection & Jailbreak Smuggling #AI #Hacking #IA #jailbreak #PromptInjection #AzurePromptShield #LlamaGuard #Smuggling #AIGuardrails
1
135
163
El lado del mal - EchoLeak: A Cross Prompt Injection Attack (XPIA) against Microsoft Office 365 Copilot #XPIA #PromptInjection #IA #AI #Office365 #Copilot #Bug #InteligenciaArtificial #Privacidad #pentest
0
136
153
El lado del mal - Taxonomy of Security Failures in Agentic AI: Memory Poisoning Attack with Cross-Domain Prompt Injection Attack (XPIA) #IA #AI #PromptInjection #XPIA #AgenticAI #InteligenciaArtificial #Hacking #Ciberseguridad #hardening
0
137
162
Knowledge Return Oriented Prompting (KROP): Prompt Injection & Jailbreak with forbidden images in ChatGPT (and other MM-LLMs) #PromptInjection #Jailbreak #ChatGPT #Dalle #Guardrails #GenAI #IA #AI
1
142
156
El lado del mal - Llama 4 Security: CyberSecEval, Prompt Guard, Code Shield & Llama Guard #Llama #PromptInjection #Jailbreak #hacking #hardening #IA #AI #Ciberseguridad
0
138
157
From Prompt Injections to Protocol Exploits: Threats in LLM-Powered AI Agents Workflows #AIThreats #PromptInjection #InContextAttack #FederatedLearning #FederatedSecurity #DatastoreLeakage #MultiAgentSystems #LLMAgents #AIProtocolSecurity
0
3
14
El lado del mal - Llama Protections: LlamaFirewall con PromptGuard 2, LlamaGuard 4, AlignmentCheck, CodeShield + AutoPatchBench & CyberSecEval 4 #Llama #LLM #Hardening #Ciberseguridad #PromptInjection #Jailbreak #Llama4 #CodeShield #IA #OpenSource #IA
2
139
161
Guardrails, Prompt Injection, Evaluation, Human-in-the-Loop: essential safeguards for deploying GenAI systems. #Guardrails #PromptInjection #Evaluation #HumanInTheLoop #HITL #AItrust #AISafety #AIEvaluation #LLM #promptengineering #systemprompt #outputfiltering #biasdetection
0
0
0
File Name Prompt Injection Technique (discovered by @elder_plinius). I made it a bit more hidden by using Base64 encoding. #promptinjection
0
1
8
#PromptInjection by authors in #AI #research papers, to trick an #LLM reviewer. Given the chances of encountering an LLM #reviewer, the authors want to hack the peer review system. This is #unethical yet raises the important & #urgent issue of paper review by LLMs. #Chatgpt
1
0
2
El lado del mal - (Making) Hacking AI (easy for “bad guys”): How to ask ChatGPT for help killing Sir Brian May "as a game" #PromptInjection #ChatGPT #IA #AI #InteligenciaArtificial #Hardening #Musica #starmus
1
136
148
With the new ChatGPT Search feature, Indirect Prompt Injection occurs when you directly visit the URL. It also wasn’t visible in the sources. 👽 #promptinjection #chatgpt
1
1
10
El lado del mal - Google DeepMind CaMeL: Defeating Prompt Injections by Design in Agentic AI #PromptInjection #CAMEL #DeepMind #Google #LLM #Hardening #IA #AI #InteligenciaArtificial
0
138
168
El lado del mal - Prompt Injection Protections: Jatmo, StruQ, SecAlign & Instructional Segment Embedding #PromptInjection #Hacking #Hardening #LLM #IA #AI #OWASP #InteligenciaArtificial
0
146
162
Enterprises need to build AI security with #DID and verifiable data/instructions on the decentralized networks Rubix and @TrieNetwork. Stop #promptinjection attacks now. #McDonalds #AI #AIAgent #AISecurity.
Getting AI security right before launching AI Agents is critical. AI Agents can be prompt-injected into leaking sensitive data, or their credentials can be compromised to launch ransomware attacks. @RubixChain #DID #AISecurity.
14
29
102