uiuc_aisecure Profile Banner
Bo Li Profile
Bo Li

@uiuc_aisecure

Followers
2K
Following
349
Media
4
Statuses
203

Virtue AI, UIUC @VirtueAI_co

San Francisco, US
Joined May 2020
Don't wanna be here? Send us removal request.
@togethercompute
Together AI
4 months
๐Ÿ›ก๏ธ VirtueGuard is LIVE on Together AI ๐Ÿš€ AI security and safety model that screens input and output for harmful content: โšก Under 10ms ๐—ฟ๐—ฒ๐˜€๐—ฝ๐—ผ๐—ป๐˜€๐—ฒ ๐ŸŽฏ ๐Ÿด๐Ÿต% ๐—ฎ๐—ฐ๐—ฐ๐˜‚๐—ฟ๐—ฎ๐—ฐ๐˜† vs 76% (AWS Bedrock) ๐Ÿง  ๐—–๐—ผ๐—ป๐˜๐—ฒ๐˜…๐˜-๐—ฎ๐˜„๐—ฎ๐—ฟ๐—ฒ - adapts to your policies, not just keywords ๐Ÿ‘‡
4
5
23
@uiuc_aisecure
Bo Li
4 months
Safety & security definitions are domain-specific in most cases -- We provide the first domain-specific, and policy-grounded guardrail benchmark! Exciting to enter the stage of nuanced guardrail protection for foundation models and AI applications!
@MintongKang
Mintong Kang
4 months
๐Ÿšจ GUARDSET-X: The First multi-domain, policy-grounded LLM security guardrail dataset! ๐Ÿ“š 150+ safety policies, 1000+ rules, 400+ risk categories ๐ŸŒ 8 domains ๐Ÿค– Auto data generation ๐Ÿงช Detoxified + adversarial prompts ๐Ÿ›ก๏ธ 19 guardrail models ๐Ÿ“„
0
1
13
@VirtueAI_co
Virtue AI
5 months
Autonomous AI agents are rapidly being deployed across industries, from web browsing copilots to code-writing assistants and enterprise workflow agents. But these systems come with a new class of security risks that traditional guardrails and red teaming arenโ€™t equipped to
1
4
15
@uiuc_aisecure
Bo Li
6 months
Very timely Guard Agent to ensure access control for general agents in different domains!
@ZhenXia98294421
Zhen Xiang
6 months
AI agents can be easily hacked to leak user data โ€” recent work calls for stronger access controls. Our ICML25 paper presents GuardAgent, which successfully enforces strong access control on other protected agents via dynamic, code enforced guardrails.๐ŸŒ https://t.co/cz8BAlEn0l
0
1
8
@VirtueAI_co
Virtue AI
6 months
Congrats to our partners at @glean on their first-ever hashtag #GleanGO conference! ๐ŸŽ‰ Weโ€™re honored to be part of their security and governance ecosystem, helping power trusted AI across the enterprise.
1
3
7
@VirtueAI_co
Virtue AI
6 months
๐Ÿšจ Introducing VirtueGuard Code: Real-time vulnerability detection for AI-generated code. As coding assistants like @cursor_ai and @GitHubCopilot become standard in development workflows, itโ€™s critical to ensure that generated code meets security standards. VirtueGuard Code is
1
5
12
@ZRChen_AISafety
Zhaorun Chen
6 months
VirtueAgent provides the first systematic guardrails for general AI agents!! Super exciting work such that we can rest assured and let our agents handle things for us!๐Ÿ‘
@VirtueAI_co
Virtue AI
6 months
๐Ÿš€ Introducing VirtueAgent, the first security layer for the agentic era. As AI agents begin to act autonomously in real-world environments, such as personal assistants, finance, healthcare, ensuring they operate securely and compliant is critical. VirtueAgent provides
0
2
5
@VirtueAI_co
Virtue AI
6 months
๐Ÿšจ 3 days out from our live webinar on the EU AI Act hosted by Sanmi Koyejo and Jan EiรŸfeldt! Register now: https://t.co/rRoFqc4NvE โฌ‡๏ธ Details below
@VirtueAI_co
Virtue AI
6 months
Join Virtue AI Co-founder @sanmikoyejo and Jan EiรŸfeldt, Global Head of Trust & Safety at Wikimedia Foundation, for a live discussion on what the EU AI Act means for enterprises and how they can stay compliant without slowing down innovation. ๐Ÿ“… May 15 | ๐Ÿ•™ 10 AM PT | ๐Ÿ’ป Virtual
0
2
4
@ZRChen_AISafety
Zhaorun Chen
7 months
๐Ÿ“ทCome and check our #ICLR2025 poster ๐—ฆ๐—ฎ๐—ณ๐—ฒ๐—ช๐—ฎ๐˜๐—ฐ๐—ต!!๐Ÿ”ฅ Today April 25th 3:00 pm -5:30 pm ๐Ÿ“ท at Poster Session Hall 3 #547๐Ÿ“ท๐Ÿ“ท
@ZRChen_AISafety
Zhaorun Chen
11 months
๐Ÿš€ Introducing ๐’๐š๐Ÿ๐ž๐–๐š๐ญ๐œ๐ก! ๐Ÿš€ While generative models ๐Ÿ‘พ๐ŸŽฅ like Sora and Veo 2 have shown us some stunning videos recently, they also make it easier to produce harmful content (sexual๐Ÿ”ž, violent๐Ÿ™…โ€โ™‚๏ธ, deepfakes๐ŸงŸโ€โ™‚๏ธ). ๐Ÿ”ฅ ๐’๐š๐Ÿ๐ž๐–๐š๐ญ๐œ๐ก is here to help ๐Ÿ˜Ž: the first
0
3
7
@guestrin
Carlos Guestrin
7 months
We are super excited to empower developers to focus on their goal of building innovative AI applications; weโ€™ll take care of safety and security! What an awesome ride with Bo Li @uiuc_aisecure, @sanmikoyejo, @dawnsongtweets and the whole @VirtueAI_co team!
@VirtueAI_co
Virtue AI
7 months
Weโ€™ve raised $30M in Seed + Series A funding led by @lightspeedvp and Walden Catalyst Ventures, with participation from Prosperity7 Ventures, Factory, Osage University Partners (OUP), Lip-Bu Tan, Chris Re, and more. Virtue AI is the first unified platform for securing AI across
0
3
17
@VirtueAI_co
Virtue AI
7 months
Weโ€™ve raised $30M in Seed + Series A funding led by @lightspeedvp and Walden Catalyst Ventures, with participation from Prosperity7 Ventures, Factory, Osage University Partners (OUP), Lip-Bu Tan, Chris Re, and more. Virtue AI is the first unified platform for securing AI across
3
12
47
@VirtueAI_co
Virtue AI
7 months
Join Virtue AI Co-founder Sanmi Koyejo for a live webinar on why protecting your AI apps isnโ€™t just about safetyโ€”itโ€™s the key to faster deployment and growth. ๐Ÿ“… April 24 | ๐Ÿ•™ 10 AM PT | ๐Ÿ’ป Virtual In this session, weโ€™ll cover: โœ… Why traditional security tooling falls short
0
6
16
@uiuc_aisecure
Bo Li
7 months
Exciting!! Huge congratulations to @xuchejian @_weiping and the great NVIDIA team!! Looking forward to the next exciting milestone!!
@xuchejian
Chejian Xu
7 months
Excited to see UltraLong-8B out! ๐ŸŽ‰ We extended Llama3.1 to 1Mโ€“4M context lengths with just 1B tokens of continued pretraining and recovered short-context performance with only 100K SFT samples. Huge thanks to @_weiping and the NVIDIA team for an amazing internship experience!
0
4
20
@VirtueAI_co
Virtue AI
7 months
Virtue AI is honored to be recognized by Intel's new CEO, Lip-Bu Tan, during his opening keynote at #IntelVision. Wishing him great success as he leads @intel into its next exciting chapter. AI security & safety have become the critical last mile for AI applications. At Virtue
1
3
19
@VirtueAI_co
Virtue AI
9 months
Virtue AI just released our red-teaming analysis of @OpenAIโ€™s GPT-4.5 in comparison with @Anthropicโ€™s Claude 3.7! We tested them on safety, security, hallucination, regulatory compliance, codeGen vulnerabilities, and more. Hereโ€™s what we found... (1/9)
1
7
12
@HowieH36226
Yue Huang
9 months
Toward Trustworthy Generative Foundation Models (GenFMs) ๐Ÿš€ ๐ŸŽ‡After six months of hard work and thanks to the efforts of the entire team, our report on the trustworthiness of generative foundation models (GenFMs) has finally been released. ๐Ÿ’กIn this work, we: -Developed a
2
33
98
@VirtueAI_co
Virtue AI
9 months
Can Reasoning Improve Safety & Security? Red-Teaming Analysis for Claude 3.7 ๐Ÿš€ Claude 3.7 Sonnet Thinking: A New Era of Hybrid Reasoning? Anthropic's latest release introduces a Thinking mode, letting users switch between rapid responses and step-by-step reasoning. But does
0
8
19
@VirtueAI_co
Virtue AI
9 months
Weโ€™re partnering with @glean to adapt Virtue AIโ€™s pioneering research in AI security, including content moderation, guardrails, and red teaming to Gleanโ€™s enterprise customers. Find out more: https://t.co/K5lVz4ADWl
0
5
12
@Yihe__Deng
Yihe Deng
9 months
New paper & model release! Excited to introduce DuoGuard: A Two-Player RL-Driven Framework for Multilingual LLM Guardrails, showcasing our new DuoGuard-0.5B model. - Model: https://t.co/6leoz9d56S - Paper: https://t.co/571OSZIU1r - GitHub: https://t.co/8VJvKadopX Grounded in a
2
31
135
@VirtueAI_co
Virtue AI
9 months
AI Safety Comparison: OpenAI o3-mini vs. Deepseek-R1 VirtueAI conducted an in-depth red-teaming evaluation of two leading AI models to assess their safety, bias, privacy protections, and robustness. Key findings: 1. o3-mini demonstrates stronger privacy safeguards and fairness
1
5
17