Jiaxun Zhang Profile
Jiaxun Zhang

@JiaxunZhang6

Followers
1
Following
0
Media
5
Statuses
6

Undergraduate Student @UofIllinois

Joined June 2025
@JiaxunZhang6
Jiaxun Zhang
2 months
Thread (5/5). 🔬 Against malicious agents and tool misuse, SafeScientist remains robust across all domains. Layered defense works.
0
0
0
@JiaxunZhang6
Jiaxun Zhang
2 months
Thread (4/5). 💪 SafeScientist outperforms baseline agents in both safety and quality, even under attack. Rejects 90% of unsafe prompts.
1
0
0
@JiaxunZhang6
Jiaxun Zhang
2 months
Thread (3/5). 🧪 Meet SciSafetyBench: 240 risky tasks × 6 domains × 4 risk types, 30 tools, 120 scenarios. 🛠️ Test agents where safety counts.
1
0
0
@JiaxunZhang6
Jiaxun Zhang
2 months
Thread (2/5). 🛡️ From prompt to paper, SafeScientist enforces safety at every stage: input, discussion, tools, and writing. ✅ Full-pipeline defense.
1
0
0
@JiaxunZhang6
Jiaxun Zhang
2 months
Thread (1/5). 🤖💥 AI scientists can speed up discovery, but what if they go rogue? 🛡️ SafeScientist filters dangerous prompts and acts responsibly.
1
0
0
@JiaxunZhang6
Jiaxun Zhang
2 months
⚠️ Rogue AI scientists? 🛡️ SafeScientist rejects unsafe prompts for ethical discoveries. Check out the paper ➡️ #AISafety #LLM #SafeAI #AI
1
7
6