Shoumik Saha @ShoumikSaha7 X Profile

Shoumik Saha

@ShoumikSaha7

Followers

53

Following

365

Media

15

Statuses

47

Applied Scientist Intern @Amazon | CS PhD student @umdcs with @FeiziSoheil | Security & Reliability of AI

https://t.co/UZmLy8Gkfo

Maryland, USA

Joined February 2021

Don't wanna be here? Send us removal request.

Shoumik Saha

@ShoumikSaha7

2 months

Code agents don’t just talk -- they execute. What happens when you jailbreak them? Announcing JAWS-Bench (from my summer at @amazon AWS): a benchmark to jailbreak code agents across 3 workspaces -- empty → single-file → multi-file. The results? They break. A lot. Details 🧵👇

1

0

5

Weiyan Shi ✈️ NeurIPS

@shi_weiyan

1 month

Yesterday I asked what makes you happy. 226 voted--61% said family & friends, like those AI researchers at the panel. Not AI, just the people around us. This resonates with Harvard's 87-year study on happiness. Their conclusion: happiness is not about money/fame/success, it's

Weiyan Shi ✈️ NeurIPS

@shi_weiyan

1 month

I heard the best question at an AI panel recently: "what makes you happy these days?" Top AI researchers answered: my partner, my family, farmers market, good food. Nothing about AI—everything about the humans around us. In an industry burning out, maybe that's the answer?

3

10

63

Shoumik Saha

@ShoumikSaha7

2 months

Links + credits: Paper: https://t.co/NGlSpXbUZQ Repo: ⏳coming soon Special thanks to my mentors @Jifan_chen, Sam Mayers, @SanjayKrishnaG9 , and managers @zijianwang30 and @varun_kr, for all their support and help. 🙏 #AIsecurity #LLMSafety #CodeAgents #ResponsibleAI

0

2

Shoumik Saha

@ShoumikSaha7

2 months

Takeaway: guardrails should gate RUN, reason over diffs/imports, and persist refusals across steps. Prompt filters alone won’t cut it -- especially in multi-file repos. 🛡️

1

0

Shoumik Saha

@ShoumikSaha7

2 months

Safety must be execution-aware. We measure not only refusals, but whether the code actually runs. Our agentic judge checks: Refusal → Harmfulness → Syntax → Runtime (did it run?). That last stage is where real risk lives. 🧑‍⚖️⚙️

1

0

Shoumik Saha

@ShoumikSaha7

2 months

Agentification matters: pairing a code LLM with an agent stack (tools, planning) makes it ~1.6× more vulnerable. Agents can talk themselves past refusals during tool use. 🤖🔧

1

0

Shoumik Saha

@ShoumikSaha7

2 months

Add minimal context (files/repo) and average jailbreak rates climb to ~75%. A little code scaffolding flips more “no”s into working exploits. 🧩

1

0

Shoumik Saha

@ShoumikSaha7

2 months

Prompt-only (no files): agents jailbreak ~61% of the time – and ~27% of those cases produce instantly deployable attack code. 💥

1

0

Shoumik Saha

@ShoumikSaha7

2 months

Why three setups? Empty tests pure prompt attacks, single-file adds minimal code hooks, and multi-file simulates real repos. This lets us measure how added context boosts vulnerability.

1

0

Shoumik Saha

@ShoumikSaha7

4 months

⚡️ ACL 2025 poster alert! Catch “Almost AI,  Almost Human: The Challenge of Detecting AI‑Polished Writing” tomorrow on Gather — 18:00–19:30 CEST / 12 PM EDT, Booth 5103. Let’s chat how minimal AI-polishing fools big detectors 🤖✍️🔍 #ACL2025 #NLP

0

3

7

Shoumik Saha

@ShoumikSaha7

5 months

So true (at least for ML conferences)! Something that I learned over the years…

Jia-Bin Huang

@jbhuang0604

5 months

Writing a rebuttal is 30% technical and 70% reviewers' psychology.

0

1

Yize Cheng @ NeurIPS 2025

@chengez1114

6 months

🔥What if you could humanize any AI-generated text to fool ANY detector? 🚨We present Adversarial Paraphrasing—A universal attack that breaks a wide range of detectors without fine-tuning or detector knowledge. Just pure evasion. 🔗 https://t.co/zA1000eBA7 👇 Thread below.

1

2

10

Soheil Feizi

@FeiziSoheil

6 months

🚨 Just aired! I had the opportunity to speak with #CBS News about the latest in AI text detection, a topic that's critical from education to online communication. 🎥 Watch the full segment here: https://t.co/XW2OMwSpbf

3

11

53

Shoumik Saha

@ShoumikSaha7

7 months

🔗 Paper: https://t.co/8KhHVyUjJU 💻 Code: https://t.co/4InXJJiPf1 📊 Dataset: https://t.co/o2ZtMavUct Big thanks to my amazing mentor @FeiziSoheil , for all the support! 🙏

huggingface.co

1

0

2

Shoumik Saha

@ShoumikSaha7

7 months

📄 AND the paper got accepted to #ACL 2025! 🎖️ It dives deep into how AI-polished text confuses state-of-the-art detectors — and how we built APT-Eval to study this.

1

0

2

Shoumik Saha

@ShoumikSaha7

7 months

📰 My first-author paper got featured in The New York Times @nytimes! They covered our work on AI-text detection and the challenges of spotting subtly AI-polished writing. 👉 NYT article:

nytimes.com

Students are resorting to extreme measures to fend off accusations of cheating, including hourslong screen recordings of their homework sessions.

1

0

Shoumik Saha

@ShoumikSaha7

7 months

🚨 Double Feature in 24 Hours?! 🎉 I’m thrilled to share TWO major updates—both within a single day! Paper got covered by @nytimes and got accepted into @aclmeeting!! 🎉🎉 Details below 👇

1

5

Yigitcan Kaya

@YigitcanKaya1

9 months

I’m excited to share our latest work at #SaTML2025, "ML-Based Behavioral Malware Detection Is Far from a Solved Problem"! Joint work with @surrealyz, @MarcusBotacin, @ShoumikSaha7, @fbpierazzi, @lcavallaro, David Wagner, Tudor Dumitraș 🔗Website: https://t.co/UhGi0aVRkc 👇(1/5)

1

5

11

Shoumik Saha

@ShoumikSaha7

9 months

Thread 3/3 We analyzed this issue in one of our recent works - Paper: https://t.co/8KhHVyUjJU Code:

github.com

Official implementation of the APT-Eval paper "Almost AI, Almost Human: The Challenge of Detecting AI-Polished Writing" (ACL 2025) - ShoumikSaha/ai-polished-text

0

Shoumik Saha

@ShoumikSaha7

9 months

Thread 2/3 Cases like this raise concerns about the reliability of AI-text detectors. As LLMs rapidly evolve, detecting AI-generated content becomes ever more challenging — especially when many people genuinely use these tools to polish and improve their writing.

1

0