
SRI Lab
@the_sri_lab
Followers
762
Following
59
Media
55
Statuses
199
RT @j_dekoninck: Thrilled to share a major step forward for AI for mathematical proof generation! . We are releasing the Open Proof Corpus:โฆ.
0
21
0
RT @ni_jovanovic: There's a lot of work now on LLM watermarking. But can we extend this to transformers trained for autoregressive image geโฆ.
0
54
0
Check out this recent work from our lab showing that benign-looking LLM's can hide backdoors that activate upon finetuning!.
๐จ LLM finetuning can be a backdoor trigger! ๐จ.You finetune a model you downloaded, on data you picked. You should be fine, right? Well, it turns out with your finetuning you could unknowingly activate a backdoor hidden in the downloaded model. How is this possible? ๐งต๐
0
0
4
@iclr_conf @rstaabr @mark_veroe @mbalunovic @mvechev @ni_jovanovic @baader_max @j_dekoninck @tibglo GRAIN: Exact Graph Reconstruction from Gradients.@drencheva79130 @IvoPetrov01 @baader_max @dimitrov_dimy @mvechev.๐ Sat 26th, 10:00AM - 12:30PM, #493.๐ The first gradient leakage attack for GNNs that achieves a high fraction of exact reconstructions.
0
0
2
@iclr_conf @rstaabr @mark_veroe @mbalunovic @mvechev @ni_jovanovic @baader_max @j_dekoninck Black-Box Detection of Language Model Watermarks.@tibglo @ni_jovanovic @rstaabr @mvechev.๐Sat 26th, 3PM-5.30PM, #480.๐ We design detection tests that can detect the presence of a watermark behind an API!.
1
0
2
@iclr_conf @rstaabr @mark_veroe @mbalunovic @mvechev @ni_jovanovic @baader_max Polyrating: A Cost-Effective and Bias-Aware Rating System for LLM Evaluation.@j_dekoninck @baader_max @mvechev.๐Fri 25th, 3-5.30PM, #247.๐ A system designed to fit accurate LLM ratings, detecting judge biases and incorporating cheap existing data.
1
0
1
@iclr_conf @rstaabr @mark_veroe @mbalunovic @mvechev Ward: Provable RAG Dataset Inference via LLM Watermarks.@ni_jovanovic @rstaabr @baader_max @mvechev.๐Thu 24th, 10:00AM-12:30PM, #513.๐ We show how LLM watermarks can be used to detect the unauthorized use of data in RAG.
1
0
1
@iclr_conf Language Models are Advanced Anonymizers.@rstaabr @mark_veroe @mbalunovic @mvechev.๐Thu 24th, 10:00AM-12:30PM, #550.๐ We show that LLMs can anonymize real-world texts providing both higher utility and better privacy than existing methods.
1
0
1
SRI Lab is proud to present 5 of our works on AI Security and Privacy at @iclr_conf main conference. Looking forward to seeing you in Singapore! Open for more โฌ๏ธ.
1
4
6
RT @mbalunovic: Big update to our MathArena USAMO evaluation: Gemini 2.5 Pro, which was released *the same day* as our benchmark, is the fiโฆ.
0
146
0
RT @mbalunovic: Can LLMs actually solve hard math problems? Given the strong performance at AIME, we now go to the next tier: our MathArenaโฆ.
0
85
0
Amazing work by @mark_veroe @nielstron @tch1bo @vesuraychev Max Baader @ni_jovanovic @jingxuan_he @mvechev, and an amazing collaboration with @logic_star_ai @UCBerkeley @INSAITinstitute!.
0
0
3