Kyle Avery
@kyleavery
Followers
4K
Following
4K
Media
131
Statuses
1K
one more, this time for OverTheWire:
app.primeintellect.ai
Verifier for OverTheWire challenges
i just posted another verifier to the Environments Hub, this time for @picoctf
https://t.co/J76KlNaj9z
0
0
7
@ChrisMurphyCT You're being played by people who want regulatory capture. They are scaring everyone with dubious studies so that open source models are regulated out of existence.
251
917
9K
i just posted another verifier to the Environments Hub, this time for @picoctf
https://t.co/J76KlNaj9z
app.primeintellect.ai
Verifier for PicoCTF challenges
2
1
23
i ended up changing the original benchmark code a bit: - no more vision option, PDF reports are always presented as text - PDF reports were converted to markdown using OCR + manual review - Hybrid Analysis reports are now indented JSON instead of a big blob
1
0
3
this was just a small project to make the original work from CrowdStrike and Meta more accessible https://t.co/8iWU5kbRmy
1
0
3
i decided to make a nicer dataset to use with my Verifiers implementation of both CyberSOCEval benchmarks Dataset: https://t.co/jX2Hc6kGNU Verifier:
app.primeintellect.ai
Verifiers implementation of CyberSOCEval
why does cybersoceval download PDF from wayback or the source and convert them to text/images for every user đź’€
2
3
19
1
0
19
why does cybersoceval download PDF from wayback or the source and convert them to text/images for every user đź’€
0
0
1
Please make sure you are only drinking as much water as you REALLY need. We need that for the datacenters. If you’re thirsty, grok is thirsty too.
111
6K
67K
if you are building a framework for research/experimentation, i think it’s important to avoid too much abstraction. even if you want to have fancy classes, i need easy visibility into the underlying prompts, tool calls, batching, etc. i’ve been burned too many times to trust your
1
1
15
curious about RL? learn to train an LLM with me in a couple weeks!
Go beyond superficial usage of #LLMs and #AI in this free training from Cobalt Strike and Outflank experts. Gain practical experience to architect AI-powered attack chains and navigate AI assisted adversary simulation. Register now! https://t.co/sEkgkbungA
0
2
27
Been a long time since I've written something for my blog. Recently got inspired to break down how a very basic evasion attack on a machine learning model might work. Check it out https://t.co/JOnvSPztev
steve-s.gitbook.io
An example evasion attack against (probably) the worst machine learning classifier of all time
3
37
126