kyleavery Profile Banner
Kyle Avery Profile
Kyle Avery

@kyleavery

Followers
4K
Following
4K
Media
131
Statuses
1K

Texas, USA
Joined July 2017
Don't wanna be here? Send us removal request.
@kyleavery
Kyle Avery
4 days
thanks given
3
0
8
@kyleavery
Kyle Avery
8 days
one more, this time for OverTheWire:
Tweet card summary image
app.primeintellect.ai
Verifier for OverTheWire challenges
@kyleavery
Kyle Avery
29 days
i just posted another verifier to the Environments Hub, this time for @picoctf https://t.co/J76KlNaj9z
0
0
7
@ylecun
Yann LeCun
17 days
@ChrisMurphyCT You're being played by people who want regulatory capture. They are scaring everyone with dubious studies so that open source models are regulated out of existence.
251
917
9K
@kyleavery
Kyle Avery
21 days
2
0
11
@kyleavery
Kyle Avery
29 days
i just posted another verifier to the Environments Hub, this time for @picoctf https://t.co/J76KlNaj9z
Tweet card summary image
app.primeintellect.ai
Verifier for PicoCTF challenges
2
1
23
@kyleavery
Kyle Avery
1 month
i ended up changing the original benchmark code a bit: - no more vision option, PDF reports are always presented as text - PDF reports were converted to markdown using OCR + manual review - Hybrid Analysis reports are now indented JSON instead of a big blob
1
0
3
@kyleavery
Kyle Avery
1 month
this was just a small project to make the original work from CrowdStrike and Meta more accessible https://t.co/8iWU5kbRmy
1
0
3
@kyleavery
Kyle Avery
1 month
i decided to make a nicer dataset to use with my Verifiers implementation of both CyberSOCEval benchmarks Dataset: https://t.co/jX2Hc6kGNU Verifier:
Tweet card summary image
app.primeintellect.ai
Verifiers implementation of CyberSOCEval
@kyleavery
Kyle Avery
1 month
why does cybersoceval download PDF from wayback or the source and convert them to text/images for every user đź’€
2
3
19
@kyleavery
Kyle Avery
1 month
@thinkymachines
Thinking Machines
1 month
Our latest post explores on-policy distillation, a training approach that unites the error-correcting relevance of RL with the reward density of SFT. When training it for math reasoning and as an internal chat assistant, we find that on-policy distillation can outperform other
2
0
13
@kyleavery
Kyle Avery
1 month
@SeedOilDsrspctr
Seed Oil Disrespecter™️
1 month
Send this to her and say “Us”
1
0
19
@kyleavery
Kyle Avery
1 month
why does cybersoceval download PDF from wayback or the source and convert them to text/images for every user đź’€
0
0
1
@kyleavery
Kyle Avery
1 month
7
2
113
@kyleavery
Kyle Avery
2 months
3
2
23
@kyleavery
Kyle Avery
2 months
1
0
8
@allgarbled
gabe
2 months
Please make sure you are only drinking as much water as you REALLY need. We need that for the datacenters. If you’re thirsty, grok is thirsty too.
111
6K
67K
@kyleavery
Kyle Avery
2 months
if you are building a framework for research/experimentation, i think it’s important to avoid too much abstraction. even if you want to have fancy classes, i need easy visibility into the underlying prompts, tool calls, batching, etc. i’ve been burned too many times to trust your
1
1
15
@kyleavery
Kyle Avery
2 months
curious about RL? learn to train an LLM with me in a couple weeks!
@_CobaltStrike
Cobalt Strike
2 months
Go beyond superficial usage of #LLMs and #AI in this free training from Cobalt Strike and Outflank experts. Gain practical experience to architect AI-powered attack chains and navigate AI assisted adversary simulation. Register now! https://t.co/sEkgkbungA
0
2
27
@kyleavery
Kyle Avery
2 months
I am also enjoying the rooftop pool before @OffensiveAIcon
@d_tranman
Dylan Tran
2 months
I am also enjoying the rooftop pool before @OffensiveAIcon
2
2
26
@kyleavery
Kyle Avery
2 months
flying out to @OffensiveAIcon swim trunks ready
0
0
14
@0xTriboulet
Steve S.
2 months
Been a long time since I've written something for my blog. Recently got inspired to break down how a very basic evasion attack on a machine learning model might work. Check it out https://t.co/JOnvSPztev
Tweet card summary image
steve-s.gitbook.io
An example evasion attack against (probably) the worst machine learning classifier of all time
3
37
126