Carsten Kragelund
@nyxkrage
Followers
376
Following
1K
Media
54
Statuses
972
Always esoteric, always clever. Rock Whisper (Software Engineer). ❄️ Nix Devotee, DevOps is a sham by big tech 🐫 OCaml Evangelism Strike Force
Denmark
Joined October 2020
Opening emergency commissions! No theme/character restrictions, DM me if you're interested. Also open for short-term/long term gigs! Portfolio in the comments ⬇️ Sharing the post will be much appreciated, thank you 🙏
5
42
198
new blogpost after a long time! in this series i will talk about how to solve reinforcement learning for long-horizon tasks, incrementally from the most straightforward approaches. (link in replies!) in part I of this series, we throw RL at the cube in its most direct,
21
40
337
you can just create environments
creating an environment with verifiers is as simple as writing a load_environment function and filling out a pyproject.toml, both of which are initialized for you when you do `prime env init` environments are packages, and can be used with prime-rl, skyrl, tinker, and more :)
1
7
65
This is such a great idea that I implemented an ocaml port on my way back from Singapore It will be handy internally at ahrefs for all LLM structured outputs! https://t.co/TpGtVxpQ3S
JSON is token‑expensive for LLMs – just like @mattpocockuk frequently mentions. Meet TOON, the Token‑Oriented Object Notation. 💸 40–60% fewer tokens than JSON 📐 readable & tokenizer-aware Wrap your JSON with `encode` to save half the token cost: https://t.co/UoG9yHmgfg
4
4
26
It's only fine-tuning if it comes from the low-rank region of the parameter space. Otherwise it's just spicy retraining.
0
4
28
Nine years ago today, I interrupted the official RuneScape livestream with a profanity I regret screaming in the innocent ears of thousands.
73
123
2K
New blog post! This one is a purely theoretical one attempting identifying the central reason why LLMs suffer from mode collapse in RL and fail to generate novel or truly diverse outputs. It's actually a way more complicated problem than you think! Naively encouraging
30
74
674
in another life i would love to get stuck in an elevator with you
0
5
48
Last week, I covered 12 amazing, ✨new✨ games made for the #ZXSpectrum , a personal computer from the 80s. This week, I've got 12 MORE for you that might be even better.
1
3
10
The about:config flags to do this is 1. https://t.co/AutXm4yUQW.enabled to true 2. https://t.co/AutXm4yUQW.provider to any chat site that supports the ?q url parameter to start a new chat with a specified message 3. Then to use custom prompts/shortcuts set any of the
Fun fact: You can use Firefox and hack in custom prompts to use w/ alongside the sidebar menu where you can hook up Open webUI and local LLM models. Don't give into the OAI-goonery.
0
1
2
@Grad62304977 guy who is 19 years old: "I never really got 10+ year timelines"
20
8
276