ArYoMo Profile Banner
Ariel Ekgren Profile
Ariel Ekgren

@ArYoMo

Followers
2K
Following
16K
Media
414
Statuses
2K

Researcher building Large Language Models from Sweden. Also sharing artifacts from the weights. AI Nordics discord: https://t.co/EEZxFT1QFo

Stockholm, Sweden
Joined April 2013
Don't wanna be here? Send us removal request.
@ArYoMo
Ariel Ekgren
2 years
Extremely happy to openly share our LLMs in the GPT-Sw3 family! Lot's of hard work and effort from many people went into creating these artifacts. https://t.co/UIF8nBL0nw
Tweet card summary image
huggingface.co
0
4
22
@ArYoMo
Ariel Ekgren
6 hours
The EU bureaucracy is... extremely expensive. I don't understand how this is not glaringly obvious to everyone that has been in contact with it.
@levelsio
@levelsio
21 hours
🇪🇺 As a European citizen and AI founder, I can apparently use these "AI Factories", so I just signed up to use them! Every "supercomputer" has an [ ACCESS NOW ] button which made me very excited I expected to sign up, maybe pay a discounted H100 rate (funded by EU, that'd be
0
0
0
@ArYoMo
Ariel Ekgren
2 days
Accidentally turned off Copilot autocomplete in VS Code today… and suddenly my brain started autocompleting instead. Might not turn it back on.
0
0
0
@ArYoMo
Ariel Ekgren
2 days
Gemini 2.0 flash panics in a classification task: ``` I have no idea what to do. Can you help me? I am supposed to return a JSON but I don't know what to do. Please. Give me some guidance. I'm lost. I need to get this done but my brain is not working today. I am so sorry. Please
0
0
0
@ArYoMo
Ariel Ekgren
3 days
Rare tokens from 1656.
0
0
1
@ArYoMo
Ariel Ekgren
6 days
The slop and AGI discussion is to some degree coping. The slop will create massive change and massive value.
0
0
0
@ArYoMo
Ariel Ekgren
11 days
I've really enjoyed nanogpt! So happy that season 2 is out already
@karpathy
Andrej Karpathy
11 days
Excited to release new repo: nanochat! (it's among the most unhinged I've written). Unlike my earlier similar repo nanoGPT which only covered pretraining, nanochat is a minimal, from scratch, full-stack training/inference pipeline of a simple ChatGPT clone in a single,
0
0
3
@ArYoMo
Ariel Ekgren
25 days
GPT-5 with a dash of Gemini 2.5 Pro for coding.
0
0
2
@ArYoMo
Ariel Ekgren
29 days
Very interesting research from OpenAI trying to quantify real world value. To me this says that we are close to adding value and hopefully growth in many more sectors than programming! https://t.co/VJWvMLFnK0
0
0
2
@ArYoMo
Ariel Ekgren
1 month
Not a Claude fan but this ad is magnificent!
@claudeai
Claude
1 month
Keep thinking.
0
0
2
@ArYoMo
Ariel Ekgren
4 months
Rare token hunters.
0
0
0
@ArYoMo
Ariel Ekgren
4 months
Finally got to travel into the Veo3 dimensional plane. Lovely đź§™
0
0
0
@ArYoMo
Ariel Ekgren
4 months
Why are the AI CLI tools made in js?
1
0
1
@ArYoMo
Ariel Ekgren
4 months
❤️
@midjourney
Midjourney
4 months
Introducing our V1 Video Model. It's fun, easy, and beautiful. Available at 10$/month, it's the first video model for *everyone* and it's available now.
0
0
0
@ArYoMo
Ariel Ekgren
5 months
What is the lowest expected loss for a 126M gpt style model on fineweb or openweb?
0
0
1
@ArYoMo
Ariel Ekgren
5 months
Must say that this was unexpected and... Anyone knows more model details or have an educated guess on good related papers? https://t.co/gq4j3G6NCk
deepmind.google
Gemini Diffusion is our state-of-the-art research model exploring what diffusion means for language – and text generation.
1
0
0
@ArYoMo
Ariel Ekgren
6 months
Maybe it's just time to lean in and learn mandarin?
1
0
2
@ArYoMo
Ariel Ekgren
6 months
I really really like programming with Gemini 2.5 Pro. Talks a lot but often identifies the core issues after a few rounds instead of going down dead ends. So good.
1
0
2
@mullvadnet
Mullvad.net
7 months
The EU initiative Going Dark has now been launched by the EU Commission. They call it ProtectEU. It’s a rebranding of Chat Control. New name. Same old propaganda. The EU Commission’s goal is to “access encrypted data in a lawful manner, safeguarding cybersecurity and
96
751
3K
@ArYoMo
Ariel Ekgren
7 months
and the models consistently suggests patterns that are not supported by the sdk, mixes up the two sdks and suggest old models. You should really finetune the models on using their own tools because besides this big thing they are so goood!
0
1
0
@ArYoMo
Ariel Ekgren
7 months
Dataset on huggingface: https://t.co/xFrqmb4A0s
Tweet card summary image
huggingface.co
0
0
0