Juanako.AI Profile
Juanako.AI

@fblgit

Followers
158
Following
165
Media
19
Statuses
429

https://t.co/WhEt0NQl0y - Uniform Neural Alignment (UNA) - Top Performance AI Lab

Singapore
Joined November 2021
Don't wanna be here? Send us removal request.
@fblgit
Juanako.AI
11 months
We trained miniclaus 1.5B with GRPO on GSM8k.. Increased GSM8k, GPQA, MMLU. Available in HF hub with GRPO code and reproduction details: https://t.co/TZYsA4VPVB
Tweet card summary image
huggingface.co
0
0
1
@fblgit
Juanako.AI
17 days
u know gpt5.2 or a new model is imminent.. gpt5.1 max-codex-ultra-mega became homer simpson, just as usual.. the model coming out aint noticeable better, otherwise why would u nerf your previous models?... ah?
0
0
0
@fblgit
Juanako.AI
21 days
1
0
3
@fblgit
Juanako.AI
1 month
GPT5.1 is always better than GPT5.0, and we always make sure of it. GPT5.1 indeed is better than GPT5.0 HomerSimpson Edition. Is GPT5.1 better than GPT5.0 NonHomerSimpson (1 week-ish ago)? Questionable.
0
0
0
@fblgit
Juanako.AI
2 months
the best way to sell AImodel 7.2 is by degrading AImodel 7.1 . i guess contaminating $self with fp32 answers.. is an effective way to maintain fp32 official scores.. but how to fix the fact that users feels talking to homer simpson: faith that lisa is coming out soon.
0
0
0
@fblgit
Juanako.AI
2 months
the hidden gem.. model selection aside, the best coding cli.. Amazon Q, lovely features
0
0
0
@fblgit
Juanako.AI
2 months
glm4.6 .. "count your mcp tools" 57, in reality 60.. half hour later back&forth... stills failing, cant count a simple list that is present in his context.. even when told him 60vs57.. he lies with fake sums 5+5=60 and stuff like that.. just like claude.. trashtalkers
0
0
0
@fblgit
Juanako.AI
3 months
Tired of wasting tokens on agents doing Input<>Output of tools? introducing Agentools for pydantic-ai, programatic approach for A2A. We added some freebies, like a UI to design and build agentoolkits, docs and tests :) https://t.co/AP4Y71zZB5
github.com
AgenTool is an Extension Agent Type for Pydantic-AI with a comprehensive ecosystem - fblgit/agentool
0
0
1
@fblgit
Juanako.AI
3 months
releasing ClaudeBench a tool to manage properly Claude Code work and perform more solid engineering work with it. We obtain better results using ClaudeBench with SpecKit than without it: consistency. #Anthropic #claudecode #claude @AnthropicAI https://t.co/dsdFqGvYOE
github.com
Claude Code Best Friend, a workbench for task management and swarm orchestration - fblgit/claudebench
0
0
2
@fblgit
Juanako.AI
3 months
releasing ClaudeBench a tool to manage properly Claude Code work and perform more solid engineering work with it. We obtain better results using ClaudeBench with SpecKit than without it: consistency. https://t.co/dsdFqGvYOE
github.com
Claude Code Best Friend, a workbench for task management and swarm orchestration - fblgit/claudebench
0
0
0
@fblgit
Juanako.AI
6 months
Model limits.. non accumulated.. i can use them all today and thats my limit.. if i dont use them tomorrow its my loss and their benefit.. So either you become "dependant", or "upgrade".. we missing github/fair-burn-tokens project to use all tokens 5m before they get restart..
0
0
0
@fblgit
Juanako.AI
6 months
There is a big difference between training models with your data and novelty classifiers that dispatches conversations to internal engineering teams to literally steal your project and idea. That's not model training, it's something else.. data thief bandits.
0
0
1
@ClementDelangue
clem 🤗
10 months
We crossed 1B+ tokens routed to inference providers partners on HF, that we released just a few days ago. Just getting started of course but early users seem to like it & always excited to be able to partner with cool startups in the ecosystem. Have you been using any
10
20
118
@MervinPraison
Mervin Praison
10 months
🚀 Can AI make $1M as a freelance software engineer? 💰 OpenAI’s SWE-Lancer puts AI to the test with 1,488 real-world freelance coding jobs from Upwork! 📊 But even the best model—Claude 3.5 Sonnet—only earned $400K. Is this the biggest AI freelancing opportunity? Or proof AI
4
4
37
@MervinPraison
Mervin Praison
10 months
1/3 🔹 How does SWE-Lancer work? ✅ AI tackles real Upwork coding jobs—worth $1M total ✅ Two categories: IC SWE tasks: AI writes & tests code. SWE Manager tasks: AI picks the best proposal. ✅ Graded with triple-verified end-to-end tests—no cherry-picked problems!
1
1
5
@fblgit
Juanako.AI
11 months
wonder why @perplexity_ai R1 is only half-way smart as the original https://t.co/7OYWYpy7b1 R1. Wonder wether this is a Quant version and not the good real R1.. because when contrasted R1 (#DeepSeek R1 web) vs O3 (og).. the metrics are very different vs R1 (perplexity) vs O3.
Tweet card summary image
chat.deepseek.com
Chat with DeepSeek AI.
0
0
0
@fblgit
Juanako.AI
11 months
@deepseek_ai free chat could be hosted in @huggingface space with a $logs>/dev/null code that everyone can see.. or @Cloudflare could #savethetech and not just the #savetheweb by serving R1 on his AI fleet with some free calls per day.. so we can separate tech from politics
0
0
1
@fblgit
Juanako.AI
11 months
* unsubscribed claude, coz is truly bad at AI research. * unsubscribed openai 200$ super mega, coz he always inject bugs. * never paid gemini or G's coz they desperately vacuum users data. * deepseek always under attack * subscribed to https://t.co/8O8h1DmEUA to use R1
Tweet card summary image
perplexity.ai
Perplexity is a free AI-powered answer engine that provides accurate, trusted, and real-time answers to any question.
1
0
1
@fblgit
Juanako.AI
11 months
kudos to #DeepSeekR1 @deepseek_ai for the model, but that thing of "hobby project based on spare compute" .. LOL.. GGWP.. look the stocks, plenty of folks believed it XDDDDD
0
0
0
@fblgit
Juanako.AI
1 year
true fact of today: AI hype StocksForEx rules everything... evals gets contaminated, benchmarks with invented maj512, non-sense published arxiv papers, shit models that has 35 trillions follows, underperforming datasets, learn this true fact: AI IS NOT A POWERPOINT
0
0
1