Onedroid
@ye_ack
Followers 1 · Following 247 · Media 1 · Statuses 173
Joined June 2017
Introducing EGGROLL (Evolution Guided General Optimization via Low-rank Learning)! Scaling backprop-free Evolution Strategies (ES) to billion-parameter models at large population sizes: 100x training throughput, fast convergence, and pure int8 pretraining of RNN LLMs.
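The trick the acronym points at is replacing the full-rank Gaussian perturbations of vanilla ES with low-rank ones. A minimal sketch of that idea, assuming an antithetic ES update and a user-supplied fitness function (the function names and rank normalization are ours, not the paper's):

```python
import numpy as np

def es_step_lowrank(theta, fitness_fn, pop=32, rank=4, sigma=0.02, lr=0.01):
    """One antithetic ES update on a weight matrix theta of shape (m, n).

    Vanilla ES samples a full (m, n) Gaussian per population member; here
    each perturbation is A @ B.T with A (m, r) and B (n, r), which is far
    cheaper to sample, store, and communicate.
    """
    m, n = theta.shape
    scores, noises = [], []
    for _ in range(pop):
        A = np.random.randn(m, rank)
        B = np.random.randn(n, rank)
        E = (A @ B.T) / np.sqrt(rank)        # low-rank noise, ~unit variance
        for sign in (1.0, -1.0):             # antithetic pair
            scores.append(fitness_fn(theta + sign * sigma * E))
            noises.append(sign * E)
    scores = np.asarray(scores)
    scores = (scores - scores.mean()) / (scores.std() + 1e-8)  # fitness shaping
    grad = sum(s * E for s, E in zip(scores, noises)) / len(scores)
    return theta + lr / sigma * grad

# Toy usage: evolve toward a target matrix without any gradients.
theta = np.zeros((8, 8))
target = np.ones((8, 8))
for _ in range(100):
    theta = es_step_lowrank(theta, lambda W: -np.sum((W - target) ** 2))
```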
Language Models are Provably Injective and Invertible! A groundbreaking paper challenges the long-held belief that LLMs lose information. They prove mathematically and show empirically across billions of tests that inputs map uniquely to representations, making them lossless.
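The empirical side of that claim is easy to probe at toy scale: feed a model distinct prompts and check whether any two final hidden states collide. A minimal sketch assuming GPT-2 via Hugging Face transformers (the model choice and tolerance are ours):

```python
import torch
from transformers import AutoModel, AutoTokenizer

tok = AutoTokenizer.from_pretrained("gpt2")
model = AutoModel.from_pretrained("gpt2").eval()

# Distinct prompts should map to distinct final-token states.
prompts = ["the cat sat", "the cat sit", "the act sat", "a cat sat"]
with torch.no_grad():
    states = [
        model(**tok(p, return_tensors="pt")).last_hidden_state[0, -1]
        for p in prompts
    ]

# Count near-collisions among all pairs of final-token states.
collisions = sum(
    torch.allclose(a, b, atol=1e-6)
    for i, a in enumerate(states) for b in states[i + 1:]
)
print(f"{collisions} near-collisions among {len(prompts)} prompts")
```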
Into the Rabbit Hull, Part II. Continuing our interpretation of DINOv2, the second part of our study concerns the geometry of concepts and the synthesis of our findings toward a new representational phenomenology: the Minkowski Representation Hypothesis.
A graph-powered, all-in-one RAG system! RAG-Anything is a graph-driven multimodal document-processing RAG system built on LightRAG. It supports all content modalities within a single integrated framework. 100% open-source.
"Aggressive Filtering aint good for larger training" Similar find also at
Second insight: Optimal filtering changes predictably with scale. Smaller models benefit from aggressive filtering (e.g., top 3% at 10²⁰ FLOPs), while larger models prefer larger, more diverse datasets (e.g., top 30% at 10²³ FLOPs). Specific rates vary by data pool, but the
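Taking the two quoted operating points at face value, the "predictable" part can be illustrated with a back-of-envelope log-log interpolation (the trend's shape is our assumption here, not the paper's fitted rule):

```python
import math

# Quoted anchors: keep top 3% at 1e20 FLOPs, top 30% at 1e23 FLOPs.
# Assume log(keep-rate) is linear in log(FLOPs) between them.
def optimal_keep_rate(flops):
    lo_c, lo_r = 1e20, 0.03
    hi_c, hi_r = 1e23, 0.30
    t = (math.log10(flops) - math.log10(lo_c)) / (math.log10(hi_c) - math.log10(lo_c))
    return 10 ** (math.log10(lo_r) + t * (math.log10(hi_r) - math.log10(lo_r)))

for c in (1e20, 1e21, 1e22, 1e23):
    print(f"{c:.0e} FLOPs -> keep top {optimal_keep_rate(c):.0%}")
```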
How can AI evolve from statically thinking about images → dynamically thinking with images as cognitive workspaces, similar to the human mental sketchpad? What's the research roadmap from tool-use → programmatic
UC Berkeley has two free courses on LLM Agents, at foundational and advanced levels. They feature some of the best lecturers from DeepMind, Meta, and top universities, and basically cover all you need to know about agents from the best resources out there.
Can diffusion models write code competitively? Excited to share our latest 7B coding diffusion LLM! With DiffuCoder, we explore how they decode, why temperature matters, and how to improve them via coupled-GRPO that speaks diffusion! Code: https://t.co/sWsb8a49HL
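For readers new to the family: masked diffusion LLMs decode by iteratively filling in masked positions rather than left-to-right. A hedged sketch of one common confidence-based sampler (the general technique, not DiffuCoder's exact algorithm; the HF-style `model(...).logits` interface and `mask_id` are assumptions):

```python
import torch

def diffusion_decode(model, x, mask_id, steps=8, temperature=0.8):
    """Confidence-based parallel decoding for a masked diffusion LM.

    Each step samples all masked positions, then commits only the ones
    the model is most confident about; temperature controls how peaked
    the per-token sampling is, which the thread says matters a lot.
    x: 1-D LongTensor of token ids, with mask_id at unknown positions.
    """
    for step in range(steps):
        masked = (x == mask_id).nonzero(as_tuple=True)[0]
        if masked.numel() == 0:
            break
        logits = model(x.unsqueeze(0)).logits[0, masked] / temperature
        probs = torch.softmax(logits, dim=-1)
        samples = torch.multinomial(probs, 1).squeeze(-1)
        conf = probs.gather(-1, samples.unsqueeze(-1)).squeeze(-1)
        # Commit the top-k most confident positions this step.
        k = max(1, masked.numel() // (steps - step))
        keep = conf.topk(k).indices
        x[masked[keep]] = samples[keep]
    return x
```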
The Ultimate Toolkit for Working with LLMs! Transformer Lab lets you train, fine-tune, and chat with any LLM, 100% locally. Enjoy 1-click LLM downloads and a drag-and-drop UI for RAG. 100% open-source.
LLMs trained with reinforcement learning shed their randomness almost at once, then their scores stall. This paper shows that the randomness drop is predictable and fixable, so bigger gains are still on the table. The authors fit an exponential link between entropy and reward. Two
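The post only names the fit's shape, but an exponential entropy-reward link is simple to reproduce on your own training logs. A sketch with one plausible parameterization, R = -a·exp(H) + b, and made-up illustrative data (both are our choices, not the paper's):

```python
import numpy as np
from scipy.optimize import curve_fit

# R = -a * exp(H) + b: reward saturates as entropy H collapses toward 0.
def link(H, a, b):
    return -a * np.exp(H) + b

# Hypothetical (entropy, reward) measurements from a training run.
H = np.array([1.2, 0.9, 0.6, 0.4, 0.25, 0.15, 0.1])
R = np.array([0.10, 0.22, 0.31, 0.36, 0.40, 0.42, 0.43])

(a, b), _ = curve_fit(link, H, R)
print(f"fit: R = -{a:.3f} * exp(H) + {b:.3f}")
print(f"predicted ceiling as H -> 0: {link(0.0, a, b):.3f}")
```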
CVPR 2025 papers, pt. 2: SAMWISE. SAMWISE adds language understanding and temporal reasoning to SAM2; you can segment and track objects in videos just by describing them. More papers: https://t.co/1VlLn2BWxl
Vision transformers have high-norm outliers that hurt performance and distort attention. While prior work removed them by retraining with "register" tokens, we find the mechanism behind outliers and make registers at test time, giving clean features and better performance!
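The headline move is that the fix no longer needs retraining: outlier activations can be detected and shunted into an extra token at inference. A simplified sketch of that idea (the norm threshold `k` and the mean-replacement rule are our simplifications, not the paper's exact mechanism):

```python
import torch

def add_test_time_register(tokens, k=3.0):
    """Move high-norm outlier patch tokens into an appended 'register'.

    tokens: (num_patches, dim) activations from some ViT layer.
    Outliers are patch tokens whose norm sits k standard deviations
    above the mean; their slots are patched with the mean of the
    remaining clean tokens so the feature map stays smooth.
    """
    norms = tokens.norm(dim=-1)
    outlier = norms > norms.mean() + k * norms.std()
    register = (tokens[outlier].mean(dim=0, keepdim=True)
                if outlier.any() else tokens.mean(dim=0, keepdim=True))
    cleaned = tokens.clone()
    cleaned[outlier] = tokens[~outlier].mean(dim=0)  # fill outlier slots
    return torch.cat([cleaned, register], dim=0)     # register appended last
```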
I learnt a lot from O'Reilly books, so this is surreal: I'm writing a book with amazing people @micuelll @andimarafioti @orr_zohar about VLMs with @huggingface. Early Access (the first two chapters, in raw form) is available to everyone; we'd love to have your feedback!
Excited to share the most inspiring work I've been part of this year: "Learning to Reason without External Rewards". TL;DR: We show that LLMs can learn complex reasoning without access to ground-truth answers, simply by optimizing their own internal sense of confidence. 1/n
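"Internal sense of confidence" suggests a reward the model computes from its own output distribution. One natural reading, sketched below, scores how far each step's token distribution sits from uniform; the paper's exact definition may differ:

```python
import math
import torch
import torch.nn.functional as F

def self_certainty(logits):
    """Mean KL(p || uniform) over the generated tokens, i.e. log(V)
    minus each step's entropy. Peaked (confident) distributions score
    higher; a flat, unsure model scores near zero.

    logits: (seq_len, vocab) over the model's own sampled continuation.
    """
    logp = F.log_softmax(logits, dim=-1)
    entropy = -(logp.exp() * logp).sum(-1)          # per-step entropy
    return (math.log(logits.size(-1)) - entropy).mean()
```

In a loop like the one the thread describes, this scalar would stand in for the external reward inside a standard policy-gradient update.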
Here's "Let's Build a Simple Database"! It's a bit outdated, but it still perfectly covers the basics of getting a SQLite clone going in C. You'll learn a lot about databases and C. Enjoy!
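The tutorial's central data-layout trick translates to any language: serialize rows at a fixed size so the N-th row lives at a computable offset inside 4 KB pages. A Python sketch of that idea (the tutorial itself does this in C; the id/username[32]/email[255] schema mirrors its example):

```python
import os
import struct

ROW = struct.Struct("I32s255s")    # id, username[32], email[255] -> 291 bytes
PAGE_SIZE = 4096
ROWS_PER_PAGE = PAGE_SIZE // ROW.size

def row_offset(n):
    """Rows never span pages, exactly as in the tutorial's pager."""
    page, slot = divmod(n, ROWS_PER_PAGE)
    return page * PAGE_SIZE + slot * ROW.size

mode = "r+b" if os.path.exists("mydb.db") else "w+b"
with open("mydb.db", mode) as f:
    f.seek(row_offset(0))
    f.write(ROW.pack(1, b"alice", b"alice@example.com"))
    f.seek(row_offset(0))
    rid, user, email = ROW.unpack(f.read(ROW.size))
    print(rid, user.rstrip(b"\x00"), email.rstrip(b"\x00"))
```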
OpenMemory MCP provides a persistent memory layer for AI tools like Claude, Cursor and Windsurf. It enables AI Agents to securely read and write to a shared memory. Runs 100% locally on your computer.
A collection of awesome MCP servers for AI Agents:
This week's top AI/ML research papers:
- Absolute Zero
- RM-R1
- Seed-Coder
- Flow-GRPO
- ZeroSearch
- Ming-Lite-Uni
- A Survey on Large Multimodal Reasoning Models
- On Path to Multimodal Generalist
- HunyuanCustom
- Unified Multimodal CoT Reward Model through