
DailyPapers
@HuggingPapers
Followers
5K
Following
8
Media
742
Statuses
2K
Tweeting interesting papers submitted at https://t.co/rXX8x0HzXV. Submit your own at https://t.co/QhbJKXBd4Q, and link models/datasets/demos to it!
Anywhere
Joined March 2025
Beyond Transcription: A new paper introduces mechanistic interpretability to ASR, revealing hidden internal dynamics and biases in how models process speech.
huggingface.co
0
0
2
Discover how Discrete Diffusion VLA brings a unified, scalable architecture to robot action decoding. Its adaptive easy-to-hard strategy & robust error correction improve over prior methods. Read the paper for full details:.
huggingface.co
0
0
1
Vision-SR1 decomposes visual reasoning into perception and language, allowing the VLM to self-reward and learn. This boosts visual reasoning and drastically reduces hallucinations & language shortcuts!. Paper: Code:
github.com
Reinforcement Learning of Vision Language Models with Self Visual Perception Reward - zli12321/Vision-SR1
0
0
4
VoxHammer introduces Edit3D-Bench, a new human-annotated dataset for evaluating 3D editing consistency!. Experience truly flexible 3D local editing. Read the paper: Try the demo: Dataset:
huggingface.co
0
1
5
OpenAI just released HealthBench on Hugging Face. This new dataset is designed for rigorously evaluating large language models' capabilities in improving human health. A vital step for AI in medicine!.
huggingface.co
3
16
116
TreePO boosts LLM reasoning with up to 43% faster training by using a heuristic tree-based search!. Dive into the paper & explore checkpoints/data on the Hub:.🔗 🗂️
huggingface.co
0
0
3
With 520+ grad-level problems & a new SEED metric, CMPhysBench reveals a huge gap: Grok-4 scores only 28%!. Dive into the data-driven future of science. Paper: Dataset:
huggingface.co
0
0
5
It synthesizes up to 90 min of speech with 4 speakers, capturing the authentic "vibe" of dialogue. VibeVoice's continuous speech tokenizer enables 80x data compression. Paper: Collection: Demo:
huggingface.co
0
0
2
Asteromorph Corp unveils Spacer: an AI system engineering scientific inspiration to generate novel, factually-grounded research concepts.
huggingface.co
0
2
10
MMTok formulates token selection as a maximum coverage problem, preserving 87.7% F1 with just 4 vision tokens on POPE datasets!. This training-free method uses both vision and text for smarter pruning. Paper:
huggingface.co
0
0
4
Dive into 18k+ human assessments from 1,000+ global annotators. Each entry includes prompts, 4 candidate responses, and detailed feedback with rationales for personal and world views. Explore the dataset: Read the blog post:
openai.com
We surveyed over 1,000 people worldwide on how our models should behave and compared their views to our Model Spec. We found they largely agree with the Spec, and we adopted changes from the disagr...
0
1
3