
Mixedbread
@mixedbreadai
Followers
831
Following
187
Media
9
Statuses
49
Your fav. AI bakers! We're hiring!
San Francisco, CA
Joined March 2024
3/đź§µ Finding 2: Multimodal retrieval breaks the OCR ceiling. By "seeing" page images, multimodal retrieval outperformed all text methods, including perfect ground-truth text. It achieved an average NDCG@5 nearly 12% higher than perfect OCR.
2
0
5
2/đź§µ Finding 1: OCR creates a real performance ceiling for text-based RAG retrieval. Even the best OCR solutions fall ~4.5% short of ground-truth text (NDCG@5) on complex enterprise documents. It's a bottleneck you can't ignore. (Interesting side note: BM25 outperformed
3
3
15
Baked-in Brilliance: Reranking Meets RL 🍞. Meet mxbai-rerank-v2, our second-gen rerankers built on Qwen2.5 (thanks, @Alibaba_Qwen) & refined with GRPO from @deepseek_ai. They outperform open & closed-source models while staying fully open. 👇
10
33
216
RT @juliuslipp: just revamped the hiring page of Mixedbread a bit. we're looking for amazing people to join us doing:.- full stack (next.j….
0
7
0
RT @notrab: I posted a new tutorial on how to process embeddings with @mixedbreadai using @redpandadata and @tursodatabase ✨. Link in reply….
0
8
0