Frank Liu Profile
Frank Liu

@frankzliu

Followers
577
Following
607
Media
26
Statuses
591

Professional presser of buttons on computer keyboards @VoyageAI @MongoDB

High-dimensional vector space
Joined December 2021
Don't wanna be here? Send us removal request.
@frankzliu
Frank Liu
7 months
Why pick and choose between MRL and quantization when you can have both? 🤓.
@VoyageAI
Voyage AI by MongoDB
7 months
📢 Announcing the new SOTA voyage-3-large embedding model!. • 9.74% over OpenAI and +20.71% over Cohere.• flexible dim. (256-2048) and quantizations (float, int8, binary).• 8.56% over OpenAI with 1/24x storage cost.• 1.16% over OpenAI with 1/192x storage cost ($10K → $52)
Tweet media one
3
0
7
@frankzliu
Frank Liu
2 days
RT @MongoDB: Our Multimodal Search Python Library is now in public preview. Giving developers a single interface to build applications tha….
0
5
0
@frankzliu
Frank Liu
2 months
The OG post:
Tweet media one
0
0
2
@frankzliu
Frank Liu
2 months
I feel like I've seen this kind of retrieval plot somewhere before 😂.
@MistralAI
Mistral AI
2 months
Introducing Codestral Embed, the new state-of-the-art embedding model for code.
Tweet media one
2
1
2
@frankzliu
Frank Liu
2 months
RT @VoyageAI: 📢 Meet voyage-3.5 and voyage-3.5-lite!.• flexible dim. and quantizations.• voyage-3.5 & 3.5-lite (int8, 2048 dim.) are 8% & 6….
0
12
0
@frankzliu
Frank Liu
4 months
RT @continuedev: @metcalfc wrote a deep dive on why your custom AI code assistant should include embeddings and a reranker from @VoyageAI🥇….
0
2
0
@frankzliu
Frank Liu
5 months
RT @Coffee_and_NLP: Among other things that make my day, one of them is a great podcast conversation with my guests. Thanks @frankzliu for….
0
1
0
@frankzliu
Frank Liu
5 months
RT @HanchungLee: @jobergum @JinaAI_ 's definition of deep research is shallow. openai's deep research is a trained system. stanfords storm….
0
1
0
@frankzliu
Frank Liu
5 months
Congrats @nomic_ai! Great to see MoE coming to embedding models.
@nomic_ai
Nomic AI
5 months
Nomic Embed Text V2 is now available. - First general purpose Mixture-of-Experts (MoE) embedding model.- SOTA performance on the multilingual MIRACL benchmark for its size.- Support for 100+ languages.- Truly open source - open training data, weights, & code.- Apache 2.0 License
Tweet media one
0
1
2
@frankzliu
Frank Liu
5 months
Cosine similarities greater than 1 == "beautiful results" 🤣.
@bo_wangbo
Bo
5 months
Got some beautiful results with our trial jina-embeddings-v4 checkpoint on modality gap:
Tweet media one
1
0
3
@frankzliu
Frank Liu
6 months
Apparently, 67.4 is "#1 on CoIR Benchmark". It doesn't even beat @VoyageAI's general-purpose embedding model.
Tweet media one
@SFResearch
Salesforce AI Research
6 months
🚨🚨🚨Just released!🚨🚨🚨 . 🚀Introducing the Salesforce Code Embedding Model Family (SFR-Embedding-Code), ranked #1 on CoIR Benchmark! 🚀. Available in 2 sizes: 2B, 400M. Key Highlights:. 1️⃣ 2B Model: Achieves #1 on CoIR. 2️⃣400M Model: Best-performing model under 0.5B.
0
0
7
@frankzliu
Frank Liu
6 months
RT @flo_re2003: Had a really interesting discussion about agentic retrieval last night at a RAG event at @ExaAILabs with @frankzliu from @V….
0
1
0
@frankzliu
Frank Liu
6 months
RT @michael_chomsky: hyped for the reranking event I'm throwing in sf next thursday:. speakers from:.@ExaAILabs, which just purchased a sup….
0
3
0
@frankzliu
Frank Liu
7 months
RT @TimescaleDB: 🚀 General vs. Domain-Specific: Which Embedding Model Should You Choose for Your RAG App?. We tested OpenAI’s text-embeddin….
0
7
0
@frankzliu
Frank Liu
8 months
voyage-code-3 is one of the first embedding models trained with both Matryoshka learning as well as quantization awareness. More in our blog post:
Tweet card summary image
blog.voyageai.com
TL;DR – Introducing voyage-code-3, our next-generation embedding model optimized for code retrieval. It outperforms OpenAI-v3-large and CodeSage-large by an average of 13.80% and 16.81% on a suite …
@VoyageAI
Voyage AI by MongoDB
8 months
📢 Announcing voyage-code-3 embedding model!. 1. more accurate: + 14% gain over OpenAI-v3-large.2. flexible dimension (Matryoshka): 256-2048.3. quantized embeddings: float, int8, binary.4. new Pareto frontier: (binary,256 dim.) is 6% better than OpenAI (float,3072 dim.) 🧵🧵
Tweet media one
1
0
6
@frankzliu
Frank Liu
8 months
RT @VoyageAI: Vector-based code retrieval is a critical building block in code assistants and agents. However, many people complained about….
0
14
0
@frankzliu
Frank Liu
8 months
Huge props to our partner @AnthropicAI!. @OpenAI is the new Netscape.
Tweet media one
4
6
51
@frankzliu
Frank Liu
8 months
RT @jobergum: I’m glad that there is more interest from embedding providers. Voyager was the first, next up Jina and Nomic? Cohere also.
0
2
0
@frankzliu
Frank Liu
8 months
Recently, I've shared how I believe that "native multimodality" is the future. voyage-multimodal-3, trained end-to-end on text, photos, figures, PDFs, PPTs, and more, is the first embedding model that fits this concept. No more unstructured data ETL. Screenshot is all you need.
@VoyageAI
Voyage AI by MongoDB
8 months
📢 Announcing voyage-multimodal-3, our first multimodal embedding model!. It vectorizes interleaved text & images, capturing key visual features from screenshots of PDFs, slides, tables, figures, etc. 19.63% accuracy gain on 3 multimodal retrieval tasks (20 datasets)! 🧵🧵
Tweet media one
1
3
35
@frankzliu
Frank Liu
9 months
RT @VoyageAI: Thrilled to share that we've closed $28M in funding, led by @CRV, with continued support from @wing_vc and @saranormous. Also….
Tweet card summary image
voyageai.com
Voyage AI provides cutting-edge embedding models and rerankers for search and retrieval
0
35
0