vespaengine Profile Banner
vespa.ai Profile
vespa.ai

@vespaengine

Followers
3K
Following
754
Media
27
Statuses
501

https://t.co/abkb8IjPSH - the open source platform for combining data and AI, online. Vectors/tensors, full-text, structured data; ML model inference at scale.

Joined September 2017
Don't wanna be here? Send us removal request.
@vespaengine
vespa.ai
1 year
The Singaporean government has deployed Vespa to search every word ever said in their Parliament. "A good decision is an informed one [. ] The heart of a good RAG system is a good search engine to retrieve the relevant data chunks for ingestion".Many teams are racing to make use
Tweet media one
Tweet media two
2
15
87
@vespaengine
vespa.ai
2 years
We have become our own company!.Expect even more features, even faster.
4
14
64
@vespaengine
vespa.ai
8 years
Announcing the open sourcing of Vespa, Yahooโ€™s Big Data Processing and Serving Engine at
3
61
57
@vespaengine
vespa.ai
2 years
Announcing our $31 million raise from Blossom Capital We'll spend it all on features.
1
6
46
@vespaengine
vespa.ai
3 years
Spotify launches semantic search in Podcasts, powered by
@SpotifyEng
Spotify Engineering
3 years
Hereโ€™s the problem: you want to search for a podcast, but you canโ€™t remember the name, only what itโ€™s about.๐Ÿ˜– Here comes Natural Search ๐Ÿ” to save the day! .
Tweet media one
0
5
43
@vespaengine
vespa.ai
2 years
LangChain Webinar on retrieval for LLMs with @lateinteraction and our very own @jobergum in six hours. Save your spot at
1
6
31
@vespaengine
vespa.ai
5 years
Choosing an algorithm for fast vector search for big data serving. Read if you are interested in combining fast vector search with filters, text, and real-time updates. Or in how professionals choose algorithms from the literature for production usage.
0
12
30
@vespaengine
vespa.ai
1 year
Having trouble keeping up? . Guidebook to the State-of-the-Art Embeddings and Information Retrieval .by @aapo_tanskanen at @thoughworks is out today - a great resource to get up to date.
Tweet media one
0
9
26
@vespaengine
vespa.ai
3 years
Vespa 8 is here!
0
6
26
@vespaengine
vespa.ai
4 years
Search is going through a paradigm shift -.neural methods are outperforming traditional methods by a wide margin. Can you use it real production systems?.[1/4]
Tweet media one
1
8
26
@vespaengine
vespa.ai
5 years
What the world need most now is more research on #covid19, faster. We've created to help with that. It lets researchers find research papers by combining text and structured search with exploring by semantic similarity using the scibert-nli model.
@Thiagogm
Thiago Guerrera
5 years
The @vespaengine team released based on the CORD-19 dataset released by the @allen_ai. Since everything is open-sourced, you can contribute to the project in multiple ways. ๐Ÿ‘‡. #NLP #NLProc #SearchEngine #COVIDใƒผ19.
0
14
26
@vespaengine
vespa.ai
4 years
Do you want to work on .- search relevance .- recommendation.- personalization . using .- machine-learning.- embeddings .- vector or hybrid retrieval . but are kinda tired of all the plumbing?. Thread time ๐Ÿ‘‡.
2
7
24
@vespaengine
vespa.ai
1 year
After extensive benchmarking, Marqo migrated their underlying engine from OpenSearch to Vespa: . "For Marqo 2, we looked at a number of open source and proprietary vector databases, including Milvus, Vespa, OpenSearch (AWS managed), Weaviate, Redis, and Qdrant.".
1
1
24
@vespaengine
vespa.ai
3 years
We made it simpler to create semantic search applications on Vespa.- No need to create vectors yourself.- No code needed.- No limits, once you want to grow add features.
3
2
24
@vespaengine
vespa.ai
7 years
Introducing TensorFlow support .
0
12
21
@vespaengine
vespa.ai
6 years
We're starting a new series of posts where we provide a complete and scalable Vespa application with a frontend for a popular use case. First out: Shopping sites! .
0
8
19
@vespaengine
vespa.ai
1 year
Follow this guide if you need to build a personal data app which .- costs about 5% of a traditional vector database.- contrary to those, finds all the user's data.- scales to millions (or billions) of users.
@jobergum
Jo Kristian Bergum
1 year
Hands-on RAG guide for personal data with @vespaengine and @llama_index ๐Ÿ˜ . Featuring LLamaIndex retrievers, Vespa streaming mode for personal data, built-in embedders, multi-index RAG (federation + blend), hybrid search, and rank fusion strategies.
1
4
19
@vespaengine
vespa.ai
7 years
Releasing support for ONNX models (Caffe2, PyTorch). All the same optimizations making it fast as TensorFlow models since we compile into the same execution engine
0
9
19
@vespaengine
vespa.ai
2 years
Announcing support for Maximum Inner Product Search (MIPS) in Vespa. โœ… Search efficiently for vectors with the highest dot product, by an innovative extension to HNSW indexes. โœ… With full realtime indexing, and no need to normalize vectors.
0
4
19
@vespaengine
vespa.ai
5 years
Announcing approximate nearest neighbor vector search in Vespa - State of the art performance from a native Vespa implementation of HNSW.- Vector search combines efficiently with filters and text search.- Infinitely scalable.- Real-time updates of vectors.
0
12
19
@vespaengine
vespa.ai
1 year
Things we've been up to in May:. ๐Ÿ‘“LLM integration: Build complete RAG applications on Vespa. ๐ŸŽฐEmbed to multiple representations at once: An oom cheaper vector search by creating a small vector for search and a larger for ranking. ๐ŸŽฏCombine fuzzy search and prefix match: Ideal.
1
3
18
@vespaengine
vespa.ai
1 year
Recommender systems need to multiply very large sparse matrices. e-commerce platform @farfetch leverages Vespa's support for sparse and dense tensors + vector search to do this online in less then 100 milliseconds. ๐Ÿงต.
1
1
19
@vespaengine
vespa.ai
2 years
Have you ever had to index trillions of documents in your vector database?. This is the problem Yahoo Mail is facing, and believe us it can get expensive. Luckily this is personal data, so it's possible to use Vespa's Vector Streaming Search which makes the problem tractable.
1
2
16
@vespaengine
vespa.ai
1 year
The latest Vespa newsletter is here to help you stay up to date on what's happening on the leading edge in RAG, IR and vector search:.- A new SPLADE embedder.- ONNX models with float16.- @cohere embedding model guides.- Support for an array of chunks with ColBERT.- And list of.
0
3
18
@vespaengine
vespa.ai
2 years
Very informative post on choosing Vespa for personalization using vector embeddings
Tweet media one
@dainius_jocas
Dainius Jocas
2 years
Blog post on how โฆโฆ@vintedโฉ started using โฆ@vespaengineโฉ for recommendations
2
5
15
@vespaengine
vespa.ai
1 year
Seems everybody is migrating their search and recommendation systems from Elastic to Vespa now. Here's the experience of Stanby, Japan's leading job search site:
0
5
15
@vespaengine
vespa.ai
2 years
When you are doing nearest neighbor vector retrieval you are doing *search*, which is computation over data. The llm builders community is currently speedrunning the process of discovering what this takes. If you are reading this, congrats you're likely ahead of the curve :-).
@HanchungLee
Han
2 years
Vector databases are not search engines. Reasoning over metadata is easily achievable with COT + @elastic , @vespaengine , or other search engines.
0
0
15
@vespaengine
vespa.ai
2 years
Vector databases are not suited for personal data: You need complete results and much lower cost. We're announcing vector streaming search to solve this problem: Fully production ready, and open source, of course.
1
5
16
@vespaengine
vespa.ai
7 years
Benchmark results! Scaling TensorFlow model evaluation with Vespa
0
14
15
@vespaengine
vespa.ai
5 years
New blog post: Creating a state of the art question-answering system using
1
12
16
@vespaengine
vespa.ai
5 months
Startups running on Vespa Cloud raised more than $750M during 2024.
1
1
14
@vespaengine
vespa.ai
8 months
What we have been up to in the Vespa HQ in August:. ๐ŸŽ๏ธ 30x faster MaxSim with Hamming distance for multivector documents. ๐Ÿง‘โ€๐Ÿ’ปIDE support for VSCode, IntelliJ, PyCharm, WebStorm and neovim. ๐ŸEven more pyVespa improvements.
0
1
14
@vespaengine
vespa.ai
2 years
Vector search is more than ranking by nearest neighbor. Run efficient transformer-based embedding retrieval on your laptop ๐Ÿ’ป.
0
2
14
@vespaengine
vespa.ai
4 years
A good writeup on building a search product end to end "[Vespa] is *the* search engine to use on mutable data sets with modern ranking methods".
0
4
14
@vespaengine
vespa.ai
2 years
Introducing a global ranking phase in
Tweet media one
0
0
14
@vespaengine
vespa.ai
2 years
Frozen embeddings ๐Ÿฅถ. A guest post on the Vespa blog by Andrii Yurkiv.
1
3
14
@vespaengine
vespa.ai
4 years
New blog post! . Joining real-time click data and content at query time without impacting performance. Impossible? Not with
0
2
14
@vespaengine
vespa.ai
8 years
A slide deck on Vespa
1
5
14
@vespaengine
vespa.ai
8 years
New vespa.ai blog post: The basics of Vespa applications
0
13
14
@vespaengine
vespa.ai
5 years
Getting started with machine-learned ranking using Vespa
0
10
13
@vespaengine
vespa.ai
2 years
Vespa continues to empower developers who are building something real with vector embeddings. No custom code required to.- use any embedding model from @huggingface.- do multilingual embedding.- run embedders on GPUs.
1
0
13
@vespaengine
vespa.ai
7 years
Is it possible to do distributed joins in real time without sacrificing performance? It depends - is your use case of the parent-child type, and do you run the latest Vespa?
0
8
11
@vespaengine
vespa.ai
7 years
We made neural net evaluation 20x faster.
0
8
13
@vespaengine
vespa.ai
2 years
What we've been up to during summer at the hq. - A multi-lingual embedding sample app using E5.- to_epoch_second in the indexing language.- New pyvespa and Vespa CLI features.- ML model persistence across deployments.- and more.
0
4
13
@vespaengine
vespa.ai
2 years
Announcing support for ANN with multiple vectors per document in
1
1
11
@vespaengine
vespa.ai
4 years
Can machines reliably answer questions in natural language?. It turns out they can, and the best methods from research are public. Seems like a game-changer, so why isn't this more widespread?
Tweet media one
1
7
13
@vespaengine
vespa.ai
7 years
How and why Zedge migrated their autosuggest, search and recommendation to Vespa
0
4
12
@vespaengine
vespa.ai
3 years
Why you can't conclude a machine-learned model will scale in production just because it is fast
1
4
12
@vespaengine
vespa.ai
9 months
DONE from our โ˜€๏ธsummerโ˜€๏ธ todo list :.- pyVespa deploy_to_prod. - vespa log CLI command also for self-hosted. - Faster multithreaded hybrid queries. - rank-score-drop-limit in second phase. - Chinese segmentation. and 14 more, covered in our August issue.
2
4
12
@vespaengine
vespa.ai
7 months
Tempted to use a database for your vectors? .Consider why you are using vectors in the first place.
0
3
12
@vespaengine
vespa.ai
1 year
The December Vespa newsletter is out:.- Global-phase ranking with normalizing functions.- Take the Vespa open source survey.- Token access on Vespa Cloud.- Get indexed tokens in results.- and much more.
0
1
12
@vespaengine
vespa.ai
5 years
Vespa improvements released in February: Support for LightGBM machine-learned models, improved matrix multiplication performance, benchmarking guide, a fluent query builder API, and Hadoop integration improvements.
0
6
12
@vespaengine
vespa.ai
5 years
Vespa improvements from January Among other things this includes new tensor functions needed to run BERT models. All of it already released and running in production of course.
1
7
12
@vespaengine
vespa.ai
2 years
Highlights of Vespa features released in June:. - Personal vector search with complete results, at 1/20 of the cost, with streaming.- MIPS/dot product ranking in ANN.- GPU Acceleration of embedding models.- Use any @huggingface embedder directly.
0
4
12
@vespaengine
vespa.ai
8 years
Some worthwhile things take a decade.
0
3
12
@vespaengine
vespa.ai
4 years
Everybody knows that AI builds on tensors now, but when we started work on them in 2013 it was just a weird thing.
1
3
12
@vespaengine
vespa.ai
4 years
If you are looking for a comparison of Elasticsearch, Solr, and Vespa this is the video to watch.
@FlaxSearch
Charlie Hull
4 years
Find out how our invited speakers compared #Elasticsearch #Solr and @vespaengine at the Haystack LIVE! Meetup - video now available at - thanks so much @anshumgupta @jobergum @joshdevins !.
0
2
11
@vespaengine
vespa.ai
1 year
If you are using vector embeddings, reading this post might be the most profitable ten minutes you'll ever spend.
@jobergum
Jo Kristian Bergum
1 year
Matryoshka ๐Ÿค Binary vectors: Slash vector search costs with Vespa. We announce support for combining matryoshka and binary quantization in Vespaโ€™s native hugging-face embedder and discuss how this slashes vector search costs.
0
1
10
@vespaengine
vespa.ai
6 years
Implementing reinforcement learning in production with open source: New Vespa blog post
0
9
11
@vespaengine
vespa.ai
1 year
Gigaom published their Sonar for Vector Databases today, positioning Vespa as a leader. While what we are - and what you need - is much more than a vector database, it is gratifying to be recognized as a leader also on these core features alone.
1
2
11
@vespaengine
vespa.ai
8 months
Finally it is possible to create search systems that can see: Drop document text extraction and let your system see text, figures and layout directly.
0
2
11
@vespaengine
vespa.ai
2 years
Announcing GPU-accelerated inference in Vespa
1
2
10
@vespaengine
vespa.ai
6 years
It's been a good summer for coding in the north ๐ŸŒง๏ธ๐ŸŒง๏ธ. We got some new features in Vespa released: BM25 ranking feature, searchable parents, tensor summary features and metric export.
0
4
11
@vespaengine
vespa.ai
3 years
Scaling vector search to a billion documents per node
0
1
11
@vespaengine
vespa.ai
6 years
It turns out that when you open source 1.7M lines of code in 150 flat modules people will keep asking for directions. Today we're publishing a map to the code base.
0
3
11
@vespaengine
vespa.ai
5 years
April @vespaengine updates include performance and operability improvements: Top-K hits, smarter data migration and CloudWatch integration. Contributing to Vespa is now easier with the release of a CentOS 7 dev environment. #bigdata.
0
6
11
@vespaengine
vespa.ai
1 year
When GigaOm named Vespa Leader in their Sonar for Vector Databases, one of the categories where we scored Excellent were Embedding Flexibility - why? . Vespa lets you create embeddings in four ways:. - On your own, outside Vespa: Just pass tensors directly in documents and.
1
2
11
@vespaengine
vespa.ai
2 years
๐Ÿ’ฅNew blog post: Improving Zero-Shot Ranking with Vespa Hybrid Search.
1
0
9
@vespaengine
vespa.ai
5 years
The big data maturity levels Tag your org!.
1
4
10
@vespaengine
vespa.ai
1 year
When you're working with vectors you're doing search - and you'll need all the features of a search engine. A new blog post which explains why we're no fans of the term 'vector database'.
Tweet media one
0
2
10
@vespaengine
vespa.ai
2 years
Semantic search with multiple languages is finally clearing the bar for production. Vespa makes it easy to stand up your own.
0
2
9
@vespaengine
vespa.ai
4 years
We're submitting some Vespa examples to relevance competitions. All of them:.- available as open source sample applications.- production ready, with < 100 ms latency. This first one is a simple WAND+GDBT baseline, beating all the other entries not using deep learning/embeddings.
@lintool
Jimmy Lin
4 years
Welcome @jobergum and @vespaengine to the fray with their first @MSMarcoAI document leaderboard submission! Awesome to see LTR effectiveness creeping up on the muppets.
1
3
10
@vespaengine
vespa.ai
6 years
How would you implement the next Gmail efficiently in open source? New Vespa blog post
0
7
9
@vespaengine
vespa.ai
4 years
Did you know you can now convert text to vector embeddings in documents and queries automatically in Vespa? We provide the wonderful SentencePiece algorithm by Taku@Google for this, but you can also plug in your own.
Tweet media one
0
2
10
@vespaengine
vespa.ai
5 years
The hardest problem in computing .
0
8
10
@vespaengine
vespa.ai
8 months
A great article by @VintedEng on migrating from Elasticsearch to Vespa
Tweet media one
3
2
10
@vespaengine
vespa.ai
4 years
And how to use a BERT-style transformer model for final ranking while staying within your latency budget
0
2
10
@vespaengine
vespa.ai
3 years
Finally, for all the Elastic developers out there who want to start creating modern search base applications, @atitaarora on understanding Vespa with a Lucene mindset
0
3
9
@vespaengine
vespa.ai
2 years
For those who are running Vespa, information about HTTP/2 Rapid Reset (CVE-2023-44487).
0
1
9
@vespaengine
vespa.ai
4 years
Testing ANN performance on Search a million vectors in 2 ms measured from the client.
@jobergum
Jo Kristian Bergum
4 years
From our performance factory page at @vespaengine HQ where we test Vespa performance for every build. This is testing Vespa's approximate nearest neighbor search on SIFT 1M This is an end to end benchmark including HTTP API. Pretty awesome if you ask me.
Tweet media one
0
3
9
@vespaengine
vespa.ai
1 year
ColBERT v2 is the state of the art in information retrieval, and with this work you can actually run it economically, at any scale.
@jobergum
Jo Kristian Bergum
1 year
Announcing ColBERT in @vespaengine, enjoy!. - A new native Vespa ColBERT v2 embedder.- ColBERT token-level vector compression (32x).- Support for long context via Vespa mixed tensors.- Offload to disk .- Eval. Plus, it boasts the largest FAQ ever!๐Ÿ˜….
0
0
9
@vespaengine
vespa.ai
2 years
October news from ๐ŸŽƒ. ๐ŸŒค๏ธ Enclave: Bring your own cloud to Vespa Cloud, on both AWS and GCP. ๐ŸŽ๏ธMUCH faster fuzzy matching. ๐Ÿ”Lucene linguistics integration. and much more obviously:
0
3
9
@vespaengine
vespa.ai
4 years
This lets you combine vector search with filters and text while staying efficient, and scale to real data sizes with low latency. This is what's needed for production and what can't be achieved by integrating specialist tools for these problems. [3/4].
1
0
8
@vespaengine
vespa.ai
7 years
If you're in Bay Area come meet us at September 26. in San Francisco Thanks to @Amplitude_HQ for hosting!.
0
5
9
@vespaengine
vespa.ai
5 years
If you have it's not too late to make yourself a state of the art and infinitely scalable e-commerce site in time for Black Friday. @jobergum explains how in
0
9
9
@vespaengine
vespa.ai
6 years
For more on how to use reinforcement learning for recommendation see
@adichad
Aditya Varun Chadha
6 years
A breath of fresh air: @vespaengine at #HaystackConf, signal to noise is through the roof! Reinforcement learning for comment ranking.
Tweet media one
Tweet media two
0
6
6
@vespaengine
vespa.ai
1 year
Announcing a new IN operator in Vespa: .select * from product where id in (10, 20, 30). - Simpler than using a WeightedSet.- Over 10x faster with large thousands of query values.
1
0
8
@vespaengine
vespa.ai
4 years
ANN lets us apply AI in ways that weren't possible before. But to use it in the real world you need a few more things.
0
1
8
@vespaengine
vespa.ai
7 months
Shipped from the Vespa HQ in ๐ŸŽƒ October:. - Index tensors with multiple sparse dimensions: tensor(page{}, section{}, chunk{}, x[16]).- Global significance ("tf") models- Metric dashboards in the Vespa Cloud Console.- Even more new PyVespa features. Read the details in our latest.
0
1
7
@vespaengine
vespa.ai
3 years
Will new vector databases dislodge traditional search engines?.
0
2
8
@vespaengine
vespa.ai
3 years
Want to run on your m1 Mac, on ARM64 nodes in your data centers, or on Vespa Cloud?. We got you! All Vespa images are now multi-architecture.
0
2
7
@vespaengine
vespa.ai
7 years
Scalable realtime blog recommendation using neural nets.Part 3 of 3 of a series showing in detail how to build a big data serving application using Vespa.
0
6
8
@vespaengine
vespa.ai
3 years
An open source sample application for searching images by describing them, based on CLIP
Tweet media one
0
3
7
@vespaengine
vespa.ai
10 months
We also provide the same for *your* applications when running on Vespa Cloud.
@YashasviMantha
Yashasvi Mantha
10 months
@vespaengine has one of the best FOSS ci/cd set up I have seen out there. Pretty much everything is automated. This encourages contributors like me to contribute even more because the changes are so instant on prod.
0
0
7
@vespaengine
vespa.ai
8 months
Danswer (YC W24) on why they moved their RAG solution to Vespa. We hear this in so many private conversations right now.
Tweet media one
@Yuhong_S
Yuhong Sun
8 months
@DanswerAI has been using @vespaengine as our search engine for a long time. In this blog, I outline the key benefits of using Vespa. Thanks @jonbratseth, @kraune and @jobergum for keeping Vespa open source and helping our users with Vespa questions!.
0
0
7
@vespaengine
vespa.ai
4 years
For those who want to contribute to this will be an easy way to get started.
@ydn
Yahoo Developer
4 years
Join us March 21 - 28, for Yahoo Hack Together, a virtual #opensource #hackathon! Learn more & register: #bigdata #design #devops #networksecurity
Tweet media one
0
3
7
@vespaengine
vespa.ai
1 year
@farfetch The full post includes a detailed description of how to implement a scalable and low latency recommender system:
0
0
7
@vespaengine
vespa.ai
4 years
Vespa product updates for December:. - 10-150x speedup of computations with sparse tensor dimensions.- ZooKeeper distributed locking available in containers.- PyVespa for machine learning.- ONNX runtime integration.
0
2
7
@vespaengine
vespa.ai
1 year
๐ŸŽ…The official 2024 Advent of Tensors ๐ŸŽ…. Follow this thread to get a challenge a day, win swag and become a tensor computation expert along the way.
@jobergum
Jo Kristian Bergum
1 year
Prepare to embark on a festive journey as we bring you the Advent of Tensors with 24 challenges and the chance to win @vespaengine swag! .
0
3
7
@vespaengine
vespa.ai
1 year
This is an unusually good comparison, although it leaves out what we see as most important: Ranking and inference capabilities. It can be solved elsewhere, but only if you don't need to perform at scale . .
@mayowaoshin
Mayo Oshin
1 year
Found this comprehensive spreadsheet on the key features of popular Vector Databases used to build out AI chatbot solutions. Spreadsheet: Key takeaways:. - The most well-rounded solutions include @weaviate_io, @vespaengine, and @elastic. - The top
Tweet media one
0
0
7