Jon Bratseth
@jonbratseth
Followers
437
Following
3K
Media
28
Statuses
2K
CEO https://t.co/5qXgcEp1MU Build things and help people.
Trondheim, Norway
Joined April 2008
The price of everything on Earth. This chart is all of the natural occurring elements, their occurrence rate in Earth's crust (X-axis) and their price in USD (Y-axis). The chart illustrates three clear price regimes. 1. Yellow band is stuff that is economically priced this is
63
194
1K
Mark your calendars for Tuesday, 6pm CET. This is an event you dont wanna miss! Logan Kilpatrick from Google Deepmind will join me in AI Chitchat! 🚀
2
10
39
Lightning Lessons on The march to Cheat at Search with *Agents* coming in Feb -- Coming Nov 17, Radu Gheorghe of https://t.co/o4RKxAZWH5 will share best practices on RAG chunking. Or really beyond RAG chunking :) https://t.co/0oxje9nMtU
maven.com
In enterprise and web search, many questions are answered by separate bits of documents, yet semantics and properties of the containing entity are also important. While there's no silver bullet -...
0
3
4
Best general talk about vectors I've seen. From what a vector is to how HNSW works:
0
3
4
I've been looking for how search and RAG can be done on large scale and actual data, and there's just toy examples everywhere I look. Not just some pdfs or a website with everything in context, but actual search, retrieval, ranking, re-ranking, etc. Then I found this goldmine.
8
17
188
Anyone who needs their AI systems to have access to general knowledge will need a web search API. That's probably why Google and Bing are restricting and shutting down theirs now. Fortunately, new alternatives are coming online. Perplexity just launched their web search API
blog.vespa.ai
Perplexity demonstrates the quality of their search solution and show what it takes to achieve it
Google just made a subtle but massive change Last month, Google quietly removed the num=100 search parameter. This means you can no longer view 100 results at once. The default max is now 10. Why does this matter? - Most LLMs (OpenAI, Perplexity, etc.) rely (directly or
1
2
6
Filtered vector search is a massively important and overlooked problem for RAG and vector DBs. Very excited to see this new blog post from @vespaengine detailing its implementation of ACORN, along with many clever extensions to deliver huge speedups for search with filters.
In real vector search systems, performance is dominated by combining it efficiently with filters. Few test this properly. 🧵
1
8
24
Two great alternatives, both built on
0
2
9
Lots of hard problems in web search, but luckily at least the "super fancy db" you need for the index is available for everyone at https://t.co/QfFhnHgki7.
Why it's hard to build a web index, objectively harder than building a GPT-4.1. Argument: there are just fewer people - literally two (G and M) - who have done it well.
0
2
10
Much talk about context rot in timeline. The solution: layered ranking and chunk selection.
1
1
1
In real vector search systems, performance is dominated by combining it efficiently with filters. Few test this properly. 🧵
1
5
20
We just did a podcast about the process of migration (trade-offs included) from #Elasticsearch to @vespaengine With @dainius_jocas and @KevinPetrieTech 🙌 https://t.co/BtCGVyzP2U
em360tech.com
In this episode of the Don't Panic, It's Just Data podcast, Kevin Petrie, VP of Research at BARC and the podcast host, is joined by Dainius Jocas, Search Engineer at Vinted, and Radu Gheorghe,...
0
4
6
Announcing: The RAG Blueprint Build RAG like the world's most successful applications. Start from our open source sample app which contains all you need to do to achieve world-class quality at any scale. Sample app: https://t.co/LBR2Uuf7Sl Blog post:
blog.vespa.ai
An open source sample application that contains everything you need to create a RAG solution with world-class accuracy and infinite scalability.
1
5
25
so much of so-called moral intuition, like that fun wears out and utopia is ultimately boring, is contingently downstream of being permanently imprisoned in rickety rube kludgeberg machine of matryoshka shock collars and dopamine needlepricks for driving a biorobot around a
The one biological paradox I find really tiresome is that for things to be easy and fun most of the time, you have to intentionally inflict (relatively) stupendous levels of boredom and hardship on yourself.
13
6
193
New Vespa features covered in the June newsletter: - Layered ranking: Rank chunks in documents. - Elementwise bm25 - top, filter_subspaces, and cell_order tensor functions - chunking support in indexing - element-gap: Proximity over chunks - filtering in grouping results -
1
4
11
June @vespaengine newsletter is out! Lots of cool new stuff (e.g. built-in chunking) and educational content (e.g. demo E-commerce apps with new ideas) Check it out and let us know of any feedback:
blog.vespa.ai
Advances in Vespa features and performance include layered ranking for RAG applications, chunking, and facet filtering.
0
3
7