HeavyKevy
@CryptoKevy
Followers
554
Following
27K
Media
172
Statuses
2K
The truth is like poetry... and most people fucking hate poetry
Earth
Joined August 2017
There’s a new RAG technique every other week. Hybrid Search. Rewriting. Re-ranking. Agents. And while each solves something important, it’s easy to get overwhelmed — or worse, stuck trying to memorize patterns without knowing when to use them. So let’s slow down, break things
0
2
2
What we just experienced was true baseball. It’s a cruel, beautiful, joy filled, gut wrenching game. We saw incredible highs during the regular season and we saw incredible lows. We saw Julio look like a AAA hitter for 2 months and a true superstar for the second half. We
24
72
878
Welp. That one’s gonna hurt for a while. Love love love this team, though. Bring back Naylor and let’s run this shit back.
456
516
11K
And we're live! Come join us:
unstructured.io
The tradeoffs of DIY document pipelines at scale — and how Unstructured ETL+ fills the gaps.
If your homegrown unstructured data ETL is starting to feel like a rat’s nest 🐀, it’s because it probably is. 1/🧵
0
1
2
If you're hacking around on AI projects and have questions about where the market is going and what trends we're seeing at @UnstructuredIO, check out the 🧵 below.
Benchmarks don’t tell the whole story. 🧵 Yesterday’s webinar dug into how quality is evolving - from HTML-first strategies to Vision Language Models to systems that can reason and self-correct. We pulled 5 takeaways worth knowing. Which one stands out most to you?
0
0
0
The Document AI space has seen a fundamental shift in the past year. Everyone—from scrappy startups to established players—has pivoted from custom supervised models to wrapping the same handful of closed-source multimodal models. Yet, despite the fact we're all using essentially
0
2
2
Why are complex tables so hard to parse? OCR can detect characters, and some newer models can even handle simple tables. But once you introduce blank cells, multi-row headers, or nested structures, OCR quickly falls short. Rows and columns lose their positionality, context
0
2
2
At @UnstructuredIO, we often get the question "how well do you perform on scanned forms that include handwriting?" These types of documents are notoriously among the most difficult types of documents to ingest cleanly and reliably, yet they remain ubiquitous across many
0
2
3
Remember when extracting data from complex tables felt like digital archaeology? Messy. Painful. Incomplete. We do. That’s why we’ve devoted years of R&D to table transformation, turning one of document AI’s hardest challenges into a core strength. 1/🧵
2
1
4
In our latest webinar, we showed how Unstructured acts as Grand Central Station for GenAI pipelines—connecting sources, parsing content, chunking, and embedding to deliver AI-ready outputs. From VLM partitioning to contextual chunking and embeddings, we covered the exact way ETL
0
1
2
🏆 We’re ISO 27001 certified! We’re proud to announce that Unstructured is now officially ISO 27001 certified! This certification reflects our unwavering commitment to safeguarding customer data while delivering trusted solutions for transforming complex, unstructured data into
0
1
1
📣 Now Available: MLK Assassination Files — Structured and Searchable Following the declassification of over 6k documents and 240k pages related to the assassination of Dr. Martin Luther King Jr., Unstructured has released a machine-readable corpus built from these files. Many
github.com
Contribute to Unstructured-IO/unstructured-mlk-archive-public development by creating an account on GitHub.
1
4
11
It also shouldn’t be lost on anyone that Trump ordered these raids in large liberal cities. Places where his staff knew people would resist and where you could have “warzone” narratives that Fox News will eat up. There are no raids on the countless farms filled with
39
149
738
Lose $10T of market capitalization to fix a $1T trade deficit. 8D chess.
2K
7K
62K
Wow. Unstructured just made Fast Company's list of the Top 50 Most Innovative Companies in the World—coming in at #24! 🔥 When we started, we believed that if AI was going to change the world, it needed better data. Fast forward, and we’re now helping some of the biggest
0
3
14
Not surprisingly, the Trump and GOP budget has nothing about no tax on tips, nothing on no tax on overtime, and nothing on no tax on social security. It does have a $4.5 trillion tax cut for billionaires however.
932
8K
28K