chunkr

@chunkrai

Followers
423
Following
50
Media
19
Statuses
57

vision based document parsing

San Francisco, CA
Joined January 2025
@chunkrai
chunkr
4 months
PDFs suck. If you want to build standout RAG apps, you need tooling that is tailored to your needs. You should own your doc parsing infra. Today we’re launching on @ycombinator.
5
16
88
@chunkrai
chunkr
7 days
Join our Discord and become a part of the Chunkr community
0
0
1
@chunkrai
chunkr
7 days
Want to customize it? Check out the complete open-source repo at
1
0
2
@chunkrai
chunkr
7 days
Use your own keys, no server-side storage involved. Try it on
1
0
1
@chunkrai
chunkr
7 days
Don’t fight your docs. Just talk to them. We built a document chat app that just works. Any file type, any industry. Legal docs, engineering diagrams, clinical notes and everything in between. Chunkr Chat is now live! See what the best chat feels like. Watch it break down a
4
2
33
@chunkrai
chunkr
11 days
this is an awesome use case. good parsing + structured outputs is such a powerful combination. if you don't want to send your data to the cloud, you can also self-host chunkr. we make it easy to spin up a @vllm_project or @ollama container and run an open source LLM.
@caluckenbach
cal
12 days
sell this & thank me later. pii redaction in technical documents with less than 150 LOC. the @chunkrai boys cooked
2
1
9
@chunkrai
chunkr
15 days
is it ok to say I love u to your intern chat?
0
0
1
@chunkrai
chunkr
15 days
we love u @OpenRouterAI and @dhrxvb26
1
0
3
@chunkrai
chunkr
15 days
It is also open source! We used @supabase as our vector DB, @OpenAI for embeddings and @OpenRouterAI for LLMs. Fork and have fun!
0
0
7
@chunkrai
chunkr
15 days
Building all this with Chunkr is easy. The full step-by-step is in the blog!
1
0
3
@chunkrai
chunkr
15 days
Make your RAG app multimodal. If you build with Chunkr, you can pass figures to your LLM as cropped images for context. This lets your chatbot give users data they can see, not just read about.
1
0
4
@chunkrai
chunkr
15 days
Chunkr’s bounding boxes make every source one scroll away. Click a citation and the viewer jumps to the exact paragraph, table cell, or figure.
1
0
5
@chunkrai
chunkr
15 days
Level up your RAG app. Most PDF-centered chat experiences fall short on clarity and trust. Our latest blog shows how you can use Chunkr outputs to build standout PDF experiences, with pinpoint inline citations and rendered visuals right inside every LLM response. 🔗 below
1
1
20
@chunkrai
chunkr
23 days
RT @piammichel: I didn’t expect to ever say this but this was a fascinating read about PDFs. and how design choices Adobe made in the 90s m…
0
3
0
@chunkrai
chunkr
24 days
Read the full post here:
1
0
5
@chunkrai
chunkr
24 days
So here we are, building incredibly complex Vision AI to reverse-engineer a 30-year-old file format. At Chunkr, we're not just patching the problem. We're building the fundamental infrastructure to turn these static files into a dynamic type for an AI-native world.
1
0
4
@chunkrai
chunkr
24 days
Did Adobe try to fix it? Yes. "Tagged PDFs" were meant to add structure, like HTML. But it was a classic chicken-and-egg problem. No one made them because no one used them. The standard failed, and a multi-billion-dollar Document AI industry was born to clean up the mess.
1
0
3
@chunkrai
chunkr
24 days
For decades, we built workarounds. But when LLMs arrived, the problem exploded. You can't build reliable AI products on a jumbled mess of text. Garbage in, garbage out. The old tech debt was suddenly a massive blocker for the industry.
1
0
3
@chunkrai
chunkr
24 days
But this perfect rendering had a hidden cost: the "flat canvas." Underneath, the PDF was just a painting of the document. It had no concept of headings, paragraphs, tables, reading order, etc. For a computer, it was fundamentally unreadable.
1
0
3
@chunkrai
chunkr
24 days
It all started with a noble goal. In 1991, Adobe's "Project Camelot" set out to create "digital paper." A file that would look the exact same on any screen or printer. They succeeded, and the PDF became the standard for digital documents.
1
0
4