LlamaIndex 🦙
@llama_index
Followers: 105K
Following: 1K
Media: 2K
Statuses: 4K
AI Agents for document OCR + workflows Github: https://t.co/HC19j7veGE Docs: https://t.co/QInqg2yMCJ LlamaCloud: https://t.co/yQGTiRSfFL
Joined December 2022
LlamaSheets is our new way to handle complex, messy spreadsheets that come as many sheets disguised as one, multiple regions that provide different sets of information, and much more. Check out this example of a (generated, fake) company budget sheet. It actually has 4
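To make "many sheets disguised as one" concrete, here is a minimal sketch of a naive heuristic that splits a grid of cells into regions wherever a fully blank row appears. This is not how LlamaSheets works; the function name `split_regions` and the sample data are hypothetical, and real spreadsheets need far more than blank-row detection.

```python
Grid = list[list[str]]


def split_regions(rows: Grid) -> list[Grid]:
    """Split a sheet into regions separated by fully blank rows (naive heuristic)."""
    regions: list[Grid] = []
    current: Grid = []
    for row in rows:
        if any(cell.strip() for cell in row):
            # Row has at least one non-empty cell: it belongs to the current region.
            current.append(row)
        elif current:
            # Blank row closes the region collected so far.
            regions.append(current)
            current = []
    if current:
        regions.append(current)
    return regions


# Hypothetical sheet: two unrelated tables stacked in one tab.
sheet = [
    ["Dept", "Budget"],
    ["Eng", "100"],
    ["", ""],
    ["Quarter", "Spend"],
    ["Q1", "40"],
]
regions = split_regions(sheet)
```

A heuristic like this breaks on side-by-side tables, merged headers, and footnotes, which is the kind of mess the post is describing.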
"ask" and you shall receive! SemTools now ships with a dedicated "ask" CLI command - performs agentic search over documents - combine with `parse` to create QA workflows over unstructured data - cache your indexes with `workspaces` Learn more:
github.com
v1.5 is shipping with a new ask command in the CLI. This is essentially a RAG agent that will search your files and cite its sources! # Perform agentic RAG search $ ask "What papers discu...
Split documents into distinct sections automatically with our new LlamaSplit API. We're excited to introduce LlamaSplit (now in beta), which uses AI to automatically separate bundled documents into clear, targeted sections based on categories you define - no more manual
Scalably Parsing 1M+ PDFs with AI Agents. Here's a simple tutorial we wrote up showing you how to parse a directory of an arbitrary number of PDFs through our service in a reliable, efficient manner. LlamaParse is designed to handle very large workloads; with some simple
Need to parse multiple PDFs efficiently? Learn how to use LlamaParse with async batch processing. - Process entire folders of PDFs simultaneously instead of one-by-one - Use asyncio and semaphores to control how many files parse concurrently - Prevent API rate limit errors
developers.llamaindex.ai
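The asyncio-and-semaphore pattern the post mentions can be sketched roughly as follows. This is a minimal illustration, not the linked tutorial's code: `parse_pdf` is a hypothetical stand-in for a real parsing call (it is not the LlamaParse API), and `max_concurrency` is an assumed parameter name.

```python
import asyncio
from pathlib import Path


async def parse_pdf(path: Path) -> str:
    """Hypothetical stand-in for a real parsing request (not the LlamaParse API)."""
    await asyncio.sleep(0.01)  # simulate network latency
    return f"parsed:{path.name}"


async def parse_all(paths: list[Path], max_concurrency: int = 5) -> list[str]:
    """Parse every path while running at most max_concurrency requests at once."""
    semaphore = asyncio.Semaphore(max_concurrency)

    async def bounded(path: Path) -> str:
        # Each task waits here until one of the semaphore slots is free.
        async with semaphore:
            return await parse_pdf(path)

    # gather returns results in the same order as the input paths.
    return await asyncio.gather(*(bounded(p) for p in paths))


def parse_folder(folder: str, max_concurrency: int = 5) -> list[str]:
    """Parse all *.pdf files found directly inside the given folder."""
    paths = sorted(Path(folder).glob("*.pdf"))
    return asyncio.run(parse_all(paths, max_concurrency))
```

Capping concurrency with a semaphore keeps the number of in-flight requests bounded, which is one common way to stay under an API rate limit; retries and backoff would still be needed for a production workload.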
"Intelligent Document Processing" 🧪 as an industry is gone. With our latest release this week, *anyone* can build and deploy a specialized document agent in seconds, and customize the steps via code. Let's take a tour through our invoice processing and contract matching
Document understanding is a huge use case for VLMs, but historically there's been no single "good" benchmark to measure progress here (unlike SWE-bench for coding). This past week I did a deep dive into OlmOCR-Bench, a recent document OCR benchmark that is a huge step in the
OCR benchmarks matter, so in this blog @jerryjliu0 analyzes OlmOCR-Bench, one of the most influential document OCR benchmarks. TLDR: it's an important step in the right direction, but doesn't quite cover real-world document parsing needs. OlmOCR-Bench covers 1400+ PDFs with
Deploy production-ready agent workflows with just one click from LlamaCloud. Here's us deploying the SEC filing extract and review agent! Our new Click-to-Deploy feature lets you build and deploy complete document processing pipelines without touching the command line:
Calling all community members: Join us this Thursday for office hours in our Discord server, all about LlamaAgents and LlamaSheets. This is a chance to ask anything on your mind about two of our latest releases, and learn about what's coming up next. Drop in anytime from 11AM
Claude Code over Excel++. Claude already 'works' over Excel, but in a naive manner - it writes raw python/openpyxl to analyze an Excel sheet cell-by-cell and generally lacks a semantic understanding of the content. Basically the coding abstractions used are too low-level to
Build scripts that automate spreadsheet analysis using coding agents and LlamaSheets to extract clean data from messy Excel files. - Set up coding agents like @claudeai and @cursor_ai to work with LlamaSheets-extracted parquet files and rich cell metadata - Use formatting cues
Automate ETL over Financial Data. Most real-world financials are not "database-shaped", and it takes a ton of human effort to manipulate/copy an Excel sheet into structured formats for analysis. We recently launched LlamaSheets - a specialized AI agent that automatically
POV: You're building an agent and it keeps giving weird answers because your PDF parsing is broken. This is a great walkthrough by @mesudarshan showing exactly how to use LlamaParse to fix this, from basic setup through advanced configs. The video walks through: · Why most PDF
Stop losing 80% of your data when extracting from long documents with repeating entities like catalogs, tables, and lists. Our new Table Row extraction target in LlamaExtract solves the core problem: instead of trying to extract everything at once (where LLMs get overwhelmed),
We launched a new API today to let you parse any Excel sheet into a structured table. Take a look at this example on core production costs: 1️⃣ The table is located at the center of the sheet with headers, footnotes, and a hierarchical column layout 2️⃣ We get back a structured
Announcing LlamaSheets in beta. Transform your messy spreadsheets into AI-ready data with our newest LlamaCloud API. LlamaSheets (in beta) is a specialized API that automatically structures complex spreadsheets while preserving their semantic meaning and hierarchical
Have a LlamaAgent organize your study material: meet StudyLlama, a web app that uses LlamaAgents to help you organize and gather insights from your notes and papers! How it works: - Create categories to classify your notes - Upload your notes - Watch as LlamaClassify assigns
Extract data from table rows with precision using LlamaExtract's Table Row mode. LlamaExtract now offers granular extraction capabilities that go beyond document-level processing, giving you powerful control over how your schema is applied: - Table row extraction applies
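The idea of applying a schema per row rather than per document can be illustrated with a small sketch. This is not LlamaExtract's API: the `CatalogItem` schema, the `extract_rows` helper, and the sample data are all hypothetical, and plain string parsing stands in for whatever the service does to fill each record.

```python
from dataclasses import dataclass


@dataclass
class CatalogItem:
    """Hypothetical schema describing ONE row, not the whole document."""
    sku: str
    name: str
    price: float


def extract_rows(header: list[str], rows: list[list[str]]) -> list[CatalogItem]:
    """Apply the schema to each row independently and collect the results."""
    items: list[CatalogItem] = []
    for row in rows:
        record = dict(zip(header, row))  # pair column names with this row's cells
        items.append(
            CatalogItem(
                sku=record["sku"],
                name=record["name"],
                price=float(record["price"]),
            )
        )
    return items


# Hypothetical catalog table with one entity per row.
header = ["sku", "name", "price"]
rows = [["A1", "Widget", "9.50"], ["B2", "Gadget", "12.00"]]
items = extract_rows(header, rows)
```

Because every row is handled on its own, the output has exactly one record per input row, so a long catalog cannot silently lose entries the way a single whole-document pass might.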
Not another PDF parser 🤯? Here's why AI-powered document parsing is all the rage. AI document parsing has evolved beyond OCR to systems that truly understand documents like humans do. In our latest blog post, we explore what's changing the game: - Zero-shot semantic