LlamaIndex 🦙 @llama_index X Profile

LlamaIndex 🦙

@llama_index

Followers

105K

Following

1K

Media

2K

Statuses

4K

AI Agents for document OCR + workflows Github: https://t.co/HC19j7veGE Docs: https://t.co/QInqg2yMCJ LlamaCloud: https://t.co/yQGTiRSfFL

https://t.co/epzefqPT9Z

Joined December 2022

LlamaIndex 🦙

@llama_index

3 days

LlamaSheets is our new way to handle complex, messy spreadsheets that come as many sheets disguised as one, multiple regions that provide different sets of information, and much more. Check out this example of a (generated, fake) company budget sheet. It actually has 4

2

7

41

LlamaIndex 🦙

@llama_index

4 days

"ask" and you shall receive! SemTools now ships with a dedicated "ask" CLI command - performs agentic search over documents - combine with `parse` to create QA workflows over unstructured data - cache your indexes with `workspaces` Learn more:

github.com

v1.5 is shipping with a new ask command in the CLI. This is essentially a RAG agent that will search your files and cite its sources! # Perform agentic RAG search $ ask "What papers discu...

2

5

19

LlamaIndex 🦙

@llama_index

4 days

Split documents into distinct sections automatically with our new LlamaSplit API 📄✂️ We're excited to introduce LlamaSplit (now in beta), which uses AI to automatically separate bundled documents into clear, targeted sections based on categories you define - no more manual

1

9

65

Jerry Liu

@jerryjliu0

5 days

Scalably Parsing 1M+ PDFs with AI Agents 📈📑 Here’s a simple tutorial we wrote up showing you how to parse a directory of an arbitrary number of PDFs through our service in a reliable, efficient manner. LlamaParse is designed to handle very large workloads; with some simple

LlamaIndex 🦙

@llama_index

5 days

Need to parse multiple PDFs efficiently? Learn how to use LlamaParse with async batch processing. 📁 Process entire folders of PDFs simultaneously instead of one-by-one ⚡ Use asyncio and semaphores to control how many files parse concurrently 🎯 Prevent API rate limit errors

11

26

219

LlamaIndex 🦙

@llama_index

5 days

Need to parse multiple PDFs efficiently? Learn how to use LlamaParse with async batch processing. 📁 Process entire folders of PDFs simultaneously instead of one-by-one ⚡ Use asyncio and semaphores to control how many files parse concurrently 🎯 Prevent API rate limit errors

developers.llamaindex.ai

4

8

60
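
The concurrency pattern named in the post above (asyncio plus a semaphore to cap how many files parse at once) can be sketched roughly as follows. This is a minimal illustration, not the tutorial's actual code: `parse_pdf` is a hypothetical stand-in for a real parsing call, and the `max_concurrent` limit and file names are assumptions chosen for the demo.

```python
import asyncio
from pathlib import Path


async def parse_pdf(path: Path) -> str:
    # Hypothetical stand-in for a real parsing request (assumption, not a real API).
    await asyncio.sleep(0.01)  # simulate network latency
    return f"parsed:{path.name}"


async def parse_all(paths, max_concurrent=5):
    # The semaphore caps how many parse_pdf calls are in flight at once,
    # which is the usual way to stay under an API rate limit.
    semaphore = asyncio.Semaphore(max_concurrent)

    async def bounded(path):
        async with semaphore:
            return await parse_pdf(path)

    # gather schedules every task and returns results in input order.
    return await asyncio.gather(*(bounded(p) for p in paths))


def run_demo():
    # Hypothetical file names; a real run would use Path("folder").glob("*.pdf").
    paths = [Path(f"doc_{i}.pdf") for i in range(8)]
    return asyncio.run(parse_all(paths, max_concurrent=3))
```

Calling `run_demo()` parses eight placeholder files with at most three in flight at any moment. Raising or lowering `max_concurrent` trades throughput against the risk of rate-limit errors.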

Jerry Liu

@jerryjliu0

6 days

“Intelligent Document Processing” 📑🧪 as an industry is gone. With our latest release this week, *anyone* can build and deploy a specialized document agent in seconds ⚡️🤖, and customize the steps via code. Let’s take a tour through our invoice processing and contract matching

8

47

394

Jerry Liu

@jerryjliu0

9 days

Document understanding is a huge use case for VLMs, but historically there's been no single "good" benchmark to measure progress here (unlike SWE-bench for coding). This past week I did a deep dive into OlmOCR-Bench, a recent document OCR benchmark that is a huge step in the

LlamaIndex 🦙

@llama_index

10 days

OCR benchmarks matter, so in this blog @jerryjliu0 analyzes OlmOCR-Bench, one of the most influential document OCR benchmarks. TLDR: it’s an important step in the right direction, but doesn’t quite cover real-world document parsing needs. 📊 OlmOCR-Bench covers 1400+ PDFs with

7

19

170

LlamaIndex 🦙

@llama_index

10 days

OCR benchmarks matter, so in this blog @jerryjliu0 analyzes OlmOCR-Bench, one of the most influential document OCR benchmarks. TLDR: it’s an important step in the right direction, but doesn’t quite cover real-world document parsing needs. 📊 OlmOCR-Bench covers 1400+ PDFs with

3

8

53

LlamaIndex 🦙

@llama_index

11 days

Deploy production-ready agent workflows with just one click from LlamaCloud. Here's us deploying the SEC filing extract and review agent! Our new Click-to-Deploy feature lets you build and deploy complete document processing pipelines without touching the command line: 🚀

1

5

27

LlamaIndex 🦙

@llama_index

12 days

Calling all community members: Join us this Thursday for office hours in our Discord server, all about LlamaAgents and LlamaSheets. This is a chance to ask anything on your mind about two of our latest releases, and learn about what's coming up next. Drop in anytime from 11AM

2

5

18

Jerry Liu

@jerryjliu0

13 days

Claude Code over Excel++ 🤖📊 Claude already 'works' over Excel, but in a naive manner - it writes raw python/openpyxl to analyze an Excel sheet cell-by-cell and generally lacks a semantic understanding of the content. Basically the coding abstractions used are too low-level to

LlamaIndex 🦙

@llama_index

13 days

Build scripts that automate spreadsheet analysis using coding agents and LlamaSheets to extract clean data from messy Excel files. 🤖 Set up coding agents like @claudeai and @cursor_ai to work with LlamaSheets-extracted parquet files and rich cell metadata 📊 Use formatting cues

10

39

325

LlamaIndex 🦙

@llama_index

13 days

Build scripts that automate spreadsheet analysis using coding agents and LlamaSheets to extract clean data from messy Excel files. 🤖 Set up coding agents like @claudeai and @cursor_ai to work with LlamaSheets-extracted parquet files and rich cell metadata 📊 Use formatting cues

4

13

84

Jerry Liu

@jerryjliu0

13 days

Automate ETL over Financial Data 📊 Most real-world financials are not “database-shaped”, and require a ton of human effort to manipulate/copy an Excel sheet into structured formats for analysis. We recently launched LlamaSheets - a specialized AI agent that automatically

8

43

231

LlamaIndex 🦙

@llama_index

16 days

POV: You're building an agent and it keeps giving weird answers because your PDF parsing is broken 🫠 This is a great walkthrough by @mesudarshan showing exactly how to use LlamaParse to fix this—from basic setup through advanced configs. The video walks through: · Why most PDF

5

8

34

LlamaIndex 🦙

@llama_index

18 days

Stop losing 80% of your data when extracting from long documents with repeating entities like catalogs, tables, and lists. Our new Table Row extraction target in LlamaExtract solves the core problem: instead of trying to extract everything at once (where LLMs get overwhelmed),

2

6

19

Jerry Liu

@jerryjliu0

18 days

We launched a new API today to let you parse any Excel sheet in a structured table. Take a look at this example on core production costs 🌽: 1️⃣ The table is located at the center of the sheet with headers, footnotes, and a hierarchical column layout 2️⃣ We get back a structured

LlamaIndex 🦙

@llama_index

19 days

Announcing LlamaSheets in beta 🔥 Transform your messy spreadsheets into AI-ready data with our newest LlamaCloud API 📊 LlamaSheets (in beta) is a specialized API that automatically structures complex spreadsheets while preserving their semantic meaning and hierarchical

5

23

234

LlamaIndex 🦙

@llama_index

19 days

Announcing LlamaSheets in beta 🔥 Transform your messy spreadsheets into AI-ready data with our newest LlamaCloud API 📊 LlamaSheets (in beta) is a specialized API that automatically structures complex spreadsheets while preserving their semantic meaning and hierarchical

4

13

73

LlamaIndex 🦙

@llama_index

20 days

Have a LlamaAgent organize your study material: meet StudyLlama, a web app that uses LlamaAgents to help you organize and gather insights from your notes and papers! How it works: 📊 Create categories to classify your notes 📓 Upload your notes 🤖 Watch as LlamaClassify assigns

0

8

52

LlamaIndex 🦙

@llama_index

23 days

Extract data from table rows with precision using LlamaExtract's Table Row mode 📊 LlamaExtract now offers granular extraction capabilities that go beyond document-level processing, giving you powerful control over how your schema is applied: 🎯 Table row extraction applies

2

8

49

LlamaIndex 🦙

@llama_index

24 days

Not another PDF parser 📄 🤯? Here's why AI-powered document parsing is all the rage. AI document parsing has evolved beyond OCR to systems that truly understand documents like humans do 🧠 In our latest blog post, we explore what's changing the game: 📊 Zero-shot semantic

2

21

154