llama_index Profile Banner
LlamaIndex πŸ¦™ Profile
LlamaIndex πŸ¦™

@llama_index

Followers
105K
Following
1K
Media
2K
Statuses
4K

AI Agents for document OCR + workflows Github: https://t.co/HC19j7veGE Docs: https://t.co/QInqg2yMCJ LlamaCloud: https://t.co/yQGTiRSfFL

Joined December 2022
Don't wanna be here? Send us removal request.
@llama_index
LlamaIndex πŸ¦™
3 days
LlamaSheets is our new way to handle complex, messy spreadsheets that come as many sheets disguised as one, multiple regions that provide different sets of information, and much more. Check out this example of a (generated, fake) company budget sheet. It actually has 4
2
7
41
@llama_index
LlamaIndex πŸ¦™
4 days
"ask" and you shall receive! SemTools now ships with a dedicated "ask" CLI command - performs agentic search over documents - combine with `parse` to create QA workflows over unstructured data - cache your indexes with `workspaces` Learn more:
Tweet card summary image
github.com
v1.5 is shipping with a new ask command in the CLI. This is essentially a RAG agent that will search your files and cite it's sources! # Perform agentic RAG search $ ask "What papers discu...
2
5
19
@llama_index
LlamaIndex πŸ¦™
4 days
Split documents into distinct sections automatically with our new LlamaSplit API πŸ“„βœ‚οΈ We're excited to introduce LlamaSplit (now in beta), which uses AI to automatically separate bundled documents into clear, targeted sections based on categories you define - no more manual
1
9
65
@EverdreamValley
Everdream Village | Everdream Valley
2 days
That outdoor brick oven isn't connected to anything. No kitchen. No house. Just vibes and one enormous orange vegetable. Peak architectural planning. Out now on Steam πŸ‘‡
2
0
20
@jerryjliu0
Jerry Liu
5 days
Scalably Parsing 1M+ PDFs with AI Agents πŸ“ˆπŸ“‘ Here’s a simple tutorial we wrote up showing you how to parse a directory of an arbitrary number of PDFs through our service in a reliable, efficient manner. LlamaParse is designed to handle very large workloads; with some simple
@llama_index
LlamaIndex πŸ¦™
5 days
Need to parse multiple PDFs efficiently? Learn how to use LlamaParse with async batch processing. πŸ“ Process entire folders of PDFs simultaneously instead of one-by-one ⚑ Use asyncio and semaphores to control how many files parse concurrently 🎯 Prevent API rate limit errors
11
26
219
@llama_index
LlamaIndex πŸ¦™
5 days
Need to parse multiple PDFs efficiently? Learn how to use LlamaParse with async batch processing. πŸ“ Process entire folders of PDFs simultaneously instead of one-by-one ⚑ Use asyncio and semaphores to control how many files parse concurrently 🎯 Prevent API rate limit errors
developers.llamaindex.ai
4
8
60
@jerryjliu0
Jerry Liu
6 days
β€œIntelligent Document Processing” πŸ“‘πŸ§ͺ as an industry is gone . With our latest release this week, *anyone* can build and deploy a specialized document agent in seconds βš‘οΈπŸ€–, and customize the steps via code. Let’s take a tour through our invoice processing and contract matching
8
47
394
@jerryjliu0
Jerry Liu
9 days
Document understanding is a huge use case for VLMs, but historically there's been no single "good" benchmark to measure progress here (unlike SWE-bench for coding). This past week I did a deep dive into OlmOCR-Bench, a recent document OCR benchmark that is a huge step in the
@llama_index
LlamaIndex πŸ¦™
10 days
OCR benchmarks matter, so in this blog @jerryjliu0 analyzes OlmOCR-Bench, one of the most influential document OCR benchmarks. TLDR: it’s an important step in the right direction, but doesn’t quite cover real-world document parsing needs. πŸ“Š OlmOCR-Bench covers 1400+ PDFs with
7
19
170
@llama_index
LlamaIndex πŸ¦™
10 days
OCR benchmarks matter, so in this blog @jerryjliu0 analyzes OlmOCR-Bench, one of the most influential document OCR benchmarks. TLDR: it’s an important step in the right direction, but doesn’t quite cover real-world document parsing needs. πŸ“Š OlmOCR-Bench covers 1400+ PDFs with
3
8
53
@llama_index
LlamaIndex πŸ¦™
11 days
Deploy production-ready agent workflows with just one click from LlamaCloud. Here's us deploying the SEC filling extract and review agent! Our new Click-to-Deploy feature lets you build and deploy complete document processing pipelines without touching the command line: πŸš€
1
5
27
@llama_index
LlamaIndex πŸ¦™
12 days
Calling all community members: Join us this Thursday for an office hours in our Discord server, all about LlamaAgents and LlamaSheets. This is a chance to ask anything on your mind about two of our latest releases, and learn about what's coming up next. Drop in anytime from 11AM
2
5
18
@jerryjliu0
Jerry Liu
13 days
Claude Code over Excel++ πŸ€–πŸ“Š Claude already 'works' over Excel, but in a naive manner - it writes raw python/openpyxl to analyze an Excel sheet cell-by-cell and generally lacks a semantic understanding of the content. Basically the coding abstractions used are too low-level to
@llama_index
LlamaIndex πŸ¦™
13 days
Build scripts that automate spreadsheet analysis using coding agents and LlamaSheets to extract clean data from messy Excel files. πŸ€– Set up coding agents like @claudeai and @cursor_ai to work with LlamaSheets-extracted parquet files and rich cell metadata πŸ“Š Use formatting cues
10
39
325
@llama_index
LlamaIndex πŸ¦™
13 days
Build scripts that automate spreadsheet analysis using coding agents and LlamaSheets to extract clean data from messy Excel files. πŸ€– Set up coding agents like @claudeai and @cursor_ai to work with LlamaSheets-extracted parquet files and rich cell metadata πŸ“Š Use formatting cues
4
13
84
@jerryjliu0
Jerry Liu
13 days
Automate ETL over Financial Data πŸ“Š Most real-world financials are not β€œdatabase-shaped”, and requires a ton of human effort to manipulate/copy an Excel sheet into structured formats for analysis. We recently launched LlamaSheets - a specialized AI agent that automatically
8
43
231
@llama_index
LlamaIndex πŸ¦™
16 days
POV: You're building an agent and it keeps giving weird answers because your PDF parsing is broken 🫠 This is a great walkthrough by @mesudarshan showing exactly how to use LlamaParse to fix thisβ€”from basic setup through advanced configs. The video walks through: Β· Why most PDF
5
8
34
@llama_index
LlamaIndex πŸ¦™
18 days
Stop losing 80% of your data when extracting from long documents with repeating entities like catalogs, tables, and lists. Our new Table Row extraction target in LlamaExtract solves the core problem: instead of trying to extract everything at once (where LLMs get overwhelmed),
2
6
19
@jerryjliu0
Jerry Liu
18 days
We launched a new API today to let you parse any Excel sheet in a structured table. Take a look at this example on core production costs 🌽: 1️⃣ The table is located at the center of the sheet with headers, footnotes, and a hierarchical column layout 2️⃣ We get back a structured
@llama_index
LlamaIndex πŸ¦™
19 days
Announcing LlamaSheets in beta πŸ”₯ Transform your messy spreadsheets into AI-ready data with our newest LlamaCloud API πŸ“Š LlamaSheets (in beta) is a specialized API that automatically structures complex spreadsheets while preserving their semantic meaning and hierarchical
5
23
234
@llama_index
LlamaIndex πŸ¦™
19 days
Announcing LlamaSheets in beta πŸ”₯ Transform your messy spreadsheets into AI-ready data with our newest LlamaCloud API πŸ“Š LlamaSheets (in beta) is a specialized API that automatically structures complex spreadsheets while preserving their semantic meaning and hierarchical
4
13
73
@llama_index
LlamaIndex πŸ¦™
20 days
Have a LlamaAgent organize your study material: meet StudyLlama, a web app that uses LlamaAgents to help you organize and gather insights from your notes and papers! How it works: πŸ“Š Create categories to classify your notes πŸ““ Upload your notes πŸ€– Watch as LlamaClassify assigns
0
8
52
@llama_index
LlamaIndex πŸ¦™
23 days
Extract data from table rows with precision using LlamaExtract's Table Row mode πŸ“Š LlamaExtract now offers granular extraction capabilities that go beyond document-level processing, giving you powerful control over how your schema is applied: 🎯 Table row extraction applies
2
8
49
@llama_index
LlamaIndex πŸ¦™
24 days
Not another PDF parser πŸ“„ 🀯? Here's why AI-powered document parsing is all the rave. AI document parsing has evolved beyond OCR to systems that truly understand documents like humans do 🧠 In our latest blog post, we explore what's changing the game: πŸ“Š Zero-shot semantic
2
21
154