Skip to main content

Welcome to LlamaCloud 🦙

LlamaCloud is a hosted service for document processing and search, powered by LlamaIndex. It consists of three primary components:

Parse

Parse transforms complex documents into LLM-ready structured data without losing context:

Support for 50+ file formats (PDF, DOCX, PPTX, XLSX, HTML, EPUB, images)
Multimodal parsing options using LLMs and LVMs for complex documents
Advanced parsing capabilities including tables, charts, and layout extraction
Customizable parsing with predefined Cost Effective, Agentic, and Agentic Plus modes

Extract

Extract transforms complex documents into well-typed structured data with:

Customizable extraction agents and schemas
Batch processing capabilities for scale
Iterative schema development

Index

Index transforms document collections into searchable knowledge bases with:

Seamless integration with popular vector databases
Automated syncing from data sources to vector stores
Built-in query interface for retrieving relevant information
Customizable indexing pipeline for RAG applications

Get started with Web UI, Python SDK, and REST API. Sign up for an account to get started or explore the documentation for each component.