|

liteparse_samples: Interactive Parser Demos from LlamaIndex

liteparse_samples is a set of interactive demos for LiteParse — a fast, local, model-free document parser from LlamaIndex. The repo includes three standalone demos built around real government and financial PDFs.

What’s included

  • Parser comparison — Side-by-side output from LiteParse, PyPDF, and PyMuPDF across 8 document sections from 5 real-world PDFs (FDIC, Federal Reserve, CMS, IRS, WHO). Includes per-document timing for each parser.
  • Visual citations — Exact keyword search over parsed documents with bounding boxes highlighted directly on the source PDF page image.
  • Research docs — A Claude Code skill (/research-docs) that parses documents, answers questions with AI, and generates an HTML report with cited source pages highlighted.

Each demo is configured via a docs.json file — add your own PDFs and pages to customize. Requires Python 3.9+.

Repo: github.com/jerryjliu/liteparse_samples · Docs: developers.llamaindex.ai/liteparse

Similar Posts

Leave a Reply

Your email address will not be published. Required fields are marked *