How It Works
This tool uses Kreuzberg, a Rust-based document intelligence engine compiled to WebAssembly. It runs entirely in your browser — no upload, no server.
Kreuzberg uses format-specific parsers (PDFium for PDFs, native parsers for Office formats, Tesseract WASM for OCR) and reconstructs:
- Headings (H1-H6) from font size and style analysis
- Tables as Markdown pipe tables with headers
- Lists, bold, italic, and hyperlinks
- Metadata: title, authors, dates, page count
- OCR text from images and scanned PDFs