Megaparse [LW24]
Open-source Document Parser to Markdown with OCR/LLMs
Developer Tools
GitHub

Featured on December 3rd, 2024

Megaparse is a file parser optimized for LLM ingestion. It parses PDF, DOCX, and PPTX files into a format suited to LLMs, and it is available as a Python package, an API, or a queue.

Upvotes: 219

Comments: 19

Product of the Day: 10th

Top comment

Hi everyone,

Today I’d like to introduce you to the new Quivr project. It is a simple Python package and API that helps you take in documents such as PDF, DOCX, PPTX, ... and turn them into Markdown.

It has several new abilities:
* OCR
* Vision Models
* Table Optimization in the extraction
* Open-source

You can use it in any of your products where you need to parse files and then send them to an LLM or simply store them.

Here is how to get started:
* Go to https://github.com/QuivrHQ/MegaP...
* pip install megaparse
* Have fun

Give it a try! We’d love to hear your feedback and ideas in the comments.

This is part of Supabase mega Launch Week -> https://launchweek.dev/HOME
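
As a rough illustration of the "parse, then store" use described in the comment, here is a minimal Python sketch. It does not use Megaparse's real API, which this post does not show: `parse` is a hypothetical callable standing in for whatever the package exposes, and `convert_to_markdown`, `fake_parse`, and `SUPPORTED_SUFFIXES` are names invented for this example.

```python
from pathlib import Path
from typing import Callable

# File types the post says Megaparse can handle.
SUPPORTED_SUFFIXES = {".pdf", ".docx", ".pptx"}


def convert_to_markdown(
    source: Path, out_dir: Path, parse: Callable[[Path], str]
) -> Path:
    """Run `parse` on `source` and store the resulting Markdown in `out_dir`."""
    if source.suffix.lower() not in SUPPORTED_SUFFIXES:
        raise ValueError(f"unsupported file type: {source.suffix}")
    markdown = parse(source)  # hypothetical parser call, injected by the caller
    out_dir.mkdir(parents=True, exist_ok=True)
    target = out_dir / (source.stem + ".md")
    target.write_text(markdown, encoding="utf-8")
    return target


def fake_parse(path: Path) -> str:
    """Placeholder parser (an assumption): returns a Markdown stub, reads nothing."""
    return f"# {path.stem}\n\n(parsed content would go here)\n"
```

Passing the parser in as an argument keeps the sketch independent of any particular Megaparse version. In real use, `fake_parse` would be replaced by a small wrapper around the package's documented entry point.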
