<div align="center" xmlns="http://www.w3.org/1999/html">
<!-- logo -->
<p align="center">
  <img src="https://opendatalab.github.io/MinerU/images/MinerU-logo.png" width="300px" style="vertical-align:middle;">
</p>
</div>

<!-- icon -->

[](https://github.com/opendatalab/MinerU)
[](https://github.com/opendatalab/MinerU/issues)
[](https://pypi.org/project/mineru/)
[](https://pepy.tech/project/mineru)
[](https://mineru.net/OpenSourceTools/Extractor?source=github)
[](https://huggingface.co/spaces/opendatalab/MinerU)
[](https://www.modelscope.cn/studios/OpenDataLab/MinerU)
[](https://colab.research.google.com/gist/myhloli/a3cb16570ab3cfeadf9d8f0ac91b4fca/mineru_demo.ipynb)
[](https://arxiv.org/abs/2409.18839)
[](https://arxiv.org/abs/2509.22186)
[](https://deepwiki.com/opendatalab/MinerU)

<div align="center">
<a href="https://trendshift.io/repositories/11174" target="_blank"><img src="https://trendshift.io/api/badge/repositories/11174" alt="opendatalab%2FMinerU | Trendshift" style="width: 250px; height: 55px;" width="250" height="55"/></a>

<!-- hot link -->

<p align="center">
🚀<a href="https://mineru.net/?source=github">MinerU Official Website → ✅ Zero-Install Online Version ✅ Full-Featured Client ✅ Developer API Online Access. Skip the deployment hassle and access every product form with one click. Try it now!</a>
</p>

<!-- join us -->

<p align="center">
👋 Join us on <a href="https://discord.gg/Tdedn9GTXq" target="_blank">Discord</a> and <a href="https://mineru.net/community-portal/?aliasId=3c430f94" target="_blank">WeChat</a>
</p>
</div>

## Project Introduction

MinerU is a tool that converts PDFs into machine-readable formats (e.g., Markdown, JSON), so that their content can be easily extracted into any other format.
MinerU was born during the pre-training process of [InternLM](https://github.com/InternLM/InternLM). We focus on solving symbol conversion issues in scientific literature and hope to contribute to technological development in the era of large models.
Compared with well-known commercial products at home and abroad, MinerU is still young. If you encounter any issues or the results are not as expected, please submit an issue on [GitHub Issues](https://github.com/opendatalab/MinerU/issues) and **attach the relevant PDF**.

## Key Features

- Remove headers, footers, footnotes, page numbers, and similar elements to keep the text semantically coherent
- Output text in human reading order, for single-column, multi-column, and complex layouts
- Preserve the original document structure, including titles, paragraphs, and lists
- Extract images, image descriptions, tables, table titles, and footnotes
- Automatically recognize formulas and convert them to LaTeX
- Automatically recognize tables and convert them to HTML
- Automatically detect scanned and garbled PDFs and enable OCR for them
- OCR detects and recognizes text in 109 languages
- Support multiple output formats, such as multimodal and NLP Markdown, JSON sorted by reading order, and information-rich intermediate formats
- Support multiple visualizations, including layout and span visualization, for quick checks of output quality
- Run in CPU-only environments, with GPU (CUDA), NPU (CANN), and MPS acceleration also supported
- Compatible with Windows, Linux, and macOS

## User Guide

- [Quick Start Guide](./quick_start/index.md)
- [Detailed Usage Instructions](./usage/index.md)