We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
There was an error while loading. Please reload this page.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. Weโll occasionally send you account related emails.
Already on GitHub? Sign in to your account
wget https://gcore.jsdelivr.net/gh/opendatalab/MinerU@master/scripts/download_models.py -O download_models.py --2025-05-27 10:02:15-- https://gcore.jsdelivr.net/gh/opendatalab/MinerU@master/scripts/download_models.py Resolving gcore.jsdelivr.net (gcore.jsdelivr.net)... 104.16.175.226, 104.16.174.226, 2606:4700::6810:afe2, ... Connecting to gcore.jsdelivr.net (gcore.jsdelivr.net)|104.16.175.226|:443... connected. HTTP request sent, awaiting response... 200 OK Length: unspecified [application/octet-stream] Saving to: 'download_models.py'
download_models.py [ <=> ] 2.22K --.-KB/s in 0s
2025-05-27 10:02:16 (74.8 MB/s) - 'download_models.py' saved [2273]
python download_models.py 2025-05-27 10:02:17,931 - modelscope - INFO - Intra-cloud acceleration enabled for downloading from opendatalab/PDF-Extract-Kit-1.0 Downloading Model from https://www.modelscope.cn to directory: /root/.cache/modelscope/hub/models/opendatalab/PDF-Extract-Kit-1.0 2025-05-27 10:02:18,389 - modelscope - INFO - Got 33 files, start to download ... Downloading [models/OCR/paddleocr_torch/ch_PP-OCRv5_rec_infer.pth]: 0%| | 0.00/31.1M [00:00<?, ?B/s] Mismatched real-time digest found, falling back to lump-sum hash computation| | 0.00/8.57M [00:00<?, ?B/s] 2025-05-27 10:02:18,628 - modelscope - ERROR - File /root/.cache/modelscope/hub/models/._____temp/opendatalab/PDF-Extract-Kit-1.0/models/OCR/paddleocr_torch/ch_PP-OCRv5_rec_infer.pth integrity check failed, expected sha256 signature is 10fec8892e980ef2f06e8c6062e482060870b8e89c6441af4e30e8fca7a21387, actual is e3b0c44298fc1c149afbf4c8996fb92427ae41e4649b934ca495991b7852b855, the download may be incomplete, please try again. | 0.00/13.8M [00:00<?, ?B/s] Downloading [models/OCR/paddleocr_torch/ch_PP-OCRv3_det_infer.pth]: 0%| | 0.00/2.42M [00:00<?, ?B/s] Mismatched real-time digest found, falling back to lump-sum hash computation 0%| | 0.00/92.3M [00:00<?, ?B/s] 2025-05-27 10:02:18,633 - modelscope - ERROR - File /root/.cache/modelscope/hub/models/._____temp/opendatalab/PDF-Extract-Kit-1.0/models/OCR/paddleocr_torch/ch_PP-OCRv3_det_infer.pth integrity check failed, expected sha256 signature is 5e364ffd412f39417db2b4430098cfec7d0f8ed36c859224e3fc036186b91359, actual is e3b0c44298fc1c149afbf4c8996fb92427ae41e4649b934ca495991b7852b855, the download may be incomplete, please try again. Downloading [models/OCR/paddleocr_torch/ch_PP-OCRv4_rec_server_doc_infer.pth]: 0%| | 0.00/96.5M [00:00<?, ?B/s] Mismatched real-time digest found, falling back to lump-sum hash computation 2025-05-27 10:02:18,647 - modelscope - ERROR - File /root/.cache/modelscope/hub/models/._____temp/opendatalab/PDF-Extract-Kit-1.0/models/OCR/paddleocr_torch/ch_PP-OCRv4_rec_server_doc_infer.pth integrity check failed, expected sha256 signature is f65e699f4ca792fbce0e92d1df4c9bbdefe3e21bbdb01c3075cc49470b9bc1cc, actual is e3b0c44298fc1c149afbf4c8996fb92427ae41e4649b934ca495991b7852b855, the download may be incomplete, please try again. Downloading [models/OCR/paddleocr_torch/ch_PP-OCRv5_det_infer.pth]: 0%| | 0.00/13.8M [00:00<?, ?B/s] Mismatched real-time digest found, falling back to lump-sum hash computation 2025-05-27 10:02:18,679 - modelscope - ERROR - File /root/.cache/modelscope/hub/models/._____temp/opendatalab/PDF-Extract-Kit-1.0/models/OCR/paddleocr_torch/ch_PP-OCRv5_det_infer.pth integrity check failed, expected sha256 signature is 8ee4332ce7d75620dbd86c92045d48c22975c260a8bed1440c454dcb234ac76a, actual is e3b0c44298fc1c149afbf4c8996fb92427ae41e4649b934ca495991b7852b855, the download may be incomplete, please try again. Downloading [models/OCR/paddleocr_torch/arabic_PP-OCRv3_rec_infer.pth]: 100%|โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ| 8.57M/8.57M [00:00<00:00, 20.5MB/s] Downloading [models/OCR/paddleocr_torch/ch_PP-OCRv5_rec_server_infer.pth]: 0%| | 0.00/128M [00:00<?, ?B/s] Mismatched real-time digest found, falling back to lump-sum hash computation Downloading [models/OCR/paddleocr_torch/ch_PP-OCRv4_rec_infer.pth]: 4%|โโ | 1.00M/25.7M [00:00<00:09, 2.74MB/s2025-05-27 10:02:18,832 - modelscope - ERROR - File /root/.cache/modelscope/hub/models/._____temp/opendatalab/PDF-Extract-Kit-1.0/models/OCR/paddleocr_torch/ch_PP-OCRv5_rec_server_infer.pth integrity check failed, expected sha256 signature is 7fbfe0b853f8c6ba6e000cacb8ed759e0e9763ef9f9809b46c5fc7d2f5bb9d7b, actual is e3b0c44298fc1c149afbf4c8996fb92427ae41e4649b934ca495991b7852b855, the download may be incomplete, please try again. | 1.00M/92.3M [00:00<00:35, 2.69MB/s] Downloading [models/MFR/unimernet_hf_small_2503/config.json]: 100%|โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ| 4.97k/4.97k [00:00<00:00, 29.7kB/s] Downloading [models/OCR/paddleocr_torch/ch_ptocr_mobile_v2.0_cls_infer.pth]: 100%|โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ| 575k/575k [00:00<00:00, 2.44MB/s] Downloading [models/OCR/paddleocr_torch/ch_PP-OCRv4_det_infer.pth]: 100%|โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ| 13.8M/13.8M [00:00<00:00, 24.7MB/s] Downloading [models/Layout/YOLO/doclayout_yolo_docstructbench_imgsz1280_2501.pt]: 0%| | 0.00/37.9M [00:00<?, ?B/s] Mismatched real-time digest found, falling back to lump-sum hash computation | 0.00/38.8M [00:00<?, ?B/s] 2025-05-27 10:02:19,052 - modelscope - ERROR - File /root/.cache/modelscope/hub/models/._____temp/opendatalab/PDF-Extract-Kit-1.0/models/Layout/YOLO/doclayout_yolo_docstructbench_imgsz1280_2501.pt integrity check failed, expected sha256 signature is 1b152460888dc30be6db7f5dfab28bde3dcc999e5202f46187a764a1699c80be, actual is e3b0c44298fc1c149afbf4c8996fb92427ae41e4649b934ca495991b7852b855, the download may be incomplete, please try again. | 0.00/2.42M [00:00<?, ?B/s] Downloading [models/OCR/paddleocr_torch/chinese_cht_PP-OCRv3_rec_infer.pth]: 100%|โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ| 10.6M/10.6M [00:00<00:00, 21.0MB/s] Downloading [models/OCR/paddleocr_torch/devanagari_PP-OCRv3_rec_infer.pth]: 100%|โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ| 8.57M/8.57M [00:00<00:00, 24.2MB/s] Downloading [models/OCR/paddleocr_torch/ch_PP-OCRv4_rec_infer.pth]: 100%|โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ| 25.7M/25.7M [00:00<00:00, 33.1MB/s] Downloading [models/OCR/paddleocr_torch/en_PP-OCRv3_det_infer.pth]: 100%|โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ| 2.42M/2.42M [00:00<00:00, 8.74MB/s] Downloading [models/OCR/paddleocr_torch/cyrillic_PP-OCRv3_rec_infer.pth]: 100%|โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ| 8.57M/8.57M [00:00<00:00, 19.9MB/s] Downloading [models/MFR/unimernet_hf_small_2503/generation_config.json]: 100%|โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ| 191/191 [00:00<00:00, 1.25kB/s] Downloading [models/MFR/unimernet_hf_small_2503/model.safetensors]: 0%| | 0.00/773M [00:00<?, ?B/s] Mismatched real-time digest found, falling back to lump-sum hash computationโโโโโโ | 1.00M/8.57M [00:00<00:02, 3.89MB/s] 2025-05-27 10:02:19,616 - modelscope - ERROR - File /root/.cache/modelscope/hub/models/._____temp/opendatalab/PDF-Extract-Kit-1.0/models/MFR/unimernet_hf_small_2503/model.safetensors integrity check failed, expected sha256 signature is 9244e2565585c0f89bc3a6eeeea080ef3c588375fc0d536074fe88e80b917cda, actual is e3b0c44298fc1c149afbf4c8996fb92427ae41e4649b934ca495991b7852b855, the download may be incomplete, please try again. | 1.00M/9.44M [00:00<00:01, 4.49MB/s] Downloading [models/OCR/paddleocr_torch/ch_PP-OCRv4_rec_server_infer.pth]: 100%|โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ| 92.3M/92.3M [00:01<00:00, 76.8MB/s] Downloading [models/Layout/YOLO/doclayout_yolo_ft.pt]: 100%|โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ| 38.8M/38.8M [00:00<00:00, 51.5MB/s] Downloading [models/OCR/paddleocr_torch/ka_PP-OCRv3_rec_infer.pth]: 100%|โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ| 8.56M/8.56M [00:00<00:00, 19.5MB/s] Downloading [models/OCR/paddleocr_torch/en_PP-OCRv4_rec_infer.pth]: 100%|โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ| 22.7M/22.7M [00:00<00:00, 38.2MB/s] Downloading [models/OCR/paddleocr_torch/japan_PP-OCRv3_rec_infer.pth]: 100%|โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ| 9.62M/9.62M [00:00<00:00, 20.5MB/s] Downloading [models/OCR/paddleocr_torch/latin_PP-OCRv3_rec_infer.pth]: 100%|โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ| 8.57M/8.57M [00:00<00:00, 20.2MB/s] Downloading [models/OCR/paddleocr_torch/korean_PP-OCRv3_rec_infer.pth]: 100%|โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ| 9.44M/9.44M [00:00<00:00, 21.5MB/s] Downloading [models/MFR/unimernet_hf_small_2503/special_tokens_map.json]: 100%|โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ| 552/552 [00:00<00:00, 2.63kB/s] Downloading [models/MFR/unimernet_hf_small_2503/tokenizer_config.json]: 100%|โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ| 4.42k/4.42k [00:00<00:00, 29.0kB/s] Downloading [models/OCR/paddleocr_torch/Multilingual_PP-OCRv3_det_infer.pth]: 100%|โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ| 2.42M/2.42M [00:00<00:00, 8.60MB/s] Downloading [models/MFR/unimernet_hf_small_2503/tokenizer.json]: 100%|โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ| 3.42M/3.42M [00:00<00:00, 12.8MB/s] Downloading [models/MFR/unimernet_hf_small_2503/README.md]: 100%|โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ| 1.62k/1.62k [00:00<00:00, 4.61kB/s] Downloading [models/OCR/paddleocr_torch/te_PP-OCRv3_rec_infer.pth]: 100%|โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ| 8.56M/8.56M [00:00<00:00, 25.1MB/s] Downloading [models/OCR/paddleocr_torch/ta_PP-OCRv3_rec_infer.pth]: 100%|โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ| 8.56M/8.56M [00:00<00:00, 19.1MB/s] Downloading [models/Layout/YOLO/yolov10l_ft.pt]: 100%|โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ| 49.9M/49.9M [00:00<00:00, 68.3MB/s] Downloading [models/MFD/YOLO/yolo_v8_ft.pt]: 100%|โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ| 334M/334M [00:02<00:00, 129MB/s] Processing 33 items: 3%|โโโ | 1.00/33.0 [00:04<02:09, 4.06s/it] Traceback (most recent call last):hf_small_2503/README.md]: 100%|โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ| 1.62k/1.62k [00:00<00:00, 4.61kB/s] File "/root/download_models.py", line 44, in <module> | 21.0M/334M [00:00<00:08, 40.7MB/s] model_dir = snapshot_download('opendatalab/PDF-Extract-Kit-1.0', allow_patterns=mineru_patterns)โโโโโโโโโโโโโโโโโโโโโโโโโโโโโ | 330M/334M [00:02<00:00, 187MB/s] ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "/root/.conda/envs/mineru/lib/python3.12/site-packages/modelscope/hub/snapshot_download.py", line 108, in snapshot_download return _snapshot_download( ^^^^^^^^^^^^^^^^^^^ File "/root/.conda/envs/mineru/lib/python3.12/site-packages/modelscope/hub/snapshot_download.py", line 289, in _snapshot_download _download_file_lists( File "/root/.conda/envs/mineru/lib/python3.12/site-packages/modelscope/hub/snapshot_download.py", line 540, in _download_file_lists _download_single_file(filtered_repo_files) File "/root/.conda/envs/mineru/lib/python3.12/site-packages/modelscope/utils/thread_utils.py", line 66, in wrapper results.append(future.result()) ^^^^^^^^^^^^^^^ File "/root/.conda/envs/mineru/lib/python3.12/concurrent/futures/_base.py", line 449, in result return self.__get_result() ^^^^^^^^^^^^^^^^^^^ File "/root/.conda/envs/mineru/lib/python3.12/concurrent/futures/_base.py", line 401, in __get_result raise self._exception File "/root/.conda/envs/mineru/lib/python3.12/concurrent/futures/thread.py", line 59, in run result = self.fn(*self.args, **self.kwargs) ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "/root/.conda/envs/mineru/lib/python3.12/site-packages/modelscope/hub/snapshot_download.py", line 527, in _download_single_file download_file( File "/root/.conda/envs/mineru/lib/python3.12/site-packages/modelscope/hub/file_download.py", line 712, in download_file file_integrity_validation(temp_file, expected_hash) File "/root/.conda/envs/mineru/lib/python3.12/site-packages/modelscope/hub/utils/utils.py", line 161, in file_integrity_validation raise FileIntegrityError(msg) modelscope.hub.errors.FileIntegrityError: File /root/.cache/modelscope/hub/models/._____temp/opendatalab/PDF-Extract-Kit-1.0/models/OCR/paddleocr_torch/ch_PP-OCRv5_rec_infer.pth integrity check failed, expected sha256 signature is 10fec8892e980ef2f06e8c6062e482060870b8e89c6441af4e30e8fca7a21387, actual is e3b0c44298fc1c149afbf4c8996fb92427ae41e4649b934ca495991b7852b855, the download may be incomplete, please try again.
ๆง่ก pip install modelscope wget https://gcore.jsdelivr.net/gh/opendatalab/MinerU@master/scripts/download_models.py -O download_models.py python download_models.py ็ดๆฅๅคฑ่ดฅ,ๅฏไปฅๅค็ฐ,็ณป็ปไธบ้ฟ้ไบ็็ณป็ป
Linux
Alibaba Cloud Linux 3.2104 LTS 64ไฝ
No response
The text was updated successfully, but these errors were encountered:
No branches or pull requests
๐ Search before asking | ๆไบคไนๅ่ฏทๅ ๆ็ดข
Description of the bug | ้่ฏฏๆ่ฟฐ
wget https://gcore.jsdelivr.net/gh/opendatalab/MinerU@master/scripts/download_models.py -O download_models.py
--2025-05-27 10:02:15-- https://gcore.jsdelivr.net/gh/opendatalab/MinerU@master/scripts/download_models.py
Resolving gcore.jsdelivr.net (gcore.jsdelivr.net)... 104.16.175.226, 104.16.174.226, 2606:4700::6810:afe2, ...
Connecting to gcore.jsdelivr.net (gcore.jsdelivr.net)|104.16.175.226|:443... connected.
HTTP request sent, awaiting response... 200 OK
Length: unspecified [application/octet-stream]
Saving to: 'download_models.py'
download_models.py [ <=> ] 2.22K --.-KB/s in 0s
2025-05-27 10:02:16 (74.8 MB/s) - 'download_models.py' saved [2273]
How to reproduce the bug | ๅฆไฝๅค็ฐ
ๆง่ก
pip install modelscope
wget https://gcore.jsdelivr.net/gh/opendatalab/MinerU@master/scripts/download_models.py -O download_models.py
python download_models.py
็ดๆฅๅคฑ่ดฅ,ๅฏไปฅๅค็ฐ,็ณป็ปไธบ้ฟ้ไบ็็ณป็ป
Operating System Mode | ๆไฝ็ณป็ป็ฑปๅ
Linux
Operating System Version| ๆไฝ็ณป็ป็ๆฌ
Alibaba Cloud Linux 3.2104 LTS 64ไฝ
Python version | Python ็ๆฌ
No response
Software version | ่ฝฏไปถ็ๆฌ (magic-pdf --version)
No response
Device mode | ่ฎพๅคๆจกๅผ
No response
The text was updated successfully, but these errors were encountered: