Skip to content

Execute download_models.py is failed! FileIntegrityError #2527

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. Weโ€™ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
3 tasks done
ningblue opened this issue May 27, 2025 · 0 comments
Open
3 tasks done

Execute download_models.py is failed! FileIntegrityError #2527

ningblue opened this issue May 27, 2025 · 0 comments
Labels
bug Something isn't working

Comments

@ningblue
Copy link

๐Ÿ”Ž Search before asking | ๆไบคไน‹ๅ‰่ฏทๅ…ˆๆœ็ดข

  • I have searched the MinerU Readme and found no similar bug report.
  • I have searched the MinerU Issues and found no similar bug report.
  • I have searched the MinerU Discussions and found no similar bug report.

Description of the bug | ้”™่ฏฏๆ่ฟฐ

wget https://gcore.jsdelivr.net/gh/opendatalab/MinerU@master/scripts/download_models.py -O download_models.py
--2025-05-27 10:02:15-- https://gcore.jsdelivr.net/gh/opendatalab/MinerU@master/scripts/download_models.py
Resolving gcore.jsdelivr.net (gcore.jsdelivr.net)... 104.16.175.226, 104.16.174.226, 2606:4700::6810:afe2, ...
Connecting to gcore.jsdelivr.net (gcore.jsdelivr.net)|104.16.175.226|:443... connected.
HTTP request sent, awaiting response... 200 OK
Length: unspecified [application/octet-stream]
Saving to: 'download_models.py'

download_models.py [ <=> ] 2.22K --.-KB/s in 0s

2025-05-27 10:02:16 (74.8 MB/s) - 'download_models.py' saved [2273]

python download_models.py
2025-05-27 10:02:17,931 - modelscope - INFO - Intra-cloud acceleration enabled for downloading from opendatalab/PDF-Extract-Kit-1.0
Downloading Model from https://www.modelscope.cn to directory: /root/.cache/modelscope/hub/models/opendatalab/PDF-Extract-Kit-1.0
2025-05-27 10:02:18,389 - modelscope - INFO - Got 33 files, start to download ...
Downloading [models/OCR/paddleocr_torch/ch_PP-OCRv5_rec_infer.pth]:   0%|                                                               | 0.00/31.1M [00:00<?, ?B/s]
Mismatched real-time digest found, falling back to lump-sum hash computation|                                                           | 0.00/8.57M [00:00<?, ?B/s]
2025-05-27 10:02:18,628 - modelscope - ERROR - File /root/.cache/modelscope/hub/models/._____temp/opendatalab/PDF-Extract-Kit-1.0/models/OCR/paddleocr_torch/ch_PP-OCRv5_rec_infer.pth integrity check failed, expected sha256 signature is 10fec8892e980ef2f06e8c6062e482060870b8e89c6441af4e30e8fca7a21387, actual is e3b0c44298fc1c149afbf4c8996fb92427ae41e4649b934ca495991b7852b855, the download may be incomplete, please try again.                                     | 0.00/13.8M [00:00<?, ?B/s]
Downloading [models/OCR/paddleocr_torch/ch_PP-OCRv3_det_infer.pth]:   0%|                                                               | 0.00/2.42M [00:00<?, ?B/s]
Mismatched real-time digest found, falling back to lump-sum hash computation 0%|                                                        | 0.00/92.3M [00:00<?, ?B/s]
2025-05-27 10:02:18,633 - modelscope - ERROR - File /root/.cache/modelscope/hub/models/._____temp/opendatalab/PDF-Extract-Kit-1.0/models/OCR/paddleocr_torch/ch_PP-OCRv3_det_infer.pth integrity check failed, expected sha256 signature is 5e364ffd412f39417db2b4430098cfec7d0f8ed36c859224e3fc036186b91359, actual is e3b0c44298fc1c149afbf4c8996fb92427ae41e4649b934ca495991b7852b855, the download may be incomplete, please try again.
Downloading [models/OCR/paddleocr_torch/ch_PP-OCRv4_rec_server_doc_infer.pth]:   0%|                                                    | 0.00/96.5M [00:00<?, ?B/s]
Mismatched real-time digest found, falling back to lump-sum hash computation
2025-05-27 10:02:18,647 - modelscope - ERROR - File /root/.cache/modelscope/hub/models/._____temp/opendatalab/PDF-Extract-Kit-1.0/models/OCR/paddleocr_torch/ch_PP-OCRv4_rec_server_doc_infer.pth integrity check failed, expected sha256 signature is f65e699f4ca792fbce0e92d1df4c9bbdefe3e21bbdb01c3075cc49470b9bc1cc, actual is e3b0c44298fc1c149afbf4c8996fb92427ae41e4649b934ca495991b7852b855, the download may be incomplete, please try again.
Downloading [models/OCR/paddleocr_torch/ch_PP-OCRv5_det_infer.pth]:   0%|                                                               | 0.00/13.8M [00:00<?, ?B/s]
Mismatched real-time digest found, falling back to lump-sum hash computation
2025-05-27 10:02:18,679 - modelscope - ERROR - File /root/.cache/modelscope/hub/models/._____temp/opendatalab/PDF-Extract-Kit-1.0/models/OCR/paddleocr_torch/ch_PP-OCRv5_det_infer.pth integrity check failed, expected sha256 signature is 8ee4332ce7d75620dbd86c92045d48c22975c260a8bed1440c454dcb234ac76a, actual is e3b0c44298fc1c149afbf4c8996fb92427ae41e4649b934ca495991b7852b855, the download may be incomplete, please try again.
Downloading [models/OCR/paddleocr_torch/arabic_PP-OCRv3_rec_infer.pth]: 100%|โ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆ| 8.57M/8.57M [00:00<00:00, 20.5MB/s]
Downloading [models/OCR/paddleocr_torch/ch_PP-OCRv5_rec_server_infer.pth]:   0%|                                                         | 0.00/128M [00:00<?, ?B/s]
Mismatched real-time digest found, falling back to lump-sum hash computation
Downloading [models/OCR/paddleocr_torch/ch_PP-OCRv4_rec_infer.pth]:   4%|โ–ˆโ–ˆ                                                    | 1.00M/25.7M [00:00<00:09, 2.74MB/s2025-05-27 10:02:18,832 - modelscope - ERROR - File /root/.cache/modelscope/hub/models/._____temp/opendatalab/PDF-Extract-Kit-1.0/models/OCR/paddleocr_torch/ch_PP-OCRv5_rec_server_infer.pth integrity check failed, expected sha256 signature is 7fbfe0b853f8c6ba6e000cacb8ed759e0e9763ef9f9809b46c5fc7d2f5bb9d7b, actual is e3b0c44298fc1c149afbf4c8996fb92427ae41e4649b934ca495991b7852b855, the download may be incomplete, please try again.                      | 1.00M/92.3M [00:00<00:35, 2.69MB/s]
Downloading [models/MFR/unimernet_hf_small_2503/config.json]: 100%|โ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆ| 4.97k/4.97k [00:00<00:00, 29.7kB/s]
Downloading [models/OCR/paddleocr_torch/ch_ptocr_mobile_v2.0_cls_infer.pth]: 100%|โ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆ| 575k/575k [00:00<00:00, 2.44MB/s]
Downloading [models/OCR/paddleocr_torch/ch_PP-OCRv4_det_infer.pth]: 100%|โ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆ| 13.8M/13.8M [00:00<00:00, 24.7MB/s]
Downloading [models/Layout/YOLO/doclayout_yolo_docstructbench_imgsz1280_2501.pt]:   0%|                                                 | 0.00/37.9M [00:00<?, ?B/s]
Mismatched real-time digest found, falling back to lump-sum hash computation                                                            | 0.00/38.8M [00:00<?, ?B/s]
2025-05-27 10:02:19,052 - modelscope - ERROR - File /root/.cache/modelscope/hub/models/._____temp/opendatalab/PDF-Extract-Kit-1.0/models/Layout/YOLO/doclayout_yolo_docstructbench_imgsz1280_2501.pt integrity check failed, expected sha256 signature is 1b152460888dc30be6db7f5dfab28bde3dcc999e5202f46187a764a1699c80be, actual is e3b0c44298fc1c149afbf4c8996fb92427ae41e4649b934ca495991b7852b855, the download may be incomplete, please try again.                       | 0.00/2.42M [00:00<?, ?B/s]
Downloading [models/OCR/paddleocr_torch/chinese_cht_PP-OCRv3_rec_infer.pth]: 100%|โ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆ| 10.6M/10.6M [00:00<00:00, 21.0MB/s]
Downloading [models/OCR/paddleocr_torch/devanagari_PP-OCRv3_rec_infer.pth]: 100%|โ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆ| 8.57M/8.57M [00:00<00:00, 24.2MB/s]
Downloading [models/OCR/paddleocr_torch/ch_PP-OCRv4_rec_infer.pth]: 100%|โ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆ| 25.7M/25.7M [00:00<00:00, 33.1MB/s]
Downloading [models/OCR/paddleocr_torch/en_PP-OCRv3_det_infer.pth]: 100%|โ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆ| 2.42M/2.42M [00:00<00:00, 8.74MB/s]
Downloading [models/OCR/paddleocr_torch/cyrillic_PP-OCRv3_rec_infer.pth]: 100%|โ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆ| 8.57M/8.57M [00:00<00:00, 19.9MB/s]
Downloading [models/MFR/unimernet_hf_small_2503/generation_config.json]: 100%|โ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆ| 191/191 [00:00<00:00, 1.25kB/s]
Downloading [models/MFR/unimernet_hf_small_2503/model.safetensors]:   0%|                                                                | 0.00/773M [00:00<?, ?B/s]
Mismatched real-time digest found, falling back to lump-sum hash computationโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–‰                                             | 1.00M/8.57M [00:00<00:02, 3.89MB/s]
2025-05-27 10:02:19,616 - modelscope - ERROR - File /root/.cache/modelscope/hub/models/._____temp/opendatalab/PDF-Extract-Kit-1.0/models/MFR/unimernet_hf_small_2503/model.safetensors integrity check failed, expected sha256 signature is 9244e2565585c0f89bc3a6eeeea080ef3c588375fc0d536074fe88e80b917cda, actual is e3b0c44298fc1c149afbf4c8996fb92427ae41e4649b934ca495991b7852b855, the download may be incomplete, please try again.                            | 1.00M/9.44M [00:00<00:01, 4.49MB/s]
Downloading [models/OCR/paddleocr_torch/ch_PP-OCRv4_rec_server_infer.pth]: 100%|โ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆ| 92.3M/92.3M [00:01<00:00, 76.8MB/s]
Downloading [models/Layout/YOLO/doclayout_yolo_ft.pt]: 100%|โ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆ| 38.8M/38.8M [00:00<00:00, 51.5MB/s]
Downloading [models/OCR/paddleocr_torch/ka_PP-OCRv3_rec_infer.pth]: 100%|โ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆ| 8.56M/8.56M [00:00<00:00, 19.5MB/s]
Downloading [models/OCR/paddleocr_torch/en_PP-OCRv4_rec_infer.pth]: 100%|โ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆ| 22.7M/22.7M [00:00<00:00, 38.2MB/s]
Downloading [models/OCR/paddleocr_torch/japan_PP-OCRv3_rec_infer.pth]: 100%|โ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆ| 9.62M/9.62M [00:00<00:00, 20.5MB/s]
Downloading [models/OCR/paddleocr_torch/latin_PP-OCRv3_rec_infer.pth]: 100%|โ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆ| 8.57M/8.57M [00:00<00:00, 20.2MB/s]
Downloading [models/OCR/paddleocr_torch/korean_PP-OCRv3_rec_infer.pth]: 100%|โ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆ| 9.44M/9.44M [00:00<00:00, 21.5MB/s]
Downloading [models/MFR/unimernet_hf_small_2503/special_tokens_map.json]: 100%|โ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆ| 552/552 [00:00<00:00, 2.63kB/s]
Downloading [models/MFR/unimernet_hf_small_2503/tokenizer_config.json]: 100%|โ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆ| 4.42k/4.42k [00:00<00:00, 29.0kB/s]
Downloading [models/OCR/paddleocr_torch/Multilingual_PP-OCRv3_det_infer.pth]: 100%|โ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆ| 2.42M/2.42M [00:00<00:00, 8.60MB/s]
Downloading [models/MFR/unimernet_hf_small_2503/tokenizer.json]: 100%|โ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆ| 3.42M/3.42M [00:00<00:00, 12.8MB/s]
Downloading [models/MFR/unimernet_hf_small_2503/README.md]: 100%|โ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆ| 1.62k/1.62k [00:00<00:00, 4.61kB/s]
Downloading [models/OCR/paddleocr_torch/te_PP-OCRv3_rec_infer.pth]: 100%|โ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆ| 8.56M/8.56M [00:00<00:00, 25.1MB/s]
Downloading [models/OCR/paddleocr_torch/ta_PP-OCRv3_rec_infer.pth]: 100%|โ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆ| 8.56M/8.56M [00:00<00:00, 19.1MB/s]
Downloading [models/Layout/YOLO/yolov10l_ft.pt]: 100%|โ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆ| 49.9M/49.9M [00:00<00:00, 68.3MB/s]
Downloading [models/MFD/YOLO/yolo_v8_ft.pt]: 100%|โ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆ| 334M/334M [00:02<00:00, 129MB/s]
Processing 33 items:   3%|โ–ˆโ–ˆโ–ˆ                                                                                                    | 1.00/33.0 [00:04<02:09, 4.06s/it]
Traceback (most recent call last):hf_small_2503/README.md]: 100%|โ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆ| 1.62k/1.62k [00:00<00:00, 4.61kB/s]
  File "/root/download_models.py", line 44, in <module>                                                                         | 21.0M/334M [00:00<00:08, 40.7MB/s]
    model_dir = snapshot_download('opendatalab/PDF-Extract-Kit-1.0', allow_patterns=mineru_patterns)โ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆ | 330M/334M [00:02<00:00, 187MB/s]
                ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/root/.conda/envs/mineru/lib/python3.12/site-packages/modelscope/hub/snapshot_download.py", line 108, in snapshot_download
    return _snapshot_download(
           ^^^^^^^^^^^^^^^^^^^
  File "/root/.conda/envs/mineru/lib/python3.12/site-packages/modelscope/hub/snapshot_download.py", line 289, in _snapshot_download
    _download_file_lists(
  File "/root/.conda/envs/mineru/lib/python3.12/site-packages/modelscope/hub/snapshot_download.py", line 540, in _download_file_lists
    _download_single_file(filtered_repo_files)
  File "/root/.conda/envs/mineru/lib/python3.12/site-packages/modelscope/utils/thread_utils.py", line 66, in wrapper
    results.append(future.result())
                   ^^^^^^^^^^^^^^^
  File "/root/.conda/envs/mineru/lib/python3.12/concurrent/futures/_base.py", line 449, in result
    return self.__get_result()
           ^^^^^^^^^^^^^^^^^^^
  File "/root/.conda/envs/mineru/lib/python3.12/concurrent/futures/_base.py", line 401, in __get_result
    raise self._exception
  File "/root/.conda/envs/mineru/lib/python3.12/concurrent/futures/thread.py", line 59, in run
    result = self.fn(*self.args, **self.kwargs)
             ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/root/.conda/envs/mineru/lib/python3.12/site-packages/modelscope/hub/snapshot_download.py", line 527, in _download_single_file
    download_file(
  File "/root/.conda/envs/mineru/lib/python3.12/site-packages/modelscope/hub/file_download.py", line 712, in download_file
    file_integrity_validation(temp_file, expected_hash)
  File "/root/.conda/envs/mineru/lib/python3.12/site-packages/modelscope/hub/utils/utils.py", line 161, in file_integrity_validation
    raise FileIntegrityError(msg)
modelscope.hub.errors.FileIntegrityError: File /root/.cache/modelscope/hub/models/._____temp/opendatalab/PDF-Extract-Kit-1.0/models/OCR/paddleocr_torch/ch_PP-OCRv5_rec_infer.pth integrity check failed, expected sha256 signature is 10fec8892e980ef2f06e8c6062e482060870b8e89c6441af4e30e8fca7a21387, actual is e3b0c44298fc1c149afbf4c8996fb92427ae41e4649b934ca495991b7852b855, the download may be incomplete, please try again.

How to reproduce the bug | ๅฆ‚ไฝ•ๅค็Žฐ

ๆ‰ง่กŒ
pip install modelscope
wget https://gcore.jsdelivr.net/gh/opendatalab/MinerU@master/scripts/download_models.py -O download_models.py
python download_models.py
็›ดๆŽฅๅคฑ่ดฅ,ๅฏไปฅๅค็Žฐ,็ณป็ปŸไธบ้˜ฟ้‡Œไบ‘็š„็ณป็ปŸ

Operating System Mode | ๆ“ไฝœ็ณป็ปŸ็ฑปๅž‹

Linux

Operating System Version| ๆ“ไฝœ็ณป็ปŸ็‰ˆๆœฌ

Alibaba Cloud Linux 3.2104 LTS 64ไฝ

Python version | Python ็‰ˆๆœฌ

No response

Software version | ่ฝฏไปถ็‰ˆๆœฌ (magic-pdf --version)

No response

Device mode | ่ฎพๅค‡ๆจกๅผ

No response

@ningblue ningblue added the bug Something isn't working label May 27, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

1 participant