A list of ~92,000 podcast feed URLs scraped from the Apple podcast database in late 2021, including a script for downloading those podcasts.
I've included a script that will download all these podcasts, which is basically impossible because they would consume somewhere around 1PB of data. Instead, you can download the podcasts in chunks, and process those chunks individually (as I have done).
Usage:
- Edit download.py, change num_workers=6to however many concurrent downloads you want to use. There is pretty much no upper bound other than your connection speed.
- Change output_dir=to wherever you want to download the podcasts to.
- python download.pyand watch your disk fill up.
Enjoy!