--- title: hmd_download keywords: fastai sidebar: home_sidebar summary: "Bulk download Heritage Made Digital digitised newspapers from the British Library Research Repository " description: "Bulk download Heritage Made Digital digitised newspapers from the British Library Research Repository " nb_path: "index.ipynb" ---
This command line tool is intended to make it easy to bulk download Heritage Made Digital Newspapers from the British Library Research Repository.
The tool has been used by Living with Machines but may be of use to other people. Since the tool is intended to download the collection in 'bulk' it is likely to be useful if you either want:
This tool was developed for internal use so it might not be suitable for your needs. If you have problems or suggestions with the tool please open an issue.
python -m pip install git+https://github.com/Living-with-machines/hmd_newspaper_dl
It is recommened to install the package insdide a virtual environment. Since this is a command line tool one simple option for installing is pipx which will install the tool inside a new virtual environment for you:
pipx install git+https://github.com/Living-with-machines/hmd_newspaper_dl
This will by default download all available newspaper titles. If you just want a subset you can pass in a subset parameter to specify how many titles you want. At the moment this is just a random selection.
This tool was put together for internal Living with Machines but is shared in case it is helpful for other people. If you have feedback, problems or want to suggest changes please open a new issue.