Published June 11, 2024 | Version v0.4.0
Software
Open
liserman/archiveRetriever: archiveRetriever 0.4.0
Description
archiveRetriever 0.4.0
- Replace deprecated functions of dependencies
- Fix bugs in archive_overview() and retrieve_urls()
- New option nonArchive added to retrieve_links() and scrape_urls(). This option allows users to scrape web pages that are not hosted by the Internet Archive.
- New feature added to the collapse option of scrape_urls(). collapse can now also take an XPath as input, which collapses results based on that structuring XPath. This works only with XPaths, not with CSS selectors. If used, Paths refers only to children of the structuring XPath given in collapse.
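The idea behind collapsing on a structuring XPath can be illustrated with a minimal sketch. It uses the xml2 package directly on an inline HTML string and is not archiveRetriever's internal implementation. The HTML content, the variable names (structure_xpath, child_paths), and the relative `.//` path style are assumptions made for illustration only.

```r
library(xml2)

# Inline HTML standing in for a downloaded page (hypothetical content).
html <- '<html><body>
  <div class="post"><h2>First</h2><p>Alpha</p><p>Beta</p></div>
  <div class="post"><h2>Second</h2><p>Gamma</p></div>
</body></html>'
page <- read_html(html)

# Structuring XPath: each matching node becomes one row of the result.
structure_xpath <- "//div[@class='post']"

# Child XPaths, evaluated relative to each structuring node
# (the leading dot restricts the search to that node's descendants).
child_paths <- c(title = ".//h2", text = ".//p")

blocks <- xml_find_all(page, structure_xpath)

# For every structuring node, extract each child path and paste
# multiple matches into a single string.
rows <- lapply(blocks, function(block) {
  cells <- vapply(child_paths, function(path) {
    paste(xml_text(xml_find_all(block, path)), collapse = " ")
  }, character(1))
  as.data.frame(as.list(cells), stringsAsFactors = FALSE)
})

result <- do.call(rbind, rows)
print(result)
```

In this sketch every `div.post` yields one row, and the two paragraphs of the first post are merged into a single cell. In scrape_urls() the structuring XPath would presumably be passed via collapse and the child XPaths via Paths; consult the package documentation for the exact path syntax it expects.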
Files
Name | Size |
---|---|
liserman/archiveRetriever-v0.4.0.zip (md5:a449ebb7a4bbd96af6dae88b17637d92) | 2.5 MB |
Additional details
Related works
- Is supplement to
- Software: https://github.com/liserman/archiveRetriever/tree/v0.4.0 (URL)