Published February 19, 2021
| Version v0.8.0
Software
Open
adbar/trafilatura: trafilatura-0.8.0
Authors/Creators
- 1. Berlin-Brandenburg Academy of Sciences
- 2. @DataDog
- 3. Freelance
- 4. SACHA
- 5. @dolead
- 6. médialab - Sciences Po
Description
- improved link discovery and handling
- fixes in metadata extraction, feeds and sitemaps processing
- breaking change: the
extractfunction now reads target format fromoutput_formatargument only - new extraction option: preserve links, CLI options re-ordered
- more opportunistic backup extraction
Files
adbar/trafilatura-v0.8.0.zip
Files
(14.7 MB)
| Name | Size | Download all |
|---|---|---|
|
md5:8bc8f8b11de78ef00dab10711e6b03d8
|
14.7 MB | Preview Download |
Additional details
Related works
- Is supplement to
- https://github.com/adbar/trafilatura/tree/v0.8.0 (URL)