Published January 31, 2024 | Version v1.11.0
Software Open

Hyphe: web corpus curation tool & links crawler

  • 1. TANTLab - Aalborg University
  • 2. médialab - Sciences Po
  • 3. OuestWare

Description

A research-driven web crawler which aims at providing a tool to build web corpus by crawling data from the web and generating networks between what we call "web entities", which can be single pages as well as a website, subdomains or parts of it, or even a combination of those.

Files

medialab/hyphe-v1.11.0.zip

Files (42.7 MB)

Name Size Download all
md5:7ff38cc822c87f4eaa04971bc79d96f1
42.7 MB Preview Download

Additional details

Related works

Is supplement to
Software: https://github.com/medialab/hyphe/tree/v1.11.0 (URL)