Published July 7, 2025 | Version v1.12.2
Software Open

Hyphe: web corpus curation tool & links crawler

  • 1. TANTLab - Aalborg University
  • 2. médialab - Sciences Po
  • 3. OuestWare

Description

A research-driven web crawler which aims at providing a tool to build web corpus by crawling data from the web and generating networks between what we call "web entities", which can be single pages as well as a website, subdomains or parts of it, or even a combination of those.

Files

medialab/hyphe-v1.12.2.zip

Files (42.7 MB)

Name Size Download all
md5:1b05ccfc0d09950737016bbb50e94b2f
42.7 MB Preview Download

Additional details

Related works

Is supplement to
Software: https://github.com/medialab/hyphe/tree/v1.12.2 (URL)

Software