Published November 29, 2022
                      
                       | Version v1.10.5
                    
                    
                      
                        
                          Software
                        
                      
                      
                        
                          
                        
                        
                          Open
                        
                      
                    
                  Hyphe: web corpus curation tool & links crawler
- 1. TANTLab - Aalborg University
- 2. médialab - Sciences Po
- 3. OuestWare
Description
A research-driven web crawler which aims at providing a tool to build web corpus by crawling data from the web and generating networks between what we call "web entities", which can be single pages as well as a website, subdomains or parts of it, or even a combination of those.
Files
      
        medialab/hyphe-v1.10.5.zip
        
      
    
    
      
        Files
         (41.7 MB)
        
      
    
    | Name | Size | Download all | 
|---|---|---|
| md5:fbc8bf3d1a9a13b854026377456a6188 | 41.7 MB | Preview Download | 
Additional details
Related works
- Is supplement to
- https://github.com/medialab/hyphe/tree/v1.10.5 (URL)