Category Posts Navigation

HYPHE – Web Corpus Curation Tool Featuring A Research-Driven Web Crawler

Posted by Marcus Zillman

HYPHE – Web Corpus Curation Tool Featuring A Research-Driven Web Crawler
http://hyphe.medialab.sciences-po.fr/

Hyphe provides users with a method to build web corpora implemented as an HTML5 User Interface. In Hyphe, a web corpus is built as a set of web pages curated and organized by the user. Hyphe’s UI guides you through each step of the web corpus curation method and helps you monitor your collect. Hyphe behaves nicely with hundreds of thousands crawled web pages (and a billion more detected) grouped in tens of thousands of web entities for a 2 GB RAM and 5 GB filesystem footprint. This will be added to Bot Research Subject Tracer™. This will be added to Web Data Extractors Subject Tracer™. This will be added to Business Intelligence Resources Subject Tracer™. This will be added to Entrepreneurial Resources Subject Tracer™. This will be added to the tools section of Research Resources Subject Tracer™.

Leave a Reply

Facebook Comments

Sign up for Awareness Watch

* = required field

Browse Categories