GROBID

Posted by Marcus Zillman

GROBID
https://github.com/kermitt2/grobid

GROBID (or Grobid) stands for GeneRation Of BIbliographic Data. GROBID is a machine learning library for extracting, parsing and restructuring raw documents such as PDFs into structured TEI-encoded documents, with a particular focus on technical and scientific publications. Development started in 2008 as a hobby, and the tool was released as open source in 2011. Work on GROBID has continued steadily as a side project ever since and is expected to continue until at least 2020.
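To give a feel for what "structured TEI-encoded documents" means in practice, here is a minimal sketch of reading a title and author names out of a TEI header using only the Python standard library. The sample XML is hand-written for illustration and is not actual GROBID output; the `read_header` helper and the exact element layout are assumptions, and real output will be richer and may be arranged differently.

```python
import xml.etree.ElementTree as ET

# TEI documents use this XML namespace.
TEI_NS = {"tei": "http://www.tei-c.org/ns/1.0"}

# Hand-written, simplified TEI header (illustrative only, not real GROBID output).
SAMPLE_TEI = """<TEI xmlns="http://www.tei-c.org/ns/1.0">
  <teiHeader>
    <fileDesc>
      <titleStmt>
        <title>An Example Paper</title>
      </titleStmt>
      <sourceDesc>
        <biblStruct>
          <analytic>
            <author><persName><forename>Ada</forename><surname>Lovelace</surname></persName></author>
            <author><persName><forename>Alan</forename><surname>Turing</surname></persName></author>
          </analytic>
        </biblStruct>
      </sourceDesc>
    </fileDesc>
  </teiHeader>
</TEI>"""


def read_header(tei_xml):
    """Hypothetical helper: pull the title and author names from a TEI string."""
    root = ET.fromstring(tei_xml)
    title = root.findtext(".//tei:titleStmt/tei:title", default="", namespaces=TEI_NS)
    authors = []
    for pers in root.iterfind(".//tei:author/tei:persName", TEI_NS):
        forename = pers.findtext("tei:forename", default="", namespaces=TEI_NS)
        surname = pers.findtext("tei:surname", default="", namespaces=TEI_NS)
        authors.append(f"{forename} {surname}".strip())
    return {"title": title, "authors": authors}


if __name__ == "__main__":
    print(read_header(SAMPLE_TEI))
```

Because the result is ordinary XML, any XML-aware tool can work with it; the namespace mapping is the main detail to get right when querying TEI elements.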
