pcfg-generator

A module to convert JSON-formatted treebank data into a stochastic context-free grammar (PCFG).
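
As a rough illustration of the conversion, here is a minimal sketch in Python of the usual relative-frequency estimate: count every rewrite rule in the trees, then divide by the count of its left-hand side. This is not this module's API. The tree shape (`{"label": ..., "children": [...]}` with plain strings as leaves) and the names `count_rules` and `estimate_pcfg` are assumptions made for the example.

```python
from collections import Counter

def count_rules(tree, counts):
    """Recursively count rewrite rules in one tree.

    Assumed (hypothetical) tree shape: {"label": str, "children": [...]},
    where each child is either another such dict or a plain string (a word).
    """
    if isinstance(tree, str):
        return  # a leaf word produces no rule of its own
    rhs = tuple(
        child if isinstance(child, str) else child["label"]
        for child in tree["children"]
    )
    counts[(tree["label"], rhs)] += 1
    for child in tree["children"]:
        count_rules(child, counts)

def estimate_pcfg(trees):
    """Return {(lhs, rhs): probability} using relative-frequency estimates."""
    counts = Counter()
    for tree in trees:
        count_rules(tree, counts)
    # Total number of times each left-hand side was expanded.
    lhs_totals = Counter()
    for (lhs, _rhs), n in counts.items():
        lhs_totals[lhs] += n
    return {rule: n / lhs_totals[rule[0]] for rule, n in counts.items()}

# Usage: two tiny hand-made trees (illustrative data, not from any treebank).
trees = [
    {"label": "S", "children": [
        {"label": "NP", "children": ["dogs"]},
        {"label": "VP", "children": ["bark"]},
    ]},
    {"label": "S", "children": [
        {"label": "NP", "children": ["cats"]},
        {"label": "VP", "children": ["bark"]},
    ]},
]
grammar = estimate_pcfg(trees)
for (lhs, rhs), p in sorted(grammar.items()):
    print(f"{lhs} -> {' '.join(rhs)}  [{p}]")
```

The probabilities for each left-hand side sum to 1 by construction. The sketch does not distinguish a word from a nonterminal with the same spelling; a real converter would need to keep the two apart.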

conllu-stream

Using this module, you can parse CoNLL-U files as a stream of sentence objects. You can also access the low-level line parser if you want finer control.
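
To show the two layers described above (a line parser and a sentence stream built on top of it), here is a minimal sketch in Python. It is not this module's API. The names `parse_line` and `sentences` and the shape of the sentence object are assumptions for the example. The ten column names follow the CoNLL-U format: sentences are separated by blank lines, and lines starting with `#` are comments.

```python
from typing import Dict, Iterable, Iterator

# The ten tab-separated columns of a CoNLL-U token line.
FIELDS = ("id", "form", "lemma", "upos", "xpos",
          "feats", "head", "deprel", "deps", "misc")

def parse_line(line: str) -> Dict[str, str]:
    """Low-level step: split one token line into a dict of its ten fields."""
    cols = line.rstrip("\n").split("\t")
    if len(cols) != len(FIELDS):
        raise ValueError(f"expected {len(FIELDS)} columns, got {len(cols)}")
    return dict(zip(FIELDS, cols))

def sentences(lines: Iterable[str]) -> Iterator[dict]:
    """Yield one sentence object at a time, without loading the whole file.

    Hypothetical sentence shape: {"comments": [...], "tokens": [...]}.
    """
    comments, tokens = [], []
    for raw in lines:
        line = raw.rstrip("\n")
        if not line.strip():
            # A blank line closes the current sentence, if there is one.
            if tokens:
                yield {"comments": comments, "tokens": tokens}
            comments, tokens = [], []
        elif line.startswith("#"):
            comments.append(line[1:].strip())
        else:
            tokens.append(parse_line(line))
    if tokens:
        # The input ended without a trailing blank line.
        yield {"comments": comments, "tokens": tokens}

# Usage: any iterable of lines works, including an open file object.
sample = (
    "# text = Dogs bark\n"
    "1\tDogs\tdog\tNOUN\t_\t_\t2\tnsubj\t_\t_\n"
    "2\tbark\tbark\tVERB\t_\t_\t0\troot\t_\t_\n"
    "\n"
)
for sentence in sentences(sample.splitlines()):
    print([token["form"] for token in sentence["tokens"]])
```

Because `sentences` is a generator, it holds only one sentence in memory at a time, which is the point of treating a large treebank file as a stream.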

penn-treebank-sample

A non-commercial, fair-use subset of the Penn Treebank, in JSON.