Pydelphin plugin for the LinGO Redwoods Treebank
pip install delphin.redwoodsrequirements:
pydelphin >= 1.0.0svn >= 0.3.46
Treebank class that acts as a bundle, grouping testsuites of interest
from delphin.redwoods import TreebankRedwoods data can be retrieved from 3 sources:
- User specified path to gold parse
profile = Treebank("wsj00a", "path/to/gold/")- If environment
$LOGONROOTis setup, use its remote copy of Redwoods
profile = Treebank("wsj00a")- retreave svn repository to
~/redwoods<TAG>(default option)
profile = Treebank("wsj00a")When retreiving the data you can specified which tag version to use. Currently supported versions:
- 1214 (default)
- 2018
User can edit the profiles stored in the bundle:
profile = Treebank("wsj00") # access to profile wsj00
profile.upload("wsj01") # access to profiles wsj00 and wsj01
profile.remove("wsj00") # access to profiles wsj01There exists support to create standard bundle sets of profiles:
deepbank_train = Treebank("deepbank.train") # wsj section 0 - 19
deepbank_dev = Treebank("deepbank.dev") # wsj section 20
deepbank_test = ReTreebankdwoods("deepbank.test") # wsj section 21TreebankResponse can be retrieved for a single profile or all profiles specified in the bundle:
profile.get("wsj00") # ``TreebankResponse`` for Profile wsj00
profile.get_all() # ``TreebankResponse`` for all profiles specified in ``profile``Each TreebankResponse consists of the following information:
metadataabout the the partition, including its description, split, and stats.resultsfor each sentence the following information is recorded:surface: surface formderivation: derivation informationtree: syntactic treemrs: mrs representation