Differences between revisions 1 and 4 (spanning 3 versions)
Revision 1 as of 2010-08-09 06:22:17
Size: 2116
Comment:
Revision 4 as of 2011-11-18 16:06:05
Size: 1684
Comment:
Line 4: Line 4:
|| TreeTagger ||The TreeTagger is a tool for automatic annotation of text corpora with part-of-speech and lemma information. || [[http://www.ims.uni-stuttgart.de/projekte/corplex/RFTagger/|RFTagger|class=white]] || The RFTagger is a POS tagger for fine-grained POS tagsets. ||
|| [[http://www.ims.uni-stuttgart.de/projekte/gramotron/SOFTWARE/SFST.html|SFST|class=white]] || SFST is a toolbox for the implementation of morphological analysers and other programs which are based on finite state transducers. || SMOR || SMOR is a German finite-state morphology implemented in the SFST programming language. An older version of SMOR with a few sample lexicon entries comes with the SFST tools (see above). ||
|| [[http://www.ims.uni-stuttgart.de/projekte/gramotron/SOFTWARE/LoPar.html|LoPar|class=white]] || [[http://www.ims.uni-stuttgart.de/projekte/gramotron/SOFTWARE/LoPar.html|LoPar|class=white]] is a parser for head-lexicalized probabilistic context-free grammars. || BitPar || BitPar is an efficient parser for Treebank grammars. ||
|| Trace Parser || BitPar-based English parser which generates analyses with traces || YAP || YAP is a fast parser for feature-based grammars. ||
|| VPF || VPF is a parse forest browser for feature-structure based grammars. || || ||
|| [[http://www.ims.uni-stuttgart.de/projekte/corplex/TreeTagger/|TreeTagger|class=white]] ||The !TreeTagger is a tool for automatic annotation of text corpora with part-of-speech and lemma information. || [[http://www.ims.uni-stuttgart.de/projekte/corplex/RFTagger/|RFTagger|class=white]] || The RFTagger is a POS tagger for fine-grained POS tagsets. ||
|| [[http://www.ims.uni-stuttgart.de/projekte/gramotron/SOFTWARE/SFST.html|SFST|class=white]] || SFST is a toolbox for the implementation of morphological analysers and other programs which are based on finite state transducers. || SMOR || SMOR is a German finite-state morphology implemented in the SFST programming language. An older version of SMOR with a few sample lexicon entries comes with the SFST tools. ||
|| [[http://www.ims.uni-stuttgart.de/projekte/gramotron/SOFTWARE/LoPar.html|LoPar|class=white]] || !LoPar is a parser for head-lexicalized probabilistic context-free grammars. || [[http://www.ims.uni-stuttgart.de/tcl/SOFTWARE/BitPar.html|BitPar|class=white]] || !BitPar is an efficient parser for Treebank grammars. ||
|| [[http://www.ims.uni-stuttgart.de/projekte/gramotron/SOFTWARE/BitPar.html|Trace parser|class=white]] || !BitPar-based English parser which generates analyses with traces || [[http://www.ims.uni-stuttgart.de/projekte/gramotron/SOFTWARE/YAP.html|YAP|class=white]] || YAP is a fast parser for feature-based grammars. ||
|| [[http://www.ims.uni-stuttgart.de/tcl/SOFTWARE/VPF.html|VPF|class=white]] || VPF is a parse forest browser for feature-structure based grammars. || || ||
Line 10: Line 10:

== Text corpora ==
||'''Corpus name''' ||'''Description''' ||'''CQP''' ||'''Source''' ||'''Contact''' ||
||[[TextCorpora/Reuters|Reuters]] ||This is distributed on two CDs and contains about 810,000 Reuters, English Language News stories. It requires about 2.5 GB for storage of the uncompressed files. || ||[[http://trec.nist.gov/data/reuters/reuters.html|Reuters Corpora @ NIST|class=white]] || ||
||[[TextCorpora/GermanWikipedia|German Wikipedia]] ||German Wikipedia articles || (./) ||[[http://www.de.wikipedia.org|http://www.de.wikipedia.org|class=white]] ||LukasMichelbacher ||
||[[TextCorpora/EnglishWikipedia|English Wikipedia]] ||English Wikipedia articles || (./) ||[[http://www.en.wikipedia.org|http://www.en.wikipedia.org|class=white]]||LukasMichelbacher ||
Back to [[extern/StatNLPGroup|StatNLP Group]]

Resources of the StatNLP group


Tools

TreeTagger

The TreeTagger is a tool for automatic annotation of text corpora with part-of-speech and lemma information.

RFTagger

The RFTagger is a POS tagger for fine-grained POS tagsets.

SFST

SFST is a toolbox for implementing morphological analysers and other programs based on finite-state transducers.

SMOR

SMOR is a German finite-state morphology implemented in the SFST programming language. An older version of SMOR with a few sample lexicon entries comes with the SFST tools.

LoPar

LoPar is a parser for head-lexicalized probabilistic context-free grammars.

BitPar

BitPar is an efficient parser for Treebank grammars.

Trace parser

The Trace parser is a BitPar-based English parser which generates analyses with traces.

YAP

YAP is a fast parser for feature-based grammars.

VPF

VPF is a parse forest browser for grammars based on feature structures.

Back to StatNLP Group

extern/StatNLPResources (last edited 2013-03-01 08:45:27 by HinrichSchuetze)