⇤ ← Revision 1 as of 2010-08-09 06:22:17
Size: 2116
Comment:
|
Size: 2167
Comment:
|
Deletions are marked like this. | Additions are marked like this. |
Line 16: | Line 16: |
Back to [[extern/StatNLPGroup|StatNLP Group]] |
Resources of the StatNLP group
Tools
The TreeTagger is a tool for automatic annotation of text corpora with part-of-speech and lemma information. |
The RFTagger is a POS tagger for fine-grained POS tagsets. |
||
SFST is a toolbox for the implementation of morphological analysers and other programs which are based on finite state transducers. |
SMOR |
SMOR is a German finite-state morphology implemented in the SFST programming language. An older version of SMOR with a few sample lexicon entries comes with the SFST tools (see above). |
|
LoPar is a parser for head-lexicalized probabilistic context-free grammars. |
BitPar is an efficient parser for Treebank grammars. |
||
Trace Parser |
BitPar-based English parser which generates analyses with traces |
YAP |
YAP is a fast parser for feature-based grammars. |
VPF |
VPF is a parse forest browser for feature-structure based grammars. |
|
|
Text corpora
Corpus name |
Description |
CQP |
Source |
Contact |
This is distributed on two CDs and contains about 810,000 Reuters, English Language News stories. It requires about 2.5 GB for storage of the uncompressed files. |
|
|
||
German Wikipedia articles |
|
|||
English Wikipedia articles |
|
Back to StatNLP Group