Tools & Resources
Our lab published a number of free/open source tools, components, frameworks, and resources for NLP and Semantic Computing.
Frameworks and Architectures
Semantic Assistants
The Semantic Assistants architecture provides for easy integration of NLP into (desktop) clients using W3C Web Services and ontologies. The server-side part integrates the GATE framework and allows to publish any existing pipeline as an NLP Web service and a plugin for the OpenOffice.org word processor provides for executing semantic services.
Corpora
Durm Corpus
A single book from a historic encyclopedia of architecture, written in German (in various formats).
NLP Components
OwlExporter
A GATE component for easy ontology population from text.
Predicate-Argument Extractor
A GATE component that extract predicate-argument structures (subject, predicate, object triples) in a common format from the output of different parsers (RASP, Minipar, Stanford, SUPPLE).
Durm German Lemmatizer
The Durm self-learning, context-aware lemmatizer for German nouns.
Reported Speech Tagger
Our reported speech tagger for English newspaper articles.
MuNPEx NP Chunker
The multi-lingual noun phrase (NP) chunker MuNPEx for GATE.
Other Tools and Resources
The Javadoc NLP Corpus Generation Doclet
A doclet for Javadoc that allow to generate a corpus from Java source code optimized for NLP processing of source code comments.
Support
For questions, comments, etc., please visit the Tools & Resources Forum.









