GATE Components
OwlExporter v3.0 Released

We just released a new version of the OwlExporter ontology population plugin for GATE. The OwlExporter processing resource (PR) can be added to any NLP pipeline to populate an existing OWL ontology with entities detected in the corpus. It supports populating separate NLP and domain ontologies, as well as some advanced features such as the export of coreference chains.
In this release, we included a pre-compiled binary and a complete example pipeline that transforms GATE's ANNIE information extraction example into an ontology population system. We also completely revamped the documentation and website to make it more accessible to ontology population novices.
Assessing The Quality Factors Found in In-Line Documentation Written in Natural Language: The JavadocMiner
Submitted by ninus on Fri, 2011-04-15 19:14
Predicate-Argument EXtractor (PAX)
Submitted by ralf on Fri, 2011-04-01 08:12
Flexible Ontology Population from Text: The OwlExporter
Submitted by ninus on Wed, 2011-03-16 14:12
Multi-Lingual Noun Phrase Extractor (MuNPEx) v1.0 for GATE released
The noun phrase chunker MuNPEx (Multi-Lingual Noun Phrase Extractor) is now available in the new and improved release v1.0. MuNPEx is a base NP chunker for the GATE framework, implemented in JAPE. It is fast, robust, customizable, and well-tested, and it currently supports English, German, and French (with Spanish in beta).
Major changes in this release:
- Limited the number of pre- and post-head modifiers to make MuNPEx more robust on certain kinds of input (such as long lists of tags or menu entries when processing web pages)
- New optional grammars to add a HEAD_LEMMA slot to an NP annotation, with the lemma extracted from the GATE morphological analyser (for English), the Durm Lemmatizer (for German), or the TreeTagger (for German, Spanish, French)
- DET/MOD/HEAD/MOD2 slots are now stored as strings (rather than Content objects) to make them easier to export and compatible with the new Predicate-Argument Extractor (PAX) component
- Other code cleanup and improvements
- No longer labeled as "beta" -- five years of testing ought to be enough, we're not Google ;-)
For more details and the download, please visit the MuNPEx page.
Minding the Source: Automatic Tagging of Reported Speech in Newspaper Articles
Submitted by rene on Sat, 2010-07-31 13:01
New GATE PR: The Predicate-Argument Extractor (PAX)
At the LREC workshop New Challenges for NLP Frameworks we released a new component for GATE: The Predicate-Argument Extractor (PAX).
The OwlExporter: Flexible Ontology Population from Text

This page describes the OwlExporter, an open source (AGPL3) component that facilitates populating an OWL Ontology from annotations created by an existing GATE application.
The GATE Predicate-Argument EXtractor Component (PAX)
PAX is a GATE component for extracting predicate-argument structures (PAS) from the output of different parsers.
First Release of the Reported Speech Tagger
Coinciding with the presentation of our paper "Minding the Source: Automatic Tagging of Reported Speech in Newspaper Articles" at LREC 2008, we are happy to announce the first public release of our free/open source Reported Speech Tagging Components.


