Coinciding with the presentation of our paper on Minding the Source: Automatic Tagging of Reported Speech in Newspaper Articles at LREC 2008, we are happy to announce the first public release of our free/open source Reported Speech Tagging Components.
Reported speech in the form of direct and indirect reported speech is an important indicator of evidentiality in traditional newspaper texts, but also increasingly in the new media that rely heavily on citation and quotation of previous postings, as for instance in blogs or newsgroups. We developed an NLP component in form of a GATE resource that can automatically detect and tag reported speech constructs, in particular the source, reporting verb and content. This is intended as a first module for more sophisticated representation and reasoning with attributed information, such as belief reasoning based on nested belief structures.
The growing number of publicly available information sources makes it impossible for individuals to keep track of all the various opinions on one topic. The goal of our fuzzy believer project is to extract and analyze statements of opinion from newspaper articles.
Beliefs are modeled using a fuzzy-theoretic approach applied after NLP-based information extraction. A fuzzy believer models a human agent, deciding what statements to believe or reject based on different, configurable strategies.