Semantic Software Lab
Concordia University
Montréal, Canada

Text Mining

An Automatic Workflow for Formalization of Scholarly Articles' Structural and Semantic Elements

Sateli, B., and R. Witte, "An Automatic Workflow for Formalization of Scholarly Articles' Structural and Semantic Elements", in Sack, H., S. Dietze, A. Tordai, and C. Lange (Eds.), The 13th Extended Semantic Web Conference (The Semantic Publishing Challenge 2016), vol. 641, Heraklion, Crete, Greece: Springer International Publishing, pp. 309–320, 06/2016.

LODeXporter: Transforming GATE Annotations to LOD Triples

The LODeXporter is a GATE component that exports NLP annotations directly to a triplestore, using configurable vocabularies, for use in LOD applications.
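The idea of turning annotations into triples under a configurable vocabulary can be illustrated with a minimal Python sketch. This is not the LODeXporter's code or configuration format: the `Annotation` class, the `example.org` URIs, and the `to_ntriples` function are all hypothetical, chosen only to show how a mapping table can decide which annotation types are exported and which predicates are used.

```python
from dataclasses import dataclass

RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"

@dataclass
class Annotation:
    """Simplified stand-in for an NLP annotation (hypothetical structure)."""
    ann_id: int
    ann_type: str
    text: str

# Hypothetical mapping configuration: which annotation types are exported,
# and which vocabulary terms represent them. All URIs are placeholders.
BASE = "http://example.org/doc1#"
TYPE_MAP = {"Claim": "http://example.org/vocab#Claim"}
CONTENT_PREDICATE = "http://example.org/vocab#content"

def escape_literal(value):
    """Escape backslashes, quotes, and newlines for an N-Triples literal."""
    return (value.replace("\\", "\\\\")
                 .replace('"', '\\"')
                 .replace("\n", "\\n"))

def to_ntriples(annotations, base=BASE, type_map=TYPE_MAP,
                content_predicate=CONTENT_PREDICATE):
    """Serialize the mapped annotations as N-Triples lines."""
    lines = []
    for ann in annotations:
        rdf_class = type_map.get(ann.ann_type)
        if rdf_class is None:
            continue  # annotation types without a mapping are not exported
        subject = "<" + base + ann.ann_type + "/" + str(ann.ann_id) + ">"
        lines.append(subject + " <" + RDF_TYPE + "> <" + rdf_class + "> .")
        lines.append(subject + " <" + content_predicate + "> \""
                     + escape_literal(ann.text) + "\" .")
    return lines
```

Calling `to_ntriples` on a list of annotations returns strings that could be loaded into any triplestore accepting N-Triples. Changing `TYPE_MAP` or `CONTENT_PREDICATE` switches vocabularies without touching the export logic.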

Rhetector: Automatic Detection of Rhetorical Entities in Scientific Literature

Rhetector is a GATE plugin for the automatic detection of Rhetorical Entities (REs) in scientific literature. Rhetorical Entities are spans of text (sentences, passages, sections, etc.) in a document in which authors convey their findings, such as Claims or Arguments, to readers. We designed a lightweight pipeline to detect these entities automatically; it is currently limited to Claims and Contributions. The motivation for and applications of Rhetector are described in our publication: Sateli, B., and R. Witte, "Semantic representation of scientific literature: bringing claims, contributions and named entities onto the Linked Open Data cloud", in Sumner, T. (Ed.), PeerJ Computer Science, vol. 1, no. e37, PeerJ, 12/2015.

The GATE LODtagger component

The LODtagger is a GATE component that links entities in a document to their corresponding resources on the Linked Open Data (LOD) cloud. LODtagger relies on external tools, such as DBpedia Spotlight, to perform the actual content tagging, and hides the complexity of communicating with these LOD taggers from pipeline developers.
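To give a sense of what such a wrapper hides, the following Python sketch converts a tagger response into simple entity links. It is not LODtagger's implementation. The JSON shape (a `Resources` list with `@URI`, `@surfaceForm`, `@offset`, and `@similarityScore` fields) is an assumption modeled on DBpedia Spotlight's JSON output, and the sample response, the `extract_links` function, and the threshold value are hypothetical.

```python
import json

# Hypothetical sample response for the text
# "We use DBpedia Spotlight in Montreal."; values are illustrative only.
SAMPLE_RESPONSE = json.dumps({
    "Resources": [
        {"@URI": "http://dbpedia.org/resource/DBpedia",
         "@surfaceForm": "DBpedia Spotlight",
         "@offset": "7",
         "@similarityScore": "0.98"},
        {"@URI": "http://dbpedia.org/resource/Montreal",
         "@surfaceForm": "Montreal",
         "@offset": "28",
         "@similarityScore": "0.31"},
    ]
})

def extract_links(response_text, min_score=0.5):
    """Turn a tagger response into a list of span-to-URI links.

    Entries whose similarity score is below min_score are dropped.
    """
    data = json.loads(response_text)
    links = []
    for resource in data.get("Resources", []):
        score = float(resource.get("@similarityScore", 0.0))
        if score < min_score:
            continue
        start = int(resource["@offset"])
        surface = resource["@surfaceForm"]
        links.append({
            "start": start,
            "end": start + len(surface),  # end offset is exclusive
            "text": surface,
            "uri": resource["@URI"],
        })
    return links
```

Each returned dictionary carries the character span and the linked URI, which is the information a pipeline would need to attach a link to the matching text span as an annotation.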
