Assessment of NER solutions against the first and second CALBC Silver Standard Corpus.

Dietrich Rebholz-Schuhmann; Jimeno Yepes, Antonio; Chen Li; Senay Kafkas; Ian Lewin; Ning Kang; Peter Corbett; David Milward; Ekaterina Buyko; Elena Beisswanger; Kerstin Hornbostel; Alexandre Kouznetsov; René Witte; Jonas B. Laurila; Christopher J.O. Baker; Kuo, Cheng-Ju; Clematide, Simone; Fabio Rinaldi; Richárd Farkas; György Móra; Kazuo Hara; Furlong, Laura I; Michael Rautschka; Neves, Mariana Lara; Pascual-Montano, Alberto; Qi Wei; Nigel Collier; Chowdhury, Md Faisal Mahbub; Alberto Lavelli; Berlanga, Rafael; Roser Morante; Vincent Van Asch; Walter Daelemans; José Luís Marina; van Mulligen, Erik; Kors, Jan; Udo Hahn; Marie-Jean Meurs; Caitlin Murphy; Ingo Morgenstern; Nona Naderi; Greg Butler; Justin Powlowski; Adrian Tsang; René Witte; Nona Naderi; Marie-Jean Meurs; Caitlin Murphy; Nona Naderi; Ingo Morgenstern; Carolina Cantu; Shary Semarjit; Greg Butler; Justin Powlowski; Adrian Tsang; René Witte

OMM Query

OMM Query is our online search interface for an index for full-text research papers from the PMC Open Access Corpus (nearly half a million documents) that have been mined for mutation information with Open Mutation Miner (OMM) and OrganismTagger. It can be accessed using the Mímir query language, combining entity annotations with their features with plain text (see below for some examples).

Note that you can index your own set of documents through OMM and install a local query server, if you want to mine a different set of documents for mutation impact information: all software used in this process is freely available under open source licenses. Besides the web interface, it is also possible to query the Mímir server through a RESTful API.

» READ MORE

Assessment of NER solutions against the first and second CALBC Silver Standard Corpus.

Submitted by rene on Mon, 2012-01-23 09:41

Rebholz-Schuhmann, D., A. Jimeno Yepes, C. Li, S. Kafkas, I. Lewin, N. Kang, P. Corbett, D. Milward, E. Buyko, E. Beisswanger, et al., "Assessment of NER solutions against the first and second CALBC Silver Standard Corpus.", Journal of biomedical semantics, vol. 2 Suppl 5, pp. S11, 2011.

»

Semantic Computing Course

The Semantic Computing course (SOEN 6211) is offered at Concordia University, providing graduate students with a unique opportunity to study research and development of novel semantic software systems. The course is taught by Prof. René Witte and supported by team members from the Semantic Software Lab. Students from other universities in Québec can register for this course through CREPUQ.

This course provide an introduction to selected topics from Semantic Computing, including text mining, tagging and tag analysis, recommender systems, RDF and linked data, semantic desktops and semantic wikis.

» READ MORE

Open Mutation Miner (OMM)

Mutations as sources of evolution have long been the focus of attention in the biomedical literature. Accessing the mutational information and their impacts on protein properties facilitates research in various domains, such as enzymology and pharmacology. However, manually reading through the rich and fast growing repository of biomedical literature is expensive and time-consuming. Text mining methods can help by automatically analysing the literature and extracting mutation-related knowledge into a structured represenation.

Our Open Mutation Miner (OMM) system provides a number of advanced text mining components for mutation mining from full-text research papers, including the detection of various forms of mutation mentions, protein properties, organisms, impact mentions, and the relations between them. OMM provides output options in various formats, including populating an OWL ontology, Web service access, structured queries, and interactive use embedded in desktop clients. It is described and evaluated in detail in our paper, Naderi, N., and R. Witte, "Automated extraction and semantic analysis of mutation impacts from the biomedical literature", BMC Genomics, vol. 13, no. Suppl 4, pp. S10, 06/2012.

» READ MORE

Semantic Text Mining for Lignocellulose Research

Submitted by mj on Wed, 2011-11-02 15:38

Ananiadou, S., D. Lee, S. Navathe, and M. Song (Eds.), Meurs, M. - J., C. Murphy, I. Morgenstern, N. Naderi, G. Butler, J. Powlowski, A. Tsang, and R. Witte, "Semantic Text Mining for Lignocellulose Research", The ACM Fifth International Workshop on Data and Text Mining in Biomedical Informatics in conjunction with CIKM, Glasgow, UK : ACM New York, NY, USA ©2011, 10/2011.

»

OwlExporter v3.0 Released

We just released a new version of the OwlExporter ontology population plugin for GATE. The OwlExporter PR can be added to any NLP pipeline to facilitate the population of an existing OWL ontology with entities detected in the corpus. It supports the population of separate NLP- and domain-ontologies and has support for some advanced features, like the export of coreference chains.

In this release, we included a pre-compiled binary and a complete example pipeline that transforms GATE's ANNIE information extraction example into an ontology population system. We also completely revamped the documentation and website to make it more accessible to ontology population novices.

» READ MORE

»

Login to post comments

Automated Extraction of Protein Mutation Impacts from the Biomedical Literature

Submitted by nona on Sun, 2011-09-18 07:45

Naderi, N., "Automated Extraction of Protein Mutation Impacts from the Biomedical Literature", Department of Computer Science and Software Engineering, M. Comp. Sc., Montreal : Concordia University, 09/2011.

»

The OrganismTagger System

Our open source OrganismTagger is a hybrid rule-based/machine-learning system that extracts organism mentions from the biomedical literature, normalizes them to their scientific name, and provides grounding to the NCBI Taxonomy database. Our pipeline provides the flexibility of annotating the species of particular interest to bio-engineers on different corpora, by optionally including detection of common names, acronyms, and strains. The OrganismTagger performance has been evaluated on two manually annotated corpora, OT and Linneaus. On the OT corpus, the OrganismTagger achieves a precision and recall of 95% and 94% and a grounding accuracy of 97.5%. On the manually annotated corpus of Linneaus-100, the results show a precision and recall of 99% and 97% and grounding with an accuracy of 97.4%. It is described in detail in our publication, Naderi, N., T. Kappler, C. J. O. Baker, and R. Witte, "OrganismTagger: Detection, normalization, and grounding of organism entities in biomedical documents", Bioinformatics, vol. 27, no. 19 Oxford University Press, pp. 2721--2729, August 9, 2011.

» READ MORE

Towards Evaluating the Impact of Semantic Support for Curating the Fungus Scientific Literature

Submitted by rene on Thu, 2011-08-04 09:04

Meurs, M. - J., C. Murphy, N. Naderi, I. Morgenstern, C. Cantu, S. Semarjit, G. Butler, J. Powlowski, A. Tsang, and R. Witte, "Towards Evaluating the Impact of Semantic Support for Curating the Fungus Scientific Literature", The 3rd Canadian Semantic Web Symposium (CSWS2011) , vol. 774 , Vancouver, British Columbia, Canada , 08/2011.

»

Text Mining: Wissensgewinnung aus natürlichsprachigen Dokumenten

Submitted by witte on Tue, 2011-01-04 09:41

Witte, R., and J. Mülle (Eds.), Text Mining: Wissensgewinnung aus natürlichsprachigen Dokumenten, Universität Karlsruhe, Fakultät für Informatik, Institut für Programmstrukturen und Datenorganisation (IPD), 2006.

»

Site Menu

User login

Upcoming events

Popular content

Today's:

All time:

Last viewed:

Current weather

Text Mining

OMM Query

Assessment of NER solutions against the first and second CALBC Silver Standard Corpus.

Semantic Computing Course

Open Mutation Miner (OMM)

Semantic Text Mining for Lignocellulose Research

OwlExporter v3.0 Released

Automated Extraction of Protein Mutation Impacts from the Biomedical Literature

The OrganismTagger System

Towards Evaluating the Impact of Semantic Support for Curating the Fungus Scientific Literature

Text Mining: Wissensgewinnung aus natürlichsprachigen Dokumenten

Tag Cloud

New Publications

Recent blog posts

New forum topics

Syndicate

Search

Semantic Assistants Durm Wiki Open Positions	Search this site: