Bioinformatics
Assessment of NER solutions against the first and second CALBC Silver Standard Corpus.
Submitted by rene on Mon, 2012-01-23 10:41Open Mutation Miner (OMM)
Mutations as sources of evolution have long been the focus of attention in the biomedical literature. Accessing the mutational information and their impacts on protein properties facilitates research in various domains, such as enzymology and pharmacology. However, manually reading through the rich and fast growing repository of biomedical literature is expensive and time-consuming. Text mining methods can help by automatically analysing the literature and extracting mutation-related knowledge into a structured represenation.
Our Open Mutation Miner (OMM) system provides a number of advanced text mining components for mutation mining from full-text research papers, including the detection of various forms of mutation mentions, protein properties, organisms, impact mentions, and the relations between them. OMM provides output options in various formats, including populating an OWL ontology, Web service access, structured queries, and interactive use embedded in desktop clients. It is described and evaluated in detail in our paper, "Automated extraction and semantic analysis of mutation impacts from the biomedical literature", BMC Genomics, vol. 13, no. Suppl 4, pp. S10, 06/2012.
Semantic Text Mining for Lignocellulose Research
Submitted by mj on Wed, 2011-11-02 16:38- Login to post comments
- Tagged
- XML
- BibTex
- Google Scholar
Automated Extraction of Protein Mutation Impacts from the Biomedical Literature
Submitted by nona on Sun, 2011-09-18 08:45The OrganismTagger System
Our open source OrganismTagger is a hybrid rule-based/machine-learning system that extracts organism mentions from the biomedical literature, normalizes them to their scientific name, and provides grounding to the NCBI Taxonomy database. Our pipeline provides the flexibility of annotating the species of particular interest to bio-engineers on different corpora, by optionally including detection of common names, acronyms, and strains. The OrganismTagger performance has been evaluated on two manually annotated corpora, OT and Linneaus. On the OT corpus, the OrganismTagger achieves a precision and recall of 95% and 94% and a grounding accuracy of 97.5%. On the manually annotated corpus of Linneaus-100, the results show a precision and recall of 99% and 97% and grounding with an accuracy of 97.4%. It is described in detail in our publication,
"OrganismTagger: Detection, normalization, and grounding of organism entities in biomedical documents",
Bioinformatics, vol. 27, no. 19 Oxford University Press, pp. 2721--2729, August 9, 2011.
OrganismTagger: Detection, normalization, and grounding of organism entities in biomedical documents
Submitted by rene on Sun, 2011-08-14 18:00Genozymes
Within this project, we investigated semantic support, included ontologies, linked data, and text mining, for genozymes for bioproducts and bioprocesses development.
Through the selection of appropriate technologies and their combination in a coherent system that brings measurable improvements to the users, we develop a semantic infrastructure in support of genomics-based lignocellulose research.
Towards Evaluating the Impact of Semantic Support for Curating the Fungus Scientific Literature
Submitted by rene on Thu, 2011-08-04 10:04- Login to post comments
- Tagged
- XML
- BibTex
- Google Scholar
Proceedings of the European Conference on Computational Biology (ECCB) 2010 Workshop: Annotation, interpretation and management of mutations (AIMM)
Submitted by witte on Tue, 2011-07-05 08:31- Login to post comments
- Tagged
- XML
- BibTex
- Google Scholar
Mutation Miner
Submitted by witte on Sat, 2011-01-01 10:11- Login to post comments
- Tagged
- XML
- BibTex
- Google Scholar
