Semantic Assistants
Semantic Assistants: Eclipse Plug-In
Natural Language Processing (NLP) for Software Engineering: Our Eclipse plug-in integrates the Eclipse development environment into the Semantic Assistants architecture. It provides a user interface that offers various Natural Language Processing services to users. In particular, when Eclipse is used as a software development environment, novel semantic analysis services, such as named entity detection or quality analysis of source code comments, can now be offered to software developers.
Text Analytics with Eclipse for Software Engineers
Submitted by witte on Sat, 2011-11-05 16:25
The Organism Tagger System

Our open source OrganismTagger is a hybrid rule-based/machine-learning system that extracts organism mentions from the biomedical literature, normalizes them to their scientific names, and grounds them to the NCBI Taxonomy database. The pipeline can flexibly annotate the species of particular interest to bio-engineers on different corpora by optionally including detection of common names, acronyms, and strains. The OrganismTagger's performance has been evaluated on two manually annotated corpora, OT and Linnaeus. On the OT corpus, the OrganismTagger achieves a precision of 95%, a recall of 94%, and a grounding accuracy of 97.5%. On the manually annotated Linnaeus-100 corpus, it achieves a precision of 99%, a recall of 97%, and a grounding accuracy of 97.4%. It is described in detail in our publication,
"OrganismTagger: Detection, normalization, and grounding of organism entities in biomedical documents",
Bioinformatics: Oxford University Press, August 9, 2011.
New Jenkins Server for Semantic Assistants Project
We now have a public Jenkins server (formerly known as Hudson) available for our Semantic Assistants project that supports the SourceForge code repository.
In the spirit of continuous integration, every check-in to the Subversion repository is built automatically and additionally checked with various tools. The latest build is archived and available for browsing and download.


