Skip navigation.
Home
Semantic Software Lab
Concordia University
Montréal, Canada

Text Mining and Software Engineering: an Integrated Source Code and Document Analysis Approach

Printer-friendly versionPrinter-friendly versionPDF versionPDF version
TitleText Mining and Software Engineering: an Integrated Source Code and Document Analysis Approach
Publication TypeJournal Article
Year of Publication2008
AuthorsWitte, R., Q. Li, Y. Zhang, and J. Rilling
Refereed DesignationRefereed
JournalIET Software
Volume2
Issue1
Pagination3–16
ISSN1751-8806
Abstract

Documents written in natural languages constitute a major part of the artifacts produced during the software engineering lifecycle. Especially during software maintenance or reverse engineering, semantic information conveyed in these documents can provide important knowledge for the software engineer. In this paper, we present a text mining system capable of populating a software ontology with information detected in documents. A particular novelty is the integration of results from automated source code analysis into an NLP pipeline, allowing to cross-link software artifacts represented in code and natural language on a semantic level.

URLhttp://link.aip.org/link/?SEN/2/3/1
DOI10.1049/iet-sen:20070110
Copyright

Copyright © 2008 IET. This paper is a postprint of a paper submitted to and accepted for publication in the IET Software Journal, Volume: 2, Issue: 1, 2008, and is subject to IET copyright [http://www.iet.org]. The copy of record is available at http://link.aip.org/link/?SEN/2/3/1, DOI: 10.1049/iet-sen:20070110.

Impact Factor

0.620

History

Received 24 September 2007
Published 28 February 2008

AttachmentSize
witte_etal_iet2008.pdf1.5 MB