Construction of ontology-based software repositories by text mining

Yan Wu, Harvey Siy, Mansour Zand, Victor Winter

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution



Software document repositories store artifacts produced in the course of developing software products, but most repositories are simply archives of documents. It is not unusual to find projects where different software artifacts are scattered across unrelated repositories with varying levels of granularity and without a centralized management system. This makes the information available in existing repositories difficult to reuse. In this paper, a methodology for constructing an ontology-based repository of reusable knowledge is presented. The information in the repository is extracted from specification documents using text mining. Ontologies are used to guide the extraction process and to organize the extracted information. The methodology is being used to develop a repository of recurring and crosscutting aspects in software specification documents.
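The abstract does not give implementation details, so the following is only a minimal sketch of what ontology-guided extraction could look like, not the authors' method. It assumes a toy ontology represented as a mapping from concept names to indicator terms; the concepts, terms, function names, and sample specification text are all hypothetical. Sentences from a specification are filed under every concept whose terms they mention, so the ontology both guides what is extracted and organizes the result.

```python
import re
from collections import defaultdict

# Hypothetical toy ontology: concept -> indicator terms (not taken from the paper).
ONTOLOGY = {
    "Security": ["authenticate", "encrypt", "password"],
    "Logging": ["logged", "audit"],
}


def split_sentences(text):
    """Naive sentence splitter: break after '.', '!' or '?' followed by whitespace."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def extract(text, ontology):
    """Group sentences under each ontology concept whose terms they mention."""
    repository = defaultdict(list)
    for sentence in split_sentences(text):
        words = set(re.findall(r"[a-z]+", sentence.lower()))
        for concept, terms in ontology.items():
            # Prefix match, so that "encrypted" matches the term "encrypt".
            if any(word.startswith(term) for word in words for term in terms):
                repository[concept].append(sentence)
    return dict(repository)


# Hypothetical specification fragment used only for illustration.
SPEC = (
    "The system shall authenticate users by password. "
    "All failed attempts are logged for audit. "
    "Reports are printed weekly."
)

result = extract(SPEC, ONTOLOGY)
```

In this sketch the third sentence matches no concept and is left out of the repository. A realistic pipeline would likely replace the prefix match with proper linguistic processing (tokenization, stemming, part-of-speech tagging) and use a richer ontology than a flat term list.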

Original language: English (US)
Title of host publication: Computational Science - ICCS 2007 - 7th International Conference, Proceedings
Publisher: Springer Verlag
Number of pages: 8
Edition: PART 3
ISBN (Print): 9783540725879
State: Published - 2007
Event: 7th International Conference on Computational Science, ICCS 2007 - Beijing, China
Duration: May 27 2007 to May 30 2007

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Number: PART 3
Volume: 4489 LNCS
ISSN (Print): 0302-9743
ISSN (Electronic): 1611-3349


Conference: 7th International Conference on Computational Science, ICCS 2007


  • Early aspects
  • Ontology
  • Software repository
  • Text mining

ASJC Scopus subject areas

  • Theoretical Computer Science
  • Computer Science (all)
