Summarizing developer work history using time series segmentation

Harvey Siy, Parvathi Chundi, Mahadevan Subramaniam

Research output: Chapter in Book/Report/Conference proceedingConference contribution

5 Scopus citations

Abstract

Temporal segmentation partitions time series data with the intent of producing more homogeneous segments. It is a technique used to preprocess data so that subsequent time series analysis on individual segments can detect trends that may not be evident when performing time series analysis on the entire dataset. This technique allows data miners to partition a large dataset without making any assumption of periodicity or aan other a priori knowdedge of the dataset's features. We investigate the insights that can be gained from the application of time series segmentation to software version repositories. Software version repositories from large projects contain on the order of hundreds of thousands of timestamped entries or more. It is a continuing challenge to aggregate such data so that noise is reduced and important characteristics are brought out. In this paper, we present a way to summarize developer work history in terms of the files they have modified over time by segmenting the CVS change data of individual Eclipse developers. We show that the files they modify tends to change significantly over time though most of them tend to work within the same directories.

Original languageEnglish (US)
Title of host publication30th International Conference on Software Engineering, ICSE 2008 - 2008 International Working Conference on Mining Software Repositories, MSR'08
PublisherIEEE Computer Society
Pages137-140
Number of pages4
ISBN (Print)9781605580241
DOIs
StatePublished - 2008
Event30th International Conference on Software Engineering, ICSE 2008 - 2008 International Working Conference on Mining Software Repositories, MSR'08 - Leipzig, Germany
Duration: May 10 2008May 11 2008

Publication series

NameProceedings - International Conference on Software Engineering
ISSN (Print)0270-5257

Conference

Conference30th International Conference on Software Engineering, ICSE 2008 - 2008 International Working Conference on Mining Software Repositories, MSR'08
Country/TerritoryGermany
CityLeipzig
Period5/10/085/11/08

Keywords

  • Mining software repositories
  • Open source
  • Temporal segmentation
  • Time series

ASJC Scopus subject areas

  • Software

Fingerprint

Dive into the research topics of 'Summarizing developer work history using time series segmentation'. Together they form a unique fingerprint.

Cite this