Subsequence matching on structured time series data

Huanmei Wu, Steve B. Jiang, Betty Salzberg, Hiroki Shirato, Gregory C. Sharp, David Kaeli

Research output: Chapter in Book/Report/Conference proceedingConference contribution

47 Scopus citations

Abstract

Subsequence matching in time series databases is a useful technique, with applications in pattern matching, prediction, and rule discovery. Internal structure within the time series data can be used to improve these tasks, and provide important insight into the problem domain. This paper introduces our research effort in using the internal structure of a time series directly in the matching process. This idea is applied to the problem domain of respiratory motion data in cancer radiation treatment. We propose a comprehensive solution for analysis, clustering, and online prediction of respiratory motion using subsequence similarity matching. In this system, a motion signal is captured in real time as a data stream, and is analyzed immediately for treatment and also saved in a database for future study. A piecewise linear representation of the signal is generated from a finite state model, and is used as a query for sub-sequence matching. To ensure that the query subsequence is representative, we introduce the concept of subsequence stability, which can be used to dynamically adjust the query subsequence length. To satisfy the special needs of similarity matching over breathing patterns, a new subsequence similarity measure is introduced. This new measure uses a weighted L 1 distance function to capture the relative importance of each source stream, amplitude, frequency, and proximity in time. From the subsequence similarity measure, stream and patient similarity can be denned, which are then used for offline and online applications. The matching results are analyzed and applied for motion prediction and correlation discovery. While our system has been customized for use in radiation therapy, our approach to time series modeling is general enough for application domains with structured time series data.

Original languageEnglish (US)
Title of host publicationProceedings of the ACM SIGMOD International Conference on Management of Data
EditorsJ. Widom, F. Ozcan, R. Chirkova
Pages682-693
Number of pages12
DOIs
StatePublished - 2005
EventSIGMOD 2005: ACM SIGMOD International Conference on Management of Data - Baltimore, MD, United States
Duration: Jun 14 2005Jun 16 2005

Other

OtherSIGMOD 2005: ACM SIGMOD International Conference on Management of Data
Country/TerritoryUnited States
CityBaltimore, MD
Period6/14/056/16/05

ASJC Scopus subject areas

  • General Computer Science

Fingerprint

Dive into the research topics of 'Subsequence matching on structured time series data'. Together they form a unique fingerprint.

Cite this