ABSTRACT
Although most time-series data mining research has concentrated on providing solutions for a single distance function, in this work we motivate the need for a single index structure that can support multiple distance measures. Our specific area of interest is the efficient retrieval and analysis of trajectory similarities. Trajectory datasets are very common in environmental applications, mobility experiments, video surveillance and are especially important for the discovery of certain biological patterns. Our primary similarity measure is based on the Longest Common Subsequence (LCSS) model, that offers enhanced robustness, particularly for noisy data, which are encountered very often in real world applications. However, our index is able to accommodate other distance measures as well, including the ubiquitous Euclidean distance, and the increasingly popular Dynamic Time Warping (DTW). While other researchers have advocated one or other of these similarity measures, a major contribution of our work is the ability to support all these measures without the need to restructure the index. Our framework guarantees no false dismissals and can also be tailored to provide much faster response time at the expense of slightly reduced precision/recall. The experimental results demonstrate that our index can help speed-up the computation of expensive similarity measures such as the LCSS and the DTW.
- J. Aach and G. Church. Aligning gene expression time series with time warping algorithms. In Bioinformatics, Volume 17, pages 495--508, 2001.]]Google ScholarCross Ref
- O. Arikan and D. Forsyth. Interactive motion generation from examples. In Proc. of ACM SIGGRAPH, 2002.]] Google ScholarDigital Library
- Z. Bar-Joseph, G. Gerber, D. Gifford, T. Jaakkola, and I. Simon. A new approach to analyzing gene expression time series data. In Proc. of 6th RECOMB, pages 39--48, 2002.]] Google ScholarDigital Library
- D. Berndt and J. Clifford. Using Dynamic Time Warping to Find Patterns in Time Series. In Proc. of KDD Workshop, 1994.]]Google Scholar
- M. Betke, J. Gips, and P. Fleming. The camera mouse: Visual tracking of body features to provide computer access for people with severe disabilities. In IEEE Transactions on Neural Systems and Rehabilitation Engineering, Vol. 10, No. 1, 2002.]]Google ScholarCross Ref
- G. Das, D. Gunopulos, and H. Mannila. Finding Similar Time Series. In Proc. of the First PKDD Symp., pages 88--100, 1997.]] Google ScholarDigital Library
- D. Gavrila and L. Davis. Towards 3-d model-based tracking and recognition of human movement: a multi-view approach. In Int. Workshop on Face and Gesture Recognition.]]Google Scholar
- M. Hadjieleftheriou, G. Kollios, V. Tsotras, and D. Gunopulos. Efficient indexing of spatiotemporal objects. In Proc. of 8th EDBT, 2002.]] Google ScholarDigital Library
- T. Kahveci, A. Singh, and A. Gurel. Similarity searching for multi-attribute sequences. In Proc. of SSDBM, 2002.]] Google ScholarDigital Library
- E. Keogh. Exact indexing of dynamic time warping. In Proc. of VLDB, 2002.]]Google ScholarDigital Library
- E. Keogh and S. Kasetty. On the need for time series data mining benchmarks: A survey and empirical demonstration. In Proc. of SIGKDD, 2002.]] Google ScholarDigital Library
- S. Kim, S. Park, and W. Chu. An index-based approach for similarity search supporting time warping in large sequence databases. In In Proc. of 17th ICDE, 2001.]] Google ScholarDigital Library
- Z. Kovács-Vajna. A fingerprint verification system based on triangular matching and dynamic time warping. In IEEE Transactions on PAMI, Vol. 22, No. 11.]] Google ScholarDigital Library
- S.-L. Lee, S.-J. Chun, D.-H. Kim, J.-H. Lee, and C.-W. Chung. Similarity Search for Multidimensional Data Sequences. Proc. of ICDE, pages 599--608, 2000.]] Google ScholarDigital Library
- S. Park, W. Chu, J. Yoon, and C. Hsu. Efficient Searches for Similar Subsequences of Different Lengths in Sequence Databases. In Proc. of ICDE, pages 23--32, 2000.]] Google ScholarDigital Library
- T. Rath and R. Manmatha. Word image matching using dynamic time warping. In Tec Report MM-38. Center for Intelligent Information Retrieval, University of Massachusetts Amherst, 2002.]]Google Scholar
- J. F. Roddick and K. Hornsby. Temporal, Spatial and Spatio-Temporal Data Mining. 2000.]] Google ScholarDigital Library
- M. Shimada and K. Uehara. Discovery of correlation from multi-stream of human motion. In Discovery Science 2000.]] Google ScholarDigital Library
- R. E. Valdes-Perez and C. A. Stone. Systematic detection of subtle spatio-temporal patterns in time-lapse imaging ii. particle migrations. In Bioimaging 6(2), pages 71--78, 1998.]]Google ScholarCross Ref
- M. Vlachos, G. Kollios, and D. Gunopulos. Discovering similar multidimensional trajectories. In Proc. of ICDE, 2002.]]Google ScholarCross Ref
- B.-K. Yi, H. V. Jagadish, and C. Faloutsos. Efficient retrieval of similar time sequences under time warping. In Proc. of ICDE, pages 201--208, 1998.]] Google ScholarDigital Library
- Y. Zhu and D. Shasha. Query by humming: a time series database approach. In Proc. of SIGMOD, 2003.]]Google Scholar
Index Terms
- Indexing multi-dimensional time-series with support for multiple distance measures
Recommendations
Indexing Multidimensional Time-Series
While most time series data mining research has concentrated on providing solutions for a single distance function, in this work we motivate the need for an index structure that can support multiple distance measures. Our specific area of interest is the ...
The influence of global constraints on similarity measures for time-series databases
A time series consists of a series of values or events obtained over repeated measurements in time. Analysis of time series represents an important tool in many application areas, such as stock-market analysis, process and quality control, observation ...
Elastic similarity and distance measures for multivariate time series
AbstractThis paper contributes multivariate versions of seven commonly used elastic similarity and distance measures for time series data analytics. Elastic similarity and distance measures can compensate for misalignments in the time axis of time series ...
Comments