Skip to main content
Log in

Comprehensive and efficient discovery of time series motifs

  • Published:
Journal of Zhejiang University SCIENCE C Aims and scope Submit manuscript

Abstract

Time series motifs are previously unknown, frequently occurring patterns in time series or approximately repeated subsequences that are very similar to each other. There are two issues in time series motifs discovery, the deficiency of the definition of K-motifs given by Lin et al. (2002) and the large computation time for extracting motifs. In this paper, we propose a relatively comprehensive definition of K-motifs to obtain more valuable motifs. To minimize the computation time as much as possible, we extend the triangular inequality pruning method to avoid unnecessary operations and calculations, and propose an optimized matrix structure to produce the candidate motifs almost immediately. Results of two experiments on three time series datasets show that our motifs discovery algorithm is feasible and efficient.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  • Abe, H., Ohsaki, M., Yokoi, H., Yamaguchi, T., 2005. Implementing an integrated time-series data mining environment based on temporal pattern extraction methods: a case study of an interferon therapy risk mining for chronic hepatitis. LNCS, 4012:425–435. [doi:10.1007/11780496_45]

    Google Scholar 

  • André-Jönsson, H., Badal, D.Z., 1997. Using Signature Files for Querying Time-Series Data. Practice of Knowledge Discovery in Databases, 1263:211–220.

    Google Scholar 

  • Beaudoin, P., Coros, S., van de Panne, M., Poulin, P., 2008. Motion-Motif Graphs. Symp. on Computer Animation, p.117–126.

  • Chiu, B.Y., Keogh, E.J., Lonardi, S., 2003. Probabilistic Discovery of Time Series Motifs. ACM SIGKDD Int. Conf. on Knowledge Discovery and Data Mining, p.493–498.

  • Ding, H., Trajcevski, G., Scheuermann, P., Wang, X.Y., Keogh, E.J., 2008. Querying and Mining of Time Series Data: Experimental Comparison of Representations and Distance Measures. Proc. Int. Conf. on Very Large Data Bases, 1(2):1542–1552.

    Google Scholar 

  • Ferreira, P.G., Azevedo, P.J., Silva, C.G., Brito, R.M.M., 2006. Mining approximate motifs in time series. Discov. Sci., 4265:89–101. [doi:10.1007/11893318_12]

    Article  Google Scholar 

  • Guyet, T., Garbay, C., Dojat, M., 2007. Knowledge construction from time series data using a collaborative exploration system. J. Biomed. Inform., 40(6):672–687. [doi:10.1016/j.jbi.2007.09.006]

    Article  Google Scholar 

  • Hegland, M., Clarke, W., Kahn, M., 2001. Mining the Macho dataset. Comput. Phys. Commun., 142(1–3):22–28. [doi:10.1016/S0010-4655(01)00307-1]

    Article  MATH  Google Scholar 

  • Lin, J., Keogh, E., Lonardi, S., Patel, P., 2002. Finding Motifs in Time Series. 2nd Workshop on Temporal Data Mining at the 8th ACM SIGKDD Int. Conf. on Knowledge Discovery and Data Mining, p.53–68.

  • Mueen, A., Keogh, E.J., Zhu, Q., Cash, S., Westover, M.B., 2009. Exact Discovery of Time Series Motifs. Society for Industrial and Applied Mathematics Conf. on Data Mining, p.473–484.

  • Tanaka, Y., Iwamoto, K., Uehara, K., 2005. Discovery of time-series motif from multi-dimensional data based on MDL principle. Mach. Learn., 58(2–3):269–300. [doi:10.1007/s10994-005-5829-2]

    Article  MATH  Google Scholar 

  • Ueno, K., Xi, X.P., Keogh, E.J., Lee, D.J., 2006. Anytime Classification Using the Nearest Neighbor Algorithm with Applications to Stream Mining. IEEE Int. Conf. on Data Mining, p.623–632.

  • Xu, X.K., Zhang, J., Small, M., 2008. Superfamily phenomena and motifs of networks induced from time series. PNAS, 105(50):19601–19605. [doi:10.1073/pnas.0806082105]

    Article  MATH  MathSciNet  Google Scholar 

  • Yi, B.K., Faloutsos, C., 2000. Fast Time Sequence Indexing for Arbitrary L p Norms. Int. Conf. on Very Large Data Bases, p.385–394.

  • Zhang, J., Small, M., 2006. Complex network from pseudoperiodic time series: topology versus dynamics. Phys. Rev. Lett., 96:238701. [doi:10.1103/PhysRevLett.96.238701]

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Lian-hua Chi.

Additional information

Project supported by the “Nuclear High Base” National Science and Technology Major Project (No. 2010ZX01042-001-003), the National Basic Research Program (973) of China (No. 2007CB310804), and the National Natural Science Foundation of China (No. 61173061)

Rights and permissions

Reprints and permissions

About this article

Cite this article

Chi, Lh., Chi, Hh., Feng, Yc. et al. Comprehensive and efficient discovery of time series motifs. J. Zhejiang Univ. - Sci. C 12, 1000–1009 (2011). https://doi.org/10.1631/jzus.C1100037

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1631/jzus.C1100037

Key words

CLC number

Navigation