Abstract
Events shape the human world and can be regarded as semantic units, at different granularities, for organizing information. Because multilingual texts are so widely used, extracting events and temporal information from text plays an important role in big data information analytics. This paper surveys existing research on text-based event temporal resolution and reasoning, including event identification, temporal information resolution of events in English and Chinese texts, rule-based reasoning about temporal relations between events, and the relevant temporal representations. With scientific big data analytics in mind, we point out the shortcomings of existing work and discuss future research directions for advancing event identification, the establishment of temporal relations, and reasoning over temporal relations.
Acknowledgments
This research was partially supported by the National Natural Science Foundation of China (Grant Nos. 71503240 and 61371185), the Humanities and Social Sciences Planning Fund of the Ministry of Education (Grant No. 16YJA710007), and the National Key Technology R&D Program (Grant No. 2015BAH25F01).
Cite this article
Zhang, J., Yao, C., Sun, Y. et al. Building text-based temporally linked event network for scientific big data analytics. Pers Ubiquit Comput 20, 743–755 (2016). https://doi.org/10.1007/s00779-016-0940-x
DOI: https://doi.org/10.1007/s00779-016-0940-x