Developmental Learning for User Activities

Huang, Xiao; Weng, Juyang; Zhang, Zhengyou

doi:10.1007/978-3-642-30574-0_58

Part of the book series: Springer Handbooks (SHB)

Abstract

This chapter presents a brain-inspired developmental learning system. A personal computer lives with its human user as long as the power is on. It can learn to report some of the user's activities by acting as a shadow machine: a virtual machine that runs in the background while the user carries on with regular activities, on or off the computer. The goal of the teacher of this shadow machine is to enable it to observe the user's status, recognize the user's activities, and produce the taught actions as the desired reports. The shadow machine uses both visual and acoustic context to infer the user's activities (e.g., in an office). A major challenge is that the system must be applicable to open domains, without a handcrafted environmental model. That is, there are no handcrafted constraints on office lighting, size, or setting, and no requirement to use a head-mounted close-talk microphone. A room microphone sits somewhere near the computer, and the distance between the sound sources and the microphone varies significantly. The system is designed to respond to its sensory inputs. A more challenging issue is making the system adapt to different users and different environments. Instead of building in all the world knowledge in advance (which is intractable), the system relies on its adaptive capability to learn sensorimotor associations (which is tractable). The real-time prototype system has been tested in different office environments.

Abbreviations

BBN: Bayesian belief network
FFT: fast Fourier transform
FPS: frames per second
GSO: Gram–Schmidt orthogonalization
HMM: hidden Markov model
IDFT: inverse discrete Fourier transform
IHDR: incremental hierarchical discriminant regression
LDA: linear discriminant analysis
LHMM: layered hidden Markov model
MDP: Markov decision process
MFCC: mel-frequency cepstral coefficient
ODMDP: observation-driven Markov decision process
POMDP: partially observable MDP
log: logistic regression

Author information

Authors and Affiliations

Room 1214, City Center Plaza, 555 110th St. NE, 98155, Bellevue, WA, USA
Xiao Huang
Department of Computer Science and Engineering, Michigan State University, 48824, East Lansing, MI, USA
Juyang Weng
Microsoft Research, Microsoft, One Microsoft Way, 98052, Redmond, WA, USA
Zhengyou Zhang

Corresponding authors

Correspondence to Juyang Weng or Zhengyou Zhang.

Editor information

Editors and Affiliations

KEDRI – Knowledge Engineering and Discovery Research Institute, Auckland University of Technology, 120 Mayoral Drive, 1010, Auckland, New Zealand
Nikola Kasabov

About this chapter

Cite this chapter

Huang, X., Weng, J., Zhang, Z. (2014). Developmental Learning for User Activities. In: Kasabov, N. (ed.) Springer Handbook of Bio-/Neuroinformatics. Springer Handbooks. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-30574-0_58

DOI: https://doi.org/10.1007/978-3-642-30574-0_58
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-30573-3
Online ISBN: 978-3-642-30574-0
eBook Packages: Engineering, Engineering (R0)
