Abstract
Social sensing applications refer to those where individuals play an important role in data collection. They can act as sensor carriers (e.g., carrying GPS devices that share location data), sensor operators (e.g., taking pictures with smart phones), or as sensors themselves (e.g., sharing their observations on Twitter). The proliferation of sensors in the possession of the average individual, together with the popularity of social networks that allow massive information dissemination, heralds an era of social sensing that brings about new research challenges reviewed in this chapter.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
Regression is a technique from estimation theory, applied to continuous or inherently ordered parameters to predict continuous or ordered values. In contrast, prediction uses machine learning to predict unordered class labels.
- 2.
To make error values meaningful, we have normalized fuel consumption values to be zero mean and between \(-1\) and \(1\).
References
T. Abdelzaher et al., Mobiscopes for human spaces. IEEE Pervasive Comput. 6(2), 20–29 (2007)
T.F. Abdelzaher, Y. Anokwa, P. Boda, J. Burke, D. Estrin, L.J. Guibas, A. Kansal, S. Madden, J. Reich, Mobiscopes for human spaces. IEEE Pervasive Comput. 6(2), 20–29 (2007)
K. Aberer, Z. Despotovic, Managing trust in a peer-2-peer information system. in CIKM ’01: Proceedings of the Tenth International Conference on Information and Knowledge Management, pp. 310–317. ACM, New York, NY, USA, 2001
G. Adomavicius, A. Tuzhilin, Toward the next generation of recommender systems: a survey of the state-of-the-art and possible extensions. IEEE Trans. Knowl. Data Eng. 17(6), 734–749 (2005)
C. Aggarwal, T. Abdelzaher, Social sensing. in Managing and Mining Sensor Data (Kluwer Academic Publishers, Boston, 2013)
C. Aggarwal, T. Abdelzaher, Integrating sensors and social networks. Social Network Data Analytics (Springer, expected in 2011)
D. Agrawal, C.C. Aggarwal, On the design and quantification of privacy preserving data mining algorithms. in Proceedings of the 20th ACM SIGMOD Symposium on Principles of Database Systems, pp. 247–255, 2001
R. Agrawal, R. Srikant, Privacy preserving data mining. in Proceedings of ACM Conference on Management of Data, pp. 439–450, May 2000
H. Ahmadi, T. Abdelzaher, J. Han, N. Pham, R.K. Ganti. The sparse regression cube: a reliable modeling technique for open cyber-physical systems. in Proceedings of the 2011 IEEE/ACM Second International Conference on Cyber-Physical Systems, ICCPS ’11, pp. 87–96. IEEE Computer Society, Washington, DC, USA, 2011
H. Ahmadi, N. Pham, R. Ganti, T. Abdelzaher, S. Nath, J. Han, Privacy-aware regression modeling of participatory sensing data. in Proceedings of the 8th ACM Conference on Embedded Networked Sensor Systems, SenSys ’10, pp. 99–112. ACM, New York, NY, USA, 2010
R. Balakrishnan, Source rank: relevance and trust assessment for deep web sources based on inter-source agreement. in 20th World Wide Web Conference (WWW’11) 2011
L. Berti-Equille, A.D. Sarma, X. Dong, A. Marian, D. Srivastava, Sailing the information ocean with awareness of currents: discovery and application of source dependence. in CIDR’09 2009
J. Bilmes, A gentle tutorial on the em algorithm and its application to parameter estimation for gaussian mixture and hidden markov models. Technical report, University of Berkeley, ICSI-TR-97-021, 1997
A.T. Campbell, S.B. Eisenman, N.D. Lane, E. Miluzzo, R.A. Peterson, H. Lu, X. Zheng, M. Musolesi, K. Fodor, G.-S. Ahn, The rise of people-centric sensing. IEEE Internet Comput. 12(4), 12–21 (2008)
G. Casella, R. Berger, Statistical Inference (Duxbury Press, Pacific Grove, 2002)
S. Chaudhuri, U. Dayal, An overview of data warehousing and OLAP technology. SIGMOD Rec. 26, 65–74 (1997)
B.-C. Chen, L. Chen, Y. Lin, R. Ramakrishnan, Prediction cubes. in Proceedings 2005 International Conference Very Large Data Bases (VLDB’05), pp. 982–993, Trondheim, Norway, Aug 2005
Y. Chen, G. Dong, J. Han, J. Pei, B.W. Wah, J. Wang, Regression cubes with lossless compression and aggregation. IEEE Trans. Knowl. Data Eng. 18, 1585–1599 (2006)
Y. Chen, G. Dong, J. Han, B. W. Wah, J. Wang. Multi-dimensional regression analysis of time-series data streams. in Proceedings 2002 International Conference Very Large Data Bases (VLDB’02), pp. 323–334, Hong Kong, China, Aug 2002
D.J. Cook, L.B. Holder, Sensor selection to support practical use of health-monitoring smart environments. Wiley Interd. Rev. Data Min. Knowl. Discovery 1(4), 339–351 (2011)
H. Cramer. Mathematical Methods of Statistics (Princeton University Press, Princeton, 1946)
O. Dekel, O. Shamir, Vox populi: collecting high-quality labels from a crowd. in In Proceedings of the 22nd Annual Conference on Learning Theory 2009
S.A. Delre, W. Jager, M.A. Janssen, Diffusion dynamics in small-world networks with heterogeneous consumers. Comput. Math. Organ. Theory 13, 185–202 (2007)
A.P. Dempster, N.M. Laird, D.B. Rubin, Maximum likelihood from incomplete data via the em algorithm. J. Roy. Stat. Soc. B 39(1), 1–38 (1977)
X. Dong, L. Berti-Equille, Y. Hu, D. Srivastava, Global detection of complex copying relationships between sources. PVLDB 3(1), 1358–1369 (2010)
X. Dong, L. Berti-Equille, D. Srivastava, Truth discovery and copying detection in a dynamic world. VLDB 2(1), 562–573 (2009)
A. Doucet, N. De Freitas, N. Gordon, (eds.), Sequential Monte Carlo Methods, in Practice (Springer, New York, 2001)
R.O. Duda, P.E. Hart, D.G. Stork. Pattern Classification, 2nd edn. (Wiley-Interscience, New York, 2001)
S.B. Eisenman et al., The bikenet mobile sensing system for cyclist experience mapping. in Proceedings of SenSys, Nov 2007
A. Evfimievski, J. Gehrke, R. Srikant, Limiting privacy breaches in privacy preserving data mining. in Proceedings of the SIGMOD/PODS Conference, pp. 211–222, 2003
A. Galland, S. Abiteboul, A. Marian, P. Senellart, Corroborating information from disagreeing views. in WSDM, pp. 131–140, 2010
R. Ganti, N. Pham, Y.-E. Tsai, T. Abdelzaher, Poolview: stream privacy for grassroots participatory sensing. in ACM Sensys, Raleigh, NC, Nov 2008
R.K. Ganti, S. Srinivasan, A. Gacic, Multisensor fusion in smartphones for lifestyle monitoring. in Proceedings of the 2010 International Conference on Body Sensor Networks, BSN ’10, pp. 36–43. IEEE Computer Society, Washington, DC, USA, 2010
P. Gilbert, L.P. Cox, J. Jung, D. Wetherall, Toward trustworthy mobile sensing. in Proceedings of the Eleventh Workshop on Mobile Computing Systems and Applications, HotMobile ’10, pp. 31–36. ACM, New York, NY, USA, 2010
P. Gilbert, J. Jung, K. Lee, H. Qin, D. Sharkey, A. Sheth, L.P. Cox. Youprove: authenticity and fidelity in mobile sensing. in Proceedings of the 9th ACM Conference on Embedded Networked Sensor Systems, SenSys ’11, pp. 176–189. ACM, New York, NY, USA, 2011
J. Gray, S. Chaudhuri, A. Bosworth, A. Layman, D. Reichart, M. Venkatrao, F. Pellow, H. Pirahesh, Data cube: a relational aggregation operator generalizing group-by, cross-tab and sub-totals. Data Min. and Knowl. Discovery 1, 29–54 (1997)
V. Harinarayan, A. Rajaraman, J.D. Ullman. Implementing data cubes efficiently. in Proceedings 1996 ACM-SIGMOD International Conference Management of Data (SIGMOD’96), pp. 205–216, Montreal, Canada, June 1996
A. Helal, D.J. Cook, M. Schmalz, Smart home-based health platform for behavioral monitoring and alteration of diabetes patients. J. Diab. Sci. Technol. 3(1), 141–148 (2009)
K. Hoffman, D. Zage, C.N. Rotaru, A survey of attack and defense techniques for reputation systems. ACM Comput. Surv. 42(1), 1–31 (2009)
R.V. Hogg, A.T. Craig, Introduction to Mathematical Statistics (Prentice Hall, Upper Saddle River, 1995)
D. Houser, J. Wooders, Reputation in auctions: theory, and evidence from ebay. J. Econ. Manage. Strategy 15(2), 353–369 (2006)
J. Huang. Color-spatial image indexing and applications. Ph.D. thesis, Cornell University, 1998
J.-H. Huang, S. Amjad, S. Mishra, Cenwits: a sensor-based loosely coupled search and rescue system using witnesses. in Proceedings of SenSys, pp. 180–191, 2005
Z. Huang, W. Du, B. Chen, Deriving private information from randomized data. in Proceedings of the 2005 ACM SIGMOD Conference, pp. 37–48, Baltimore, MD, June 2005
C. Hui, M.K. Goldberg, M. Magdon-Ismail, W.A. Wallace, Simulating the diffusion of information: an agent-based modeling approach. in IJATS, pp. 31–46, 2010
B. Hull et al. Cartel: a distributed mobile sensor computing system. in Proceedings of SenSys, pp. 125–138, 2006
U.T. Inc, U.T.I. Staff, Solving Data Mining Problems Using Pattern Recognition Software with CDROM, 1st edn. (Prentice Hall PTR, Upper Saddle River, 1997)
J. Han, M. Kamber, J. Pei, Data Mining: Concepts and Techniques, 3rd edn. (Morgan Kaufman, San Francisco, 2011)
R.A. Johnson, D.W. Wichern, Applied Multivariate Statistical Analysis (Prentice-Hall, Inc., Upper Saddle River, 2002)
A. Jøsang, R. Ismail, C. Boyd, A survey of trust and reputation systems for online service provision. Decis. Support Syst. 43(2), 618–644 (2007)
R.E. Kalman, A new approach to linear filtering and prediction problems. Trans. ASME J. Basic Eng. 82(Series D), 35–45 (1960)
H. Kargutpa, S. Datta, Q. Wang, K. Sivakumar, On the privacy preserving properties of random data perturbation techniques. in Proceedings of the IEEE International Conference on Data Mining, pp. 99–106, 2003
J.M. Kleinberg, Authoritative sources in a hyperlinked environment. J. ACM 46(5), 604–632 (1999)
X. Li, J. Han, Z. Yin, J.-G. Lee, Y. Sun, Sampling cube: a framework for statistical OLAP over sampling data. in Proceedings 2008 ACM SIGMOD International Conference on Management of Data (SIGMOD’08), Vancouver, BC, Canada, June 2008
Q. Lian, Z. Zhang, M. Yang, B. Y. Zhao, Y. Dai, X. Li, An empirical study of collusion behavior in the maze p2p file-sharing system. in Proceedings of the 27th International Conference on Distributed Computing Systems, ICDCS ’07, p. 56. IEEE Computer Society, Washington, DC, USA, 2007
B. Longstaff, S. Reddy, D. Estrin, Improving activity classification for health applications on mobile devices using active and semi-supervised, learning, p. 6, 2010
A. Madan, M. Cebrian, D. Lazer, A. Pentland, Social sensing for epidemiological behavior change. in Proceedings of the 12th ACM International Conference on Ubiquitous Computing, Ubicomp ’10, pp. 291–300. ACM, New York, NY, USA, 2010
A. Madan, S.T. Moturu, D. Lazer, A. Pentland, Social sensing: obesity, unhealthy eating and exercise in face-to-face networks. in Wireless, Health, pp. 104–110, 2010
G.J. McLachlan, T. Krishnan, The Em Algorithm and Extensions (Wiley, New York, 1997)
E. Miluzzo, N.D. Lane, K. Fodor, R. Peterson, H. Lu, M. Musolesi, S.B. Eisenman, X. Zheng, A.T. Campbell, Sensing meets mobile social networks: the design, implementation and evaluation of the cenceme application. in Proceedings of the 6th ACM conference on Embedded network sensor systems, SenSys ’08, pp. 337–350. ACM, New York, NY, USA, 2008
M. Mun, S. Reddy, K. Shilton, N. Yau, J. Burke, D. Estrin, M. Hansen, E. Howard, R. West, P. Boda, Peir, the personal environmental impact report, as a platform for participatory sensing systems research. in Proceedings of the 7th International Conference on Mobile Systems, Applications, and Services, MobiSys ’09, pp. 55–68. ACM, New York, NY, USA, 2009
N. Mustapha, M. Jalali, M. Jalali, Expectation maximization clustering algorithm for user modeling in web usage mining systems. Eur. J. Sci. Res. 32(4), 467–476 (2009)
S. Nath, Ace: exploiting correlation for energy-efficient and continuous context sensing. in Proceedings of the Tenth International Conference on Mobile systems, Applications, and Services (MobiSys’12) 2012
T. Park, J. Lee, I. Hwang, C. Yoo, L. Nachman, J. Song, E-gesture: a collaborative architecture for energy-efficient gesture recognition with hand-worn sensor and mobile devices. in Proceedings of the 9th ACM Conference on Embedded Networked Sensor Systems, SenSys ’11, pp. 260–273. ACM, New York, NY, USA, 2011
J. Pasternack, D. Roth, Knowing what to believe (when you already know something). in International Conference on Computational Linguistics (COLING) 2010
N. Pham, R. Ganti, M.Y. Uddin, S. Nath, T. Abdelzaher, Privacy-preserving reconstruction of multidimensional data maps in vehicular participatory sensing. in EWSN, Coimbra, Portugal, Feb 2010
N. Pham, R.K. Ganti, Y.S. Uddin, S. Nath, T. Abdelzaher, Privacy-preserving reconstruction of multidimensional data maps in vehicular participatory sensing. in Proceedings of the 7th European Conference on Wireless Sensor Networks, EWSN’10, pp. 114–130. Springer-Verlag, Berlin, Heidelberg, 2010
D. Pomerantz, G. Dudek, Context dependent movie recommendations using a hierarchical bayesian model. in Proceedings of the 22nd Canadian Conference on Artificial Intelligence: Advances in Artificial Intelligence, Canadian AI ’09, pp. 98–109. Springer-Verlag, Berlin, Heidelberg, 2009
R. Ramakrishnan, B.-C. Chen, Exploratory mining in cube space. Data Min. Knowl. Discovery 15, 29–54 (2007)
S. Reddy, D. Estrin, M. Srivastava, Recruitment framework for participatory sensing data collections. in Proceedings of the 8th International Conference on Pervasive Computing, pp. 138–155. Springer, Berlin, Heidelberg, May 2010
S. Reddy, K. Shilton, G. Denisov, C. Cenizal, D. Estrin, M. Srivastava, Biketastic: sensing and mapping for better biking. in Proceedings of the 28th International Conference on Human Factors in Computing Systems, CHI ’10, pp. 1817–1820. ACM, New York, NY, USA, 2010
Sense Networks. Cab Sense. http://www.cabsense.com
V.S. Sheng, F. Provost, P.G. Ipeirotis, Get another label? Improving data quality and data mining using multiple, noisy labelers. in Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD ’08, pp. 614–622. ACM, New York, NY, USA, 2008
Y. Sun, Y. Yu, J. Han, Ranking-based clustering of heterogeneous information networks with star network schema. in 15th SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD’09), pp. 797–806, 2009
P.-N. Tan, M. Steinbach, V. Kumar, Introduction to Data Mining (Addison Wesley, Boston, 2005)
A. Thiagarajan, J. Biagioni, T. Gerlich, J. Eriksson, Cooperative transit tracking using smart-phones. in SenSys’10, pp. 85–98, 2010
A.N. Tikhonov, V.Y. Arsenin, Solution of Ill Posed Problems (V. H. Winstons and Sons, Washington, 1977)
D. Wang, T. Abdelzaher, H. Ahmadi, J. Pasternack, D. Roth, M. Gupta, J. Han, O. Fatemieh, H. Le, On bayesian interpretation of fact-finding in information networks. in 14th International Conference on Information Fusion (Fusion 2011), 2011
D. Wang, H. Ahmadi, T. Abdelzaher, H. Chenji, R. Stoleru, C. Aggarwal, Optimizing quality-of-information in cost-sensitive sensor data fusion. in IEEE 7th International Conference on Distributed Computing in Sensor Systems (DCoSS 11), June 2011
D. Wang, L. Kaplan, T. Abdelzaher, C. Aggarwal, On scalability and robustness limitations of real and asymptotic confidence bounds in social sensing. in 9th Annual IEEE Communications Society Conference on Sensor, Mesh and Ad Hoc Communications and Networks (SECON), Seoul, Korea, June 2012
D. Wang, H. Le, L. Kaplan, T. Abdelzaher. On truth discovery in social sensing: a maximum likelihood estimation approach. in 11th ACM/IEEE Conference on Information Processing in Sensor Networks (IPSN), April 2012
M.E. Whitman, H.J. Mattord, Principles of Information Security (Course Technology Press, Boston, 2004)
C.F.J. Wu, On the convergence properties of the EM algorithm. Ann. Stat. 11(1), 95–103 (1983)
J. Xie, S. Sreenivasan, G. Korniss, W. Zhang, C. Lim, B.K. Szymanski, Social consensus through the influence of committed minorities. CoRR, abs/1102.3931, 2011
Z. Yang, S. Zhong, R.N. Wright, Privacy-preserving classification of customer data without loss of accuracy. in Proceedings of SIAM International Conference on Data Mining, pp. 92–102, 2005
X. Yin, J. Han, P.S. Yu, Truth discovery with multiple conflicting information providers on the web. IEEE Trans. Knowl. Data Eng. 20, 796–808 (2008)
X. Yin, W. Tan, Semi-supervised truth discovery. in WWW. ACM, New York, NY, USA, 2011
H. Yu, M. Kaminsky, P.B. Gibbons, A. Flaxman, Sybilguard: defending against sybil attacks via social networks. SIGCOMM Comput. Commun. Rev. 36, 267–278 (2006)
C. Zhai, A note on the expectation maximization (em) algorithm. University of Illinois at Urbana Champaign, Department of Computer Scinece, 2007
B. Zhao, B.I.P. Rubinstein, J. Gemmell, J. Han, A Bayesian approach to discovering truth from conflicting sources for data integration. Proc. VLDB Endow. 5(6), 550–561 (2012)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Abdelzaher, T., Wang, D. (2014). Analytic Challenges in Social Sensing. In: Ammari, H. (eds) The Art of Wireless Sensor Networks. Signals and Communication Technology. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-40066-7_16
Download citation
DOI: https://doi.org/10.1007/978-3-642-40066-7_16
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-40065-0
Online ISBN: 978-3-642-40066-7
eBook Packages: EngineeringEngineering (R0)