skip to main content
10.1145/3452296.3472907acmconferencesArticle/Chapter ViewAbstractPublication PagescommConference Proceedingsconference-collections
research-article
Public Access

Personalizing head related transfer functions for earables

Published:09 August 2021Publication History

ABSTRACT

Head related transfer functions (HRTF) describe how sound signals bounce, scatter, and diffract when they arrive at the head, and travel towards the ears. HRTFs produce distinct sound patterns that ultimately help the brain infer the spatial properties of the sound, such as its direction of arrival, 𝜃. If an earphone can learn the HRTF, it could apply the HRTF to any sound and make that sound appear directional to the user. For instance, a directional voice guide could help a tourist navigate a new city. While past works have estimated human HRTFs, an important gap lies in personalization. Today's HRTFs are global templates that are used in all products; since human HRTFs are unique, a global HRTF only offers a coarse-grained experience. This paper shows that by moving a smartphone around the head, combined with mobile acoustic communications between the phone and the earbuds, it is possible to estimate a user's personal HRTF. Our personalization system, UNIQ, combines techniques from channel estimation, motion tracking, and signal processing, with a focus on modeling signal diffraction on the curvature of the face. The results are promising and could open new doors into the rapidly growing space of immersive AR/VR, earables, smart hearing aids, etc.

Skip Supplemental Material Section

Supplemental Material

video-presentation.mp4

mp4

175.3 MB

References

  1. 2015. The Sound Professionals. Retrieved Jan 26, 2021 from https://www. soundprofessionals.com/cgi-bin/gold/item/SP-TFB-2Google ScholarGoogle Scholar
  2. 2015. Wave Interactions and Interference. Retrieved Jan 24, 2021 from https://www.ck12.org/section/wave-interactions-and-interference-%3a%3aof% 3a%3a-waves-%3a%3aof%3a%3a-ck-12-physical-science-for-middle-school/Google ScholarGoogle Scholar
  3. 2017. Beyond Surround Sound: Audio Advances in VR. Retrieved Jan 24, 2021 from https://www.oculus.com/blog/beyond-surround-sound-audio-advances in-vr/Google ScholarGoogle Scholar
  4. 2017. Near-field 3D Audio Explained. Retrieved Jun 11, 2021 from https: //developer.oculus.com/blog/near-field-3d-audio-explained/Google ScholarGoogle Scholar
  5. 2018. Simulating Dynamic Soundscapes at Facebook Reality Labs. Retrieved Jan 26, 2021 from https://www.oculus.com/blog/simulating-dynamic-soundscapes at-facebook-reality-labs/Google ScholarGoogle Scholar
  6. 2019. Audio in mixed reality. Retrieved Jan 24, 2021 from https://docs.microsoft. com/en-us/windows/mixed-reality/design/spatial-soundGoogle ScholarGoogle Scholar
  7. 2019. Mach1 will provide spatial audio for Bose's AR platform. Retrieved Jan 24, 2021 from https://venturebeat.com/2019/12/18/mach1-will-provide-spatial audio-for-boses-ar-platform/Google ScholarGoogle Scholar
  8. 2020. Apple brings surround sound and Dolby Atmos to AirPods Pro. Re trieved Jan 24, 2021 from https://thenextweb.com/plugged/2020/06/22/apple brings-surround-sound-and-dolby-atmos-to-airpods-pro/Google ScholarGoogle Scholar
  9. 2020. Diffraction. Retrieved Jan 24, 2021 from https://en.wikipedia.org/wiki/ DiffractionGoogle ScholarGoogle Scholar
  10. 2020. Inside Facebook Reality Labs Research: The Future of Audio. Retrieved Jan 24, 2021 from https://about.fb.com/news/2020/09/facebook-reality-labs-research future-of-audio/Google ScholarGoogle Scholar
  11. 2020. Xiaomi United States. Retrieved Jan 26, 2021 from https://www.mi.com/us/Google ScholarGoogle Scholar
  12. 2021. DIY HRTF measurement using an iPhone. Retrieved Jun 11, 2021 from https://www.earfish.eu/sites/default/files/2018-01/DIY_earfish_iPhone_0.pdfGoogle ScholarGoogle Scholar
  13. 2021. Equal-loudness contour. Retrieved Jan 24, 2021 from https://en.wikipedia. org/wiki/Equal-loudness_contourGoogle ScholarGoogle Scholar
  14. Ishwarya Ananthabhotla, Vamsi Krishna Ithapu, and W Owen Brimijoin. 2021. A framework for designing head-related transfer function distance metrics that capture localization perception. JASA Express Letters 1, 4 (2021), 044401.Google ScholarGoogle ScholarCross RefCross Ref
  15. Jeffrey R Blum, Mathieu Bouchard, and Jeremy R Cooperstock. 2011. What's around me? Spatialized audio augmented reality for blind users with a smart phone. In International Conference on Mobile and Ubiquitous Systems: Computing, Networking, and Services. Springer, 49--62.Google ScholarGoogle Scholar
  16. C Phillip Brown and Richard O Duda. 1997. An efficient HRTF model for 3-D sound. In Proceedings of 1997 Workshop on Applications of Signal Processing to Audio and Acoustics. IEEE, 4--pp.Google ScholarGoogle ScholarCross RefCross Ref
  17. Thibaut Carpentier, Hélène Bahu, Markus Noisternig, and Olivier Warusfel. 2014. Measurement of a head-related transfer function database with high spatial resolution. In 7th Forum Acusticum (EAA).Google ScholarGoogle Scholar
  18. Jorge Dávila-Chacón, Jindong Liu, and Stefan Wermter. 2018. Enhanced robot speech recognition using biomimetic binaural sound source localization. IEEE transactions on neural networks and learning systems 30, 1 (2018), 138--150.Google ScholarGoogle Scholar
  19. Hossein Falaki, Ratul Mahajan, Srikanth Kandula, Dimitrios Lymberopoulos, Ramesh Govindan, and Deborah Estrin. 2010. Diversity in smartphone usage. In Proceedings of the 8th international conference on Mobile systems, applications, and services. 179--194.Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Yang Gao, Wei Wang, Vir V Phoha, Wei Sun, and Zhanpeng Jin. 2019. EarEcho: Using Ear Canal Echo for Wearable Authentication. Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies 3, 3 (2019), 1--24.Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. William G Gardner. 2005. Spatial audio reproduction: Towards individualized binaural sound. In Frontiers of Engineering:: Reports on Leading-Edge Engineering from the 2004 NAE Symposium on Frontiers of Engineering, Vol. 34. 113.Google ScholarGoogle Scholar
  22. William G Gardner and Keith D Martin. 1995. HRTF measurements of a KEMAR. The Journal of the Acoustical Society of America 97, 6 (1995), 3907--3908.Google ScholarGoogle ScholarCross RefCross Ref
  23. Reza Ghaffarivardavagh, Sayed Saad Afzal, Osvy Rodriguez, and Fadel Adib. 2020. Ultra-wideband underwater backscatter via piezoelectric metamaterials. In Proceedings of the Annual conference of the ACM Special Interest Group on Data Communication on the applications, technologies, architectures, and protocols for computer communication. 722--734.Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. Yasaman Ghasempour, Chia-Yi Yeh, Rabi Shrestha, Yasith Amarasinghe, Daniel Mittleman, and Edward W Knightly. 2020. LeakyTrack: non-coherent single antenna nodal and environmental mobility tracking with a leaky-wave antenna. In Proceedings of the 18th Conference on Embedded Networked Sensor Systems. 56--68.Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. Michael M Goodwin and Jean-Marc Jot. 2007. Binaural 3-D audio rendering based on spatial audio scene coding. In Audio Engineering Society Convention 123. Audio Engineering Society.Google ScholarGoogle Scholar
  26. Michael M Goodwin, Jean-Marc Jot, and Mark Dolson. 2013. Spatial audio analysis and synthesis for binaural reproduction and format conversion. US Patent 8,374,365.Google ScholarGoogle Scholar
  27. Corentin Guezenoc and Renaud Seguier. 2020. HRTF individualization: A survey. arXiv preprint arXiv:2003.06183 (2020).Google ScholarGoogle Scholar
  28. Nail A Gumerov, Ramani Duraiswami, and Dmitry N Zotkin. 2007. Fast multipole accelerated boundary elements for numerical computation of the head related transfer function. In 2007 IEEE International Conference on Acoustics, Speech and Signal Processing-ICASSP'07, Vol. 1. IEEE, I--165.Google ScholarGoogle ScholarCross RefCross Ref
  29. Hongmei Hu, Lin Zhou, Hao Ma, and Zhenyang Wu. 2008. HRTF personalization based on artificial neural network in individual virtual auditory space. Applied Acoustics 69, 2 (2008), 163--172.Google ScholarGoogle ScholarCross RefCross Ref
  30. Sungmok Hwang, Youngjin Park, and Younsik Park. 2007. Sound direction estima tion using artificial ear. In 2007 International Conference on Control, Automation and Systems. IEEE, 1906--1910.Google ScholarGoogle ScholarCross RefCross Ref
  31. C Jackman, M Zampino, D Cadge, R Dravida, V Katiyar, and J Lewis. 2009. Esti mating acoustic performance of a cell phone speaker using Abaqus. In SIMULIA Customer Conference. 14--21.Google ScholarGoogle Scholar
  32. Cheol-Taek Kim, Tae-Yong Choi, ByongSuk Choi, and Ju-Jang Lee. 2008. Robust estimation of sound direction for robot interface. In 2008 IEEE International Conference on Robotics and Automation. IEEE, 3475--3480.Google ScholarGoogle Scholar
  33. Lin Li and Qinghua Huang. 2013. HRTF personalization modeling based on RBF neural network. In 2013 IEEE International Conference on Acoustics, Speech and Signal Processing. IEEE, 3707--3710.Google ScholarGoogle ScholarCross RefCross Ref
  34. Zhihong Luo, Qiping Zhang, Yunfei Ma, Manish Singh, and Fadel Adib. 2019. 3D backscatter localization for fine-grained robotics. In 16th {USENIX} Symposium on Networked Systems Design and Implementation ({NSDI} 19). 765--782.Google ScholarGoogle Scholar
  35. Wenguang Mao, Wei Sun, Mei Wang, and Lili Qiu. 2020. DeepRange: Acous tic Ranging via Deep Learning. Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies 4, 4 (2020), 1--23.Google ScholarGoogle ScholarDigital LibraryDigital Library
  36. Alok Meshram, Ravish Mehra, Hongsheng Yang, Enrique Dunn, Jan-Michael Franm, and Dinesh Manocha. 2014. P-HRTF: Efficient personalized HRTF com putation for high-fidelity spatial sound. In 2014 IEEE International Symposium on Mixed and Augmented Reality (ISMAR). IEEE, 53--61.Google ScholarGoogle ScholarCross RefCross Ref
  37. Yan Michalevsky, Aaron Schulman, Gunaa Arumugam Veerapandian, Dan Boneh, and Gabi Nakibly. 2015. Powerspy: Location tracking using mobile device power analysis. In 24th {USENIX} Security Symposium ({USENIX} Security 15). 785--800.Google ScholarGoogle Scholar
  38. Philip M Morse and Pearl J Rubenstein. 1938. The diffraction of waves by ribbons and by slits. Physical Review 54, 11 (1938), 895.Google ScholarGoogle ScholarCross RefCross Ref
  39. Rajalakshmi Nandakumar, Krishna Kant Chintalapudi, Venkat Padmanabhan, and Ramarathnam Venkatesan. 2013. Dhwani: secure peer-to-peer acoustic NFC. ACM SIGCOMM Computer Communication Review 43, 4 (2013), 63--74.Google ScholarGoogle ScholarDigital LibraryDigital Library
  40. Takanori Nishino, Sumie Mase, Shoji Kajita, Kazuya Takeda, and Fumitada Itakura. 1996. Interpolating HRTF for auditory virtual reality. Ph.D. Dissertation. Acoustical Society of America.Google ScholarGoogle Scholar
  41. Chunyi Peng, Guobin Shen, Yongguang Zhang, Yanlin Li, and Kun Tan. 2007. Beepbeep: a high accuracy acoustic ranging system using cots mobile devices. In Proceedings of the 5th international conference on Embedded networked sensor systems. 1--14.Google ScholarGoogle ScholarDigital LibraryDigital Library
  42. Ming-Zher Poh, Kyunghee Kim, Andrew D Goessling, Nicholas C Swenson, and Rosalind W Picard. 2009. Heartphones: Sensor earphones and mobile applica tion for non-obtrusive health monitoring. In 2009 International Symposium on Wearable Computers. IEEE, 153--154.Google ScholarGoogle ScholarDigital LibraryDigital Library
  43. Swadhin Pradhan, Ghufran Baig, Wenguang Mao, Lili Qiu, Guohai Chen, and Bo Yang. 2018. Smartphone-based acoustic indoor space mapping. Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies 2, 2 (2018), 1--26.Google ScholarGoogle ScholarDigital LibraryDigital Library
  44. Jay Prakash, Zhijian Yang, Yu-Lin Wei, and Romit Roy Choudhury. 2019. STEAR: Robust Step Counting from Earables. In Proceedings of the 1st International Work shop on Earable Computing. 36--41.Google ScholarGoogle ScholarDigital LibraryDigital Library
  45. Niklas Röber, Sven Andres, and Maic Masuch. 2006. HRTF simulations through acoustic raytracing. Universitäts-und Landesbibliothek Sachsen-Anhalt.Google ScholarGoogle Scholar
  46. Sheng Shen, Daguan Chen, Yu-Lin Wei, Zhijian Yang, and Romit Roy Choudhury. 2020. Voice localization using nearby wall reflections. In Proceedings of the 26th Annual International Conference on Mobile Computing and Networking. 1--14.Google ScholarGoogle ScholarDigital LibraryDigital Library
  47. Tzu-Chun Tai, Kate Ching-Ju Lin, and Yu-Chee Tseng. 2019. Toward reliable local ization by unequal AoA tracking. In Proceedings of the 17th Annual International Conference on Mobile Systems, Applications, and Services. 444--456.Google ScholarGoogle ScholarDigital LibraryDigital Library
  48. Jelmer Tiete, Federico Domínguez, Bruno Da Silva, Laurent Segers, Kris Steenhaut, and Abdellah Touhafi. 2014. SoundCompass: a distributed MEMS microphone array-based sensor for sound source localization. Sensors 14, 2 (2014), 1918--1949.Google ScholarGoogle ScholarCross RefCross Ref
  49. Edgar A Torres-Gallegos, Felipe Orduna-Bustamante, and Fernando Arámbula Cosío. 2015. Personalization of head-related transfer functions (hrtf) based on automatic photo-anthropometry and inference from a database. Applied Acoustics 97 (2015), 84--95.Google ScholarGoogle ScholarCross RefCross Ref
  50. J-M Valin, François Michaud, Jean Rouat, and Dominic Létourneau. 2003. Robust sound source localization using a microphone array on a mobile robot. In Pro ceedings 2003 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2003)(Cat. No. 03CH37453), Vol. 2. IEEE, 1228--1233.Google ScholarGoogle Scholar
  51. Lars Falck Villemoes and Dirk Jeroen Breebaart. 2012. Method and apparatus for generating a binaural audio signal. US Patent 8,265,284.Google ScholarGoogle Scholar
  52. Jeff Wilson, Bruce N Walker, Jeffrey Lindsay, Craig Cambias, and Frank Dellaert. 2007. Swan: System for wearable audio navigation. In 2007 11th IEEE international symposium on wearable computers. IEEE, 91--98.Google ScholarGoogle ScholarDigital LibraryDigital Library
  53. Jens Windau and Laurent Itti. 2016. Walking compass with head-mounted IMU sensor. In 2016 IEEE International Conference on Robotics and Automation (ICRA). IEEE, 5542--5547.Google ScholarGoogle ScholarDigital LibraryDigital Library
  54. Zhijian Yang, Yu-Lin Wei, Sheng Shen, and Romit Roy Choudhury. 2020. Ear-AR: indoor acoustic augmented reality on earphones. In Proceedings of the 26th Annual International Conference on Mobile Computing and Networking. 1--14.Google ScholarGoogle ScholarDigital LibraryDigital Library
  55. Guangzheng Yu, Ruixing Wu, Yu Liu, and Bosun Xie. 2018. Near-field head related transfer-function measurement and database of human subjects. The Journal of the Acoustical Society of America 143, 3 (2018), EL194--EL198.Google ScholarGoogle ScholarCross RefCross Ref
  56. Yanzi Zhu, Yibo Zhu, Ben Y Zhao, and Haitao Zheng. 2015. Reusing 60ghz radios for mobile radar imaging. In Proceedings of the 21st Annual International Conference on Mobile Computing and Networking. 103--116.Google ScholarGoogle ScholarDigital LibraryDigital Library
  57. Harald Ziegelwanger, Wolfgang Kreuzer, and Piotr Majdak. 2016. A priori mesh grading for the numerical calculation of the head-related transfer functions. Applied Acoustics 114 (2016), 99--110.Google ScholarGoogle ScholarCross RefCross Ref
  58. DYN Zotkin, Jane Hwang, R Duraiswaini, and Larry S Davis. 2003. HRTF per sonalization using anthropometric measurements. In 2003 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (IEEE Cat. No. 03TH8684). Ieee, 157--160.Google ScholarGoogle ScholarCross RefCross Ref
  59. Dmitry N Zotkin, Ramani Duraiswami, and Larry S Davis. 2004. Rendering localized spatial audio in a virtual auditory space. IEEE Transactions on multimedia 6, 4 (2004), 553--564.Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Personalizing head related transfer functions for earables

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Conferences
          SIGCOMM '21: Proceedings of the 2021 ACM SIGCOMM 2021 Conference
          August 2021
          868 pages
          ISBN:9781450383837
          DOI:10.1145/3452296

          Copyright © 2021 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 9 August 2021

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • research-article

          Acceptance Rates

          Overall Acceptance Rate554of3,547submissions,16%

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader