Abstract
Learners increasingly watch video lectures in ubiquitous settings. However, existing video designs are not optimised for ubiquitous use, creating the need to adapt the style of these videos to the constraints of the learning platform and the context of use. Our formative study with experienced video editing users found that performing these adaptations with traditional video editors can be challenging and time-consuming. We therefore developed VidAdapter, a tool that facilitates lecture video adaptation by allowing direct manipulation of the video content. To this end, VidAdapter automatically extracts meaningful elements from the video, enables spatial and temporal reorganisation of the elements, and streamlines the modification of an element's visual appearance. We demonstrate the capabilities and specific use cases of VidAdapter within the domain of adapting existing blackboard lecture videos for on-the-go learning on Optical Head-Mounted Displays. Our evaluation of the tool with experienced video editing users revealed that VidAdapter was strongly preferred over traditional approaches and can improve the efficiency of the adaptation process by over 53% on average.
Supplemental Material
Supplemental movie, appendix, image and software files for VidAdapter: Adapting Blackboard-Style Videos for Ubiquitous Viewing