Abstract
Transfer learning (TL) has emerged as a promising area of research in machine learning (ML) due to its ability to enhance learning efficiency and accuracy by leveraging knowledge from related domains. However, traditional TL is limited in its applicability to real-world scenarios where the assumption of identical feature spaces and distributions between source and target domains is untenable. To address this limitation, Heterogeneous Transfer Learning (HeTL) has emerged as an important research direction that enables knowledge transfer between domains with heterogeneous feature spaces and distributions. Motivated by the growing interest and significance of HeTL, this survey paper comprehensively reviews recent HeTL developments, beginning with mathematical TL definitions and a taxonomy of TL categories. It delves into HeTL, explaining its classification and research status, and highlights symmetric and asymmetric HeTL advancements. Next, we explored the applications of HeTL in various disciplines, such as image and text classification, activity recognition, and cross-project defect prediction, emphasizing HeTL’s advantages over Traditional TL. Furthermore, we also discuss the challenges in HeTL, such as heterogeneity, transferability, negative learning, interpretability, and explainability. Finally, we conclude with a discussion on HeTL directions for future research.
Similar content being viewed by others
Data Availability
Data sharing is not applicable to this article as no new data were generated or analyzed during the current study.
References
Yu F, Xiu X, Li Y (2022) A survey on deep transfer learning and beyond. Mathematics 10(19):3619
Kim HE, Cosa-Linan A, Santhanam N, Jannesari M, Maros ME, Ganslandt T (2022) Transfer learning for medical image classification: a literature review. BMC Med Imaging 22(1):69
Musyafa A, Gao Y, Solyman A, Wu C, Khan S (2022) Automatic correction of indonesian grammatical errors based on transformer. Appl Sci 12(20):10380
Zhi Y, Zhang H, Gao Z et al (2023) Vessel contour detection in intracoronary images via bilateral cross-domain adaptation. IEEE J Biomed Health Inf
Zhao S, Yue X, Zhang S, Li B, Zhao H, Wu B, Krishna R, Gonzalez JE, Sangiovanni-Vincentelli AL, Seshia SA et al (2020) A review of single-source deep unsupervised visual domain adaptation. IEEE Trans Neural Netw Learn Syst 33(2):473–493
Jaiswal A, Gianchandani N, Singh D, Kumar V, Kaur M (2021) Classification of the covid-19 infected patients using densenet201 based deep transfer learning. J Biomol Struct Dyn 39(15):5682–5689
Mahmood F, Chen R, Durr NJ (2018) Unsupervised reverse domain adaptation for synthetic medical images via adversarial training. IEEE Trans Med Imaging 37(12):2572–2581
Das NN, Kumar N, Kaur M, Kumar V, Singh D (2022) Automated deep transfer learning-based approach for detection of COVID-19 infection in chest x-rays. IRBM 43(2):114–119
Liu G, Peng J, El-Latif AAA (2023) Sk-mobilenet: a lightweight adaptive network based on complex deep transfer learning for plant disease recognition. Arab J Sci Eng 48(2):1661–1675
Sabir MFS, Mehmood I, Alsaggaf WA, Khairullah EF, Alhuraiji S, Alghamdi AS, El-Latif A, Ahmed A (2022) An automated real-time face mask detection system using transfer learning with faster-rcnn in the era of the COVID-19 pandemic. Comput Mater Contin 71(2)
Guo S, Liu X, Zhang H, Lin Q, Xu L, Shi C, Gao Z, Guzzo A, Fortino G (2023) Causal knowledge fusion for 3d cross-modality cardiac image segmentation. Inf Fusion 101864
Guo S, Xu L, Feng C, Xiong H, Gao Z, Zhang H (2021) Multi-level semantic adaptation for few-shot segmentation on cardiac image sequences. Med Image Anal 73:102170
Perone CS, Ballester P, Barros RC, Cohen-Adad J (2019) Unsupervised domain adaptation for medical imaging segmentation with self-ensembling. NeuroImage 194:1–11
Zhuang F, Qi Z, Duan K, Xi D, Zhu Y, Zhu H, Xiong H, He Q (2020) A comprehensive survey on transfer learning. Proceedings of the IEEE 109(1):43–76
Kulis B, Saenko K, Darrell T (2011) What you saw is not what you get: domain adaptation using asymmetric kernel transforms. In: CVPR 2011, pp 1785–1792. IEEE
Hssayni EH, Joudar N-E, Ettaouil M (2022) A deep learning framework for time series classification using normal cloud representation and convolutional neural network optimization. Comput Intell 38(6):2056–2074
Wang H, Wang X, Cheng Y (2023) Graph meta transfer network for heterogeneous few-shot hyperspectral image classification. IEEE Trans Geosci Remote Sens 61:1–12
Shi X, Liu Q, Fan W, Philip SY, Zhu R (2010) Transfer learning on heterogenous feature spaces via spectral transformation. In: 2010 IEEE international conference on data mining, pp 1049–1054
Wu Q, Wu H, Zhou X, Tan M, Xu Y, Yan Y, Hao T (2017) Online transfer learning with multiple homogeneous or heterogeneous sources. IEEE Trans Knowl Data Eng 29(7):1494–1507
Esuli A, Moreo A, Sebastiani F (2019) Funnelling: a new ensemble method for heterogeneous transfer learning and its application to cross-lingual text classification. ACM Trans Inf Syst (TOIS) 37(3):1–30
Wu H, Yan Y, Ye Y, Min H, Ng MK, Wu Q (2019) Online heterogeneous transfer learning by knowledge transition. ACM Trans Intell Syst Technol (TIST) 10(3):1–19
Moon S, Carbonell JG (2017) Completely heterogeneous transfer learning with attention-what and what not to transfer. IJCAI 1:1–2
Zhou J, Pan S, Tsang I, Yan Y (2014) Hybrid heterogeneous transfer learning through deep learning. Proceedings of the AAAI Conference on Artificial Intelligence 28:000–0008
Wang C, Mahadevan S (2011) Heterogeneous domain adaptation using manifold alignment. In: IJCAI Proceedings-international joint conference on artificial intelligence, vol 22, p 1541
Yi J, Tao J, Wen Z, Bai Y (2018) Language-adversarial transfer learning for low-resource speech recognition. IEEE/ACM Trans Audio Speech Lang Process 27(3):621–630
Xia W, Huang J, Hansen JH (2019) Cross-lingual text-independent speaker verification using unsupervised adversarial discriminative domain adaptation. In: ICASSP 2019-2019 IEEE international conference on acoustics, speech and signal processing (ICASSP), pp 5816–5820. IEEE
Harel M, Mannor S (2010) Learning from multiple outlooks. arXiv:1005.0027
Deotale D, Verma M, Suresh P, Jangir SK, Kaur M, Idris SA, Alshazly H (2022) Hartiv: human activity recognition using temporal information in videos. Comput Mater Contin 70(2)
Jain B, Patidar S, Sudershan D (2022) Heterogeneous software defect prediction using generative models. In: 2022 IEEE 11th international conference on communication systems and network technologies (CSNT), pp 367–372. IEEE
Farahani A, Pourshojae B, Rasheed K, Arabnia HR (2020) A concise review of transfer learning. In: 2020 international conference on computational science and computational intelligence (CSCI), pp 344–351. IEEE
Friedjungová M, Jirina M (2017) Asymmetric heterogeneous transfer learning: a survey. In: DATA, pp 17–27
Weiss K, Khoshgoftaar TM, Wang D (2016) A survey of transfer learning. J Big Data 3:1–40
Day O, Khoshgoftaar TM (2017) A survey on heterogeneous transfer learning. J Big Data 4:1–42
Pan SJ, Yang Q (2010) A survey on transfer learning. IEEE Trans Knowl Data Eng 22(10):1345–1359
Feuz KD, Cook DJ (2015) Transfer learning across feature-rich heterogeneous feature spaces via feature-space remapping (fsr). ACM Trans Intell Syst Technol (TIST) 6(1):1–27
Khan S, Asim M, Chelloug SA, Abdelrahiem B, Khan S, Musyafa A (2023) A novel cluster matching-based improved kernel fisher criterion for image classification in unsupervised domain adaptation. Symmetry 15(6):1163
Khan S, Asim M, Khan S, Musyafa A, Wu Q (2023) Unsupervised domain adaptation using fuzzy rules and stochastic hierarchical convolutional neural networks. Comput Electr Eng 105:108547
Khan S, Guo Y, Ye Y, Li C, Wu Q (2023) Mini-batch dynamic geometric embedding for unsupervised domain adaptation. Neural Process Lett 1–18
Csurka G (2017) A comprehensive survey on domain adaptation for visual applications. Domain Adaptation Comput Vis Appl 1–35
Rosenstein MT, Marx Z, Kaelbling LP, Dietterich TG (2005) To transfer or not to transfer. In: NIPS 2005 Workshop on Transfer Learning, vol 898
He Y, Jin X, Ding G, Guo Y, Han J, Zhang J, Zhao S (2020) Heterogeneous transfer learning with weighted instance-correspondence data. Proceedings of the AAAI Conference on Artificial Intelligence 34:4099–4106
Zhu Y, Chen Y, Lu Z, Pan S, Xue G-R, Yu Y, Yang Q (2011) Heterogeneous transfer learning for image classification. Proceedings of the AAAI Conference on Artificial Intelligence 25:1304–1309
Yan Y, Wu Q, Tan M, Min H (2016) Online heterogeneous transfer learning by weighted offline and online classifiers. In: Computer Vision–ECCV 2016 Workshops: Amsterdam, The Netherlands, October 8-10 and 15-16, 2016, Proceedings, Part III 14, pp 467–474. Springer
Prettenhofer P, Stein B (2010) Cross-language text classification using structural correspondence learning. In: Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, pp 1118–1127
Duan L, Xu D, Tsang I (2012) Learning with augmented features for heterogeneous domain adaptation. arXiv:1206.4660
Wu H, Wu Q, Ng MK (2021) Knowledge preserving and distribution alignment for heterogeneous domain adaptation. ACM Trans Inf Syst (TOIS) 40(1):1–29
Wu H, Zhu H, Yan Y, Wu J, Zhang Y, Ng MK (2021) Heterogeneous domain adaptation by information capturing and distribution matching. IEEE Trans Image Process 30:6364–6376
Blitzer J, McDonald R, Pereira F (2006) Domain adaptation with structural correspondence learning. In: Proceedings of the 2006 conference on empirical methods in natural language processing, pp 120–128
Johnson R, Zhang T (2005) A high-performance semi-supervised learning method for text chunking. In: Proceedings of the 43rd annual meeting of the association for computational linguistics (ACL’05), pp 1–9
Pan SJ, Kwok JT, Yang Q et al (2008) Transfer learning via dimensionality reduction. AAAI 8:677–682
Daumé III H (2009) Frustratingly easy domain adaptation. arXiv:0907.1815
Daumé III H, Kumar A, Saha A (2010) Frustratingly easy semi-supervised domain adaptation. In: Proceedings of the 2010 workshop on domain adaptation for natural language processing, pp 53–59
Zhong E, Fan W, Peng J, Zhang K, Ren J, Turaga D, Verscheure O (2009) Cross domain distribution adaptation via kernel mapping. In: Proceedings of the 15th ACM SIGKDD international conference on knowledge discovery and data mining, pp 1027–1036
Liu F, Zhang G, Lu J (2020) Heterogeneous domain adaptation: an unsupervised approach. IEEE Trans Neural Netw Learn Syst 31(12):5588–5602
Dai W, Chen Y, Xue G-R, Yang Q, Yu Y (2008) Translated learning: transfer learning across different feature spaces. Adv Neural Inf Process Syst 21:1
Wang G, Hoiem D, Forsyth D (2009) Building text features for object image classification. In: 2009 IEEE conference on computer vision and pattern recognition, pp 1367–1374. IEEE
Qi G-J, Aggarwal C, Huang T (2011) Towards semantic knowledge propagation from text corpus to web images. In: Proceedings of the 20th international conference on World Wide Web, pp 297–306
Raina R, Battle A, Lee H, Packer B, Ng AY (2007) Self-taught learning: transfer learning from unlabeled data. In: Proceedings of the 24th international conference on machine learning, pp 759–766
Wang D, Cui P, Zhu W (2018) Deep asymmetric transfer network for unbalanced domain adaptation. Proceedings of the AAAI Conference on Artificial Intelligence 32:0000–00009
Wei B, Pal C (2010) Cross lingual adaptation: an experiment on sentiment classifications. In: Proceedings of the ACL 2010 Conference Short Papers, pp 258–262
Wei B, Pal C (2011) Heterogeneous transfer learning with rbms. Proceedings of the AAAI Conference on Artificial Intelligence 25:531–536
Zhou Z-H, Dai H-B (2007) Exploiting image contents in web search. In: IJCAI, pp 2922–2927
Duan L, Xu D, Chang S-F (2012) Exploiting web images for event recognition in consumer videos: a multiple source domain adaptation approach. In: 2012 IEEE conference on computer vision and pattern recognition, pp 1338–1345. IEEE
Li Q, Han Y, Dang J (2014) Large-scale cross-media retrieval by heterogeneous feature augmentation. In: 2014 12th International conference on signal processing (ICSP), pp 977–980. IEEE
Liu X, Song L, Wu X, Tan T (2016) Transferring deep representation for nir-vis heterogeneous face recognition. In: 2016 International conference on biometrics (ICB), pp 1–8. IEEE
Zhou JT, Tsang IW, Pan SJ, Tan M (2014) Heterogeneous domain adaptation for multiple classes. In: Artificial intelligence and statistics, pp 1095–1103. PMLR
Ando RK, Zhang T, Bartlett P (2005) A framework for learning predictive structures from multiple tasks and unlabeled data. J Mach Learn Res 6(11)
Rashidi P, Cook DJ (2010) Multi home transfer learning for resident activity discovery and recognition. In: KDD international workshop on knowledge discovery from sensor data, pp 53–63
Van Kasteren T, Englebienne G, Kröse BJ (2010) Transferring knowledge of activity recognition across sensor networks. In: Pervasive computing: 8th international conference, pervasive 2010, Helsinki, Finland, May 17-20, 2010. Proceedings 8, pp 283–300. Springer
Nam J, Kim S (2015) Heterogeneous defect prediction. In: Proceedings of the 2015 10th joint meeting on foundations of software engineering, pp 508–519
He Z, Shu F, Yang Y, Li M, Wang Q (2012) An investigation on the feasibility of cross-project defect prediction. Autom Softw Eng 19:167–199
Rahman F, Posnett D, Devanbu P (2012) Recalling the“ imprecision” of cross-project defect prediction. In: Proceedings of the ACM SIGSOFT 20th international symposium on the foundations of software engineering, pp 1–11
Nam J, Pan SJ, Kim S (2013) Transfer defect learning. In: 2013 35th International conference on software engineering (ICSE), pp 382–391. IEEE
Ma Y, Luo G, Zeng X, Chen A (2012) Transfer learning for cross-company software defect prediction. Inf Softw Technol 54(3):248–256
Yosinski J, Clune J, Bengio Y, Lipson H (2014) How transferable are features in deep neural networks? Adv Neural Inf Process Syst 27
Yang Q, Chen Y, Xue G-R, Dai W, Yu Y (2009) Heterogeneous transfer learning for image clustering via the socialweb. In: Proceedings of the joint conference of the 47th annual meeting of the ACL and the 4th international joint conference on natural language processing of the AFNLP, pp 1–9
Wu L, Hoi SC, Jin R, Zhu J, Yu N (2011) Distance metric learning from uncertain side information for automated photo tagging. ACM Trans Intell Syst Technol (TIST) 2(2):1–28
Saenko K, Kulis B, Fritz M, Darrell T (2010) Adapting visual category models to new domains. In: Computer vision–ECCV 2010: 11th European conference on computer vision, Heraklion, Crete, Greece, September 5-11, 2010, Proceedings, Part IV 11, pp 213–226. Springer
Shawe-Taylor J, Cristianini N (2004) Kernel Methods for Pattern Analysis. Cambridge University Press, Cambridge, UK
Hernandez-Cruz N, Nugent C, Zhang S, McChesney I (2021) The use of transfer learning for activity recognition in instances of heterogeneous sensing. Appl Sci 11(16):7660
Cook D, Feuz KD, Krishnan NC (2013) Transfer learning for activity recognition: a survey. Knowl Inf Syst 36:537–556
Sargano AB, Wang X, Angelov P, Habib Z (2017) Human action recognition using transfer learning with deep representations. In: 2017 International joint conference on neural networks (IJCNN), pp 463–469. IEEE
Blanke, U., Schiele, B (2010) Remember and transfer what you have learned-recognizing composite activities based on activity spotting. In: International symposium on wearable computers (ISWC) 2010, pp. 1–8. IEEE
Nomizu K, Katsumi N, Sasaki T (1994) Affine Differential Geometry: Geometry of Affine Immersions. Cambridge University Press, Cambridge, UK
Vaart AW (2000) Asymptotic Statistics, vol 3. Cambridge University Press, Cambridge, UK
Blitzer J, Crammer K, Kulesza A, Pereira F, Wortman J (2007) Learning bounds for domain adaptation. Adv Neural Inf Process Syst 20:1
Lafferty J, Zhai C (2001) Document language models, query models, and risk minimization for information retrieval. In: Proceedings of the 24th annual international ACM SIGIR conference on research and development in information retrieval, pp 111–119
Dietterich TG, Bakiri G (1994) Solving multiclass learning problems via error-correcting output codes. J Artif Intell Res 2:263–286
Shivakumar PG, Georgiou P (2020) Transfer learning from adult to children for speech recognition: evaluation, analysis and recommendations. Comput Speech Lang 63:101077
Chui KT, Arya V, Band SS, Alhalabi M, Liu RW, Chi HR (2023) Facilitating innovation and knowledge transfer between homogeneous and heterogeneous datasets: generic incremental transfer learning approach and multidisciplinary studies. J Innov Knowl 8(2):100313
Bica I, Schaar M (2022) Transfer learning on heterogeneous feature spaces for treatment effects estimation. Adv Neural Inf Process Syst 35:37184–37198
Medhat W, Hassan A, Korashy H (2014) Sentiment analysis algorithms and applications: a survey. Ain Shams Eng J 5(4):1093–1113
Sanz FG, Ebrahimi M, Johnsson A (2022) Exploring approaches for heterogeneous transfer learning in dynamic networks. In: NOMS 2022-2022 IEEE/IFIP network operations and management symposium, pp 1–9. IEEE
Geaur Rahman, M., Zahidul Islam, M.: A framework for supervised heterogeneous transfer learning using dynamic distribution adaptation and manifold regularization. 2108 (2021)
Zhao P, Gao H, Lu Y, Wu T (2019) A cross-media heterogeneous transfer learning for preventing over-adaption. Appl Soft Comput 85:105819
Mignone P, Pio G, Ceci M (2022) Distributed heterogeneous transfer learning for link prediction in the positive unlabeled setting. In: 2022 IEEE international conference on big data (Big Data), pp. 5536–5541. IEEE
Iqbal MS, Luo B, Khan T, Mehmood R, Sadiq M (2018) Heterogeneous transfer learning techniques for machine learning. Iran J Comput Sci 1:31–46
Baik S, Choi M, Choi J, Kim H, Lee KM (2020) Meta-learning with adaptive hyperparameters. Adv Neural Inf Process Syst 33:20755–20765
Joudar N-E, Ettaouil M et al (2022) An adaptive drop method for deep neural networks regularization: Estimation of dropconnect hyperparameter using generalization gap. Knowl-Based Syst 253:109567
Goodfellow I, Pouget-Abadie J, Mirza M, Xu B, Warde-Farley D, Ozair S, Courville A, Bengio Y (2020) Generative adversarial networks. Commun ACM 63(11):139–144
Kingma, DP, Welling M (2014) Stochastic gradient vb and the variational auto-encoder. In: Second international conference on learning representations, ICLR, vol 19, p121
Mansour, Y., Mohri, M., Rostamizadeh, A.: Domain adaptation: Learning bounds and algorithms. arXiv:0902.3430 (2009)
Gretton A, Borgwardt KM, Rasch MJ, Schölkopf B, Smola A (2012) A kernel two-sample test. J Mach Learn Res 13(1):723–773
Hssayni Eh, Joudar N-E, Ettaouil M (2022) Localization and reduction of redundancy in cnn using l 1-sparsity induction. J Ambient Intell Humaniz Comput, 1–13
Fin C, Abbeel P, Levine S (2017) Model-agnostic meta-learning for fast adaptation of deep networks. In: International conference on machine learning, pp 1126–1135. PMLR
Ganin Y, Ustinova E, Ajakan H, Germain P, Larochelle H, Laviolette F, Marchand M, Lempitsky V (2016) Domain-adversarial training of neural networks. J Mach Learn Res 17(1):2096–2030
Huang J, Gretton A, Borgwardt K, Schölkopf B, Smola A (2006) Correcting sample selection bias by unlabeled data. Adv Neural Inf Process Syst 19
Kim B, Malioutov DM, Varshney KR, Weller A (2017) Proceedings of the 2017 icml workshop on human interpretability in machine learning (whi 2017). 1708
Ribeiro MT, Singh S, Guestrin C (2016) why should i trust you? explaining the predictions of any classifier. In: Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp 1135–1144
Caruana R, Lou Y, Gehrke J, Koch P, Sturm M, Elhadad N (2015) Intelligible models for healthcare: Predicting pneumonia risk and hospital 30-day readmission. In: Proceedings of the 21th ACM SIGKDD international conference on knowledge discovery and data mining, pp 1721–1730
Xu K, Ba J, Kiros R, Cho K, Courville A, Salakhudinov R, Zemel R, Bengio Y (2015) Show, attend and tell: Neural image caption generation with visual attention. In: International conference on machine learning, pp 2048–2057. PMLR
Lakkaraju H, Bach SH, Leskovec J (2016) Interpretable decision sets: a joint framework for description and prediction. In: Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp 1675–1684
Sanz FG, Ebrahimi M, Johnsson A (2021) On heterogeneous transfer learning for improved network service performance prediction. In: 2021 IEEE global communications conference (GLOBECOM), pp 1–6. IEEE
Zhou JT, Pan SJ, Tsang IW (2019) A deep learning framework for hybrid heterogeneous transfer learning. Artif Intell 275:310–328
Bahdanau D, Cho K, Bengio, Y (2014) Neural machine translation by jointly learning to align and translate. arXiv:1409.0473
Simonyan K, Vedaldi A, Zisserman A (2013) Deep inside convolutional networks: Visualising image classification models and saliency maps. arXiv:1312.6034
Wachter S, Mittelstadt B, Russell C (2017) Counterfactual explanations without opening the black box: automated decisions and the gdpr. Harv JL & Tech 31:841
Sundararajan M, Taly A, Yan Q (2017) Axiomatic attribution for deep networks. In: International conference on machine learning, pp 3319–3328. PMLR
Acknowledgements
This work was supported by National Natural Science Foundation of China (NSFC) 62272172, Guangdong Basic and Applied Basic Research Foundation 2023A1515012920. Also, the authors would like to thanks Prince Sultan University for their support.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflicts of interest
The authors declare no conflict of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Khan, S., Yin, P., Guo, Y. et al. Heterogeneous transfer learning: recent developments, applications, and challenges. Multimed Tools Appl (2024). https://doi.org/10.1007/s11042-024-18352-3
Received:
Revised:
Accepted:
Published:
DOI: https://doi.org/10.1007/s11042-024-18352-3