Are there any differences in the tourists ’ perceived destination image between travel e-commerce platforms and social media platforms?

Purpose – This study aims to explore the differences in the tourists ’ perceived destination image on travel e-commerceplatforms (e.g.CtripandFliggy)andsocial mediaplatforms (e.g. Xiaohongshuand Weibo).


Introduction
Tourism destination image reflects the overall consumer impressions of the tourist destination (Kotler, 2002).It is a crucial factor in destination development and tourist's travel decision (Li et al., 2021).Previous studies have extensively explored the definition, construction and measurements of tourism destination images (Gartner, 1993;Joo et al., 2023;Woosnam et al., 2020).With the advancement of information technology, online platforms have become key channels for tourists to search for information, post comments and share travel experiences (Gaffar et al., 2022).Usergenerated content (UGC) on travel e-commerce platforms and social media platforms is considered a reliable and trustworthy way to evaluate destination images, as consumers trust these channels in their decision-making process (Kaur et al., 2016;Mao et al., 2021).
Differences in tourists' perceived destination images derived from various online platforms can arise due to the geographical location of users or platform-specific biases.For instance, different social media platforms are preferred in different countries, such as Mafengwo in China and Tripadvisor in other countries, leading to different representation of the same destination across cultures (Lee and Park, 2023).Guo et al. (2021) found that platform-specific biases can cause variations in destination images, showing that Finland's destination image differs across five travel e-commerce platforms: Ctrip, Qyer, Mafengwo, Tuniu and Qunaer.Thus, understanding the multiple representations of a single on the internet is essential (Choi et al., 2007).
In addition to travel e-commerce platforms, general social media such as Twitter, Instagram and Weibo are widely used as data sources to represent tourism destination images (Arefieva et al., 2021;Molinillo et al., 2018).Critical differences exist between travel e-commerce and social media platforms.Travel e-commerce platforms, such as TripAdvisor and Ctrip, primarily focus on facilitating online travel deals (Hua et al., 2015).By contrast, general social media platforms like Twitter, Instagram and Weibo emphasize online communication and interaction (Akhtar et al., 2023;Liu et al., 2021).Relying solely on one type of platform to form a representation of a tourism destination image can introduce significant cognitive biases, leading to irrational destination development strategies and unwise travel decisions.Therefore, it is crucial to consider the limitations and biases associated with each platform when evaluating tourism destination images.
A research gap exists regarding the differences in the tourism destination images represented by UGC on travel e-commerce platforms and social media platforms.Further investigation is needed to understand and identify these differences.To fill this gap, this study explored the difference in tourism destination images between these two platforms through UGC analysis.The main contribution of this study is the use of both an inductive approach (the LDA model and co-occurrence analysis with community detection) and a deductive approach (the word frequency analysis with manual classification) to identify the difference between tourism destination images reflected by UGC on travel e-commerce platforms and social media platforms.The specific research objectives are to: • retrieve UGC of a specific tourism destination from both travel e-commerce platform and social media; • construct tourism destination image frameworks from UGC on different platforms using the LDA model, co-occurrence analysis with community detection and word frequency analysis with manual classification; • identify the differences between the tourism destination image frameworks.

Literature review and hypothesis development 2.1 Tourists' perceived destination image
Tourists' perceived destination image reflects the overall consumer impressions of the tourist destination (Kotler, 2002).Gartner (1993)'s theory provides a foundational framework for understanding this image, proposing that it comprises three key components: cognitive, affective and conative images.Cognitive images pertain to how tourists evaluate a destination based on their initial visit or subsequent visits (Joo et al., 2023).Affective images relate to tourists' emotional responses and feelings toward the destination (Lee and Jeong, 2023).Conative images are concerned with future behaviors and intentions, which are influenced by the cognitive and affective components (Gartner and Witt, 1994;Woosnam et al., 2020).The overall perceived destination image includes not only tourists' beliefs, ideas and impressions but also their thoughts and emotions about the destination's attributes (Stylos et al., 2017;Chaulagain et al., 2019).It encompasses both functional and psychological characteristics (Tung et al., 2021), and can explain individuals' reactions and behaviors toward destinations (Afshardoost and Eshaghi, 2020).
This framework represents a significant advancement in the field and was used in this study using a deductive approach to construct the tourism destination image.
2.2 Tourist' perceived destination image and user-generated content Web 2.0 technologies have enabled tourists to create and share opinions on various online platforms (Mak, 2017).Users frequently upload and share blogs, text, photos and videos, expressing their "feelings" about a tourism destination (Hunter, 2016;Taecharungroj and Mathayomchan, 2021).This UGC has become a key driver of visitor behavior, destination choice, e-word of mouth and ultimately, the reputation and success of a destination (Liu et al., 2021).The online perceived image of a destination is more dynamic than its traditional counterpart, reflecting collective beliefs, knowledge, ideas, feelings and overall impressions (Hunter, 2013).Online representations through text and photographs complement and enhance traditional images found in brochures, guidebooks and surveys (Alarcon-Urbistondo et al., 2023).
Advanced technological methods such as natural language processing (NLP) techniques, artificial intelligence (AI) framework and the latent Dirichlet allocation (LDA) model are commonly used to study online destination images (Lin et al., 2021;Kaur et al., 2016).For example, NLP techniques have been used to explore the impact of safety on online perceived destination image (Marine-Roig and Huertas, 2020), while AI frameworks assist in identifying tourism images from uploading photos (Wang et al., 2020).Li et al. (2022) developed a multilevel visual system to analyze cognitive themes and emotional experiences of various destination images.The LDA model is particularly significant for extracting topics from UGC to measure destination image and e-reputation (Dong et al., 2023;Ali et al., 2021).Overall, UGC combined with advanced technological methods, such as the LDA model, provides an inductive approach to studying destination images, allowing for comprehensive and inclusive evaluation of a destination's attributes (Alarcon-Urbistondo et al., 2023).

The difference between travel e-commerce platforms and social media platforms
Travel e-commerce platforms and social media platforms are two primary data sources for destination image research (Li et al., 2023).On social media platforms, tourism-related photos and texts are used to investigate the perceived image of tourism destinations (Arabadzhyan et al., 2021;Narangajavana et al., 2017).Molinillo et al. (2018) compared UGC from various social media platforms such as Instagram, Facebook, YouTube and Twitter to explore differences in tourism's perceived image.By contrast, UGC from travel e-commerce platforms, primarily online review, is used to establish perceived destination image and measure consumer satisfaction (Guo et al., 2017;Hou et al., 2019;Liu et al., 2018;Zhang et al., 2016).
Despite the importance of these platforms, the difference between travel e-commerce platforms and social media platforms remain unexplored.Studies often focus on one type of platform, either travel e-commerce or social media, without comparing the two (Guo et al., 2021;Liu et al., 2021;Molinillo et al., 2018).This lack of comparative research limits our understanding of the similarities and differences between these platforms.Xiang et al. (2017) highlighted that relying on a single data source restricts the generalizability and contribution of social media analytics.Their study demonstrated significant variations in linguistic characteristics, semantic features, sentiment, rating and usefulness among online reviews on TripAdvisor, Expedia and Yelp.
The distinctions between travel e-commerce platforms and social media platforms are clear.Travel e-commerce platforms operate as internet-based business models providing online trading and communication platforms for tourists and tourism suppliers, focusing on trading and evaluation functions (Hua, 2016).By contrast, social media platforms are internet-based applications build on Web 2.0 principles, allowing the creation and exchange of UGC, emphasizing communication and sharing functions (Kaplan and Haenlein, 2010;Akhtar et al., 2023).Additionally, travel e-commerce platforms restrict review posting to users who have booked and consumed a certain travel product on the platform, whereas social media platforms impose no such restrictions, allowing users to freely post reviews, feelings or thoughts about a destination or travel product (Huang, 2024;Zeng and Gerritsen, 2014).Consequently, UGC on these platforms may reflect different images of a destination.Based on these differences, this study proposes the following research question (RQ): RQ. Are there any differences in the perceived destination image derived from UGC on different online platforms?If so, what are the differences?

Data collection
Sanya, located on the southernmost tip of Hainan Island in China, is often referred to as the "Hawaii of China" because of its stunning beaches, clear turquoise waters and tropical climate.This popular tourist destination generates a large amount of UGC on both travel ecommerce platforms and social media, making it an ideal location for studying perceived destination image across different types of platforms.
To collect data, we used Octoparse (https://www.octoparse.com), a robust web scraping tool that efficiently extracts both textual and metadata from various websites (Yu et al., 2020;Yu and Egger, 2021).Data was collected in two phases: May 2023 and July 2024.This twostage approach enhances the robustness of our results by increasing the data volume and mitigating the potential time bias.
This study draws data from four platforms: Ctrip, Fliggy, Xiaohongshu and Weibo.Ctrip and Fliggy represent travel e-commerce platforms, while Xiaohongshu and Weibo are social media platforms.These platforms were selected for three reasons: their widespread use among Chinese users; their representativeness in their respective fields; their frequent use as data sources in tourism image research.
Ctrip (http://www.ctrip.com),established in 1999, is a leading Chinese travel e-commerce platform with over 700 million users by 2023 (Hou et al., 2019).Fliggy (https:// www.fliggy.com),also known as Fei Zhu, is an Alibaba Group-owned online travel service platform launched in 2010.We used "Sanya" as a keyword to search information in Ctrip and Fliggy, collecting tourism reviews for 47 hotels and 14 attractions in Sanya.This resulted in over 35,000 reviews from Ctrip and approximately 16,400 reviews from Fliggy.
Xiaohongshu (https://www.xiaohongshu.com) is a prominent social media platform in China where users share daily life experiences.Weibo, launched by Sina Corporation in 2009 and often referred to as the "Chinese Twitter," is another major social media platform.For Xiaohongshu and Weibo, we collected UGC related to "Sanya Tourism," "Sanya" and the same 47 hotels and 14 attractions.This effort yielded over 23,000 reviews from the Xiaohongshu, and more than 16,000 reviews from Weibo.

Research procedure
This study preformed preprocessing operations in Python such as word separation and removal of deactivated on the corpus source files to obtain the document-word matrix.We used the widely recognized Chinese stop words list developed by the Harbin Institute of Technology (Hou et al., 2019).This study used three methods to extract attributes of tourist's perceived image: (1) The LDA model.The LDA model is a widely used for theme extraction from UGC (Seyyedamiri et al., 2022).We used Python to perform LDA analysis.To determine the optimal number of topics, we ran the LDA model with topic numbers ranging from 2 to 10, calculating the perplexity and coherence value of each model.Figures 1 and 2 present the perplexity value and coherence value for the LDA models applied to of UGC from both platform types.A low perplexity value indicates the model's better grasp of the underlying topic distribution in the data (Blei et al., 2003), while a high coherence value shows that the words within the topics are more related, forming coherent and meaningful concepts (Röder et al., 2015).The visualization results enable researchers to assess whether the topics are reasonable and relevant to the research objectives (Sakiyama et al., 2020).Combining the perplexity and coherence values with the visualization results, we determined the optimal number of topics for the LDA models.For UGC on travel e-commerce platforms, an LDA model with seven topics was identified as optimal.Similarly, for UGC on social media platforms, seven topics also proved to be most appropriate number.These topics and sub-keywords collectively form the tourism destination image of Sanya from the tourist's perspective.This method follows an inductive approach to building tourism destination images.
(2) Co-occurrence analysis with community detection.First, we preprocessed the data through tokenization and removal of stop words to create a 1,000 × 1,000 matrix where both rows and columns represent the top 1,000 most frequently occurring keywords.Each cell value in the matrix indicates the number of times the corresponding keyword pairs co-occur in the data set.The resulting co-occurrence matrix was then imported into Gephi, a clustering analysis program, where a community detection algorithm (Modularity Class) was applied to segment the data from travel e-commerce platforms and social media platforms.
(3) Word frequency analysis and manual classification.Initially, we used Python to analyze the word frequency in tourists' reviews from travel e-commerce platforms and social media platforms.Subsequently, based on the mutual destination image framework (Beerli and Martin, 2004;Lee and Park, 2023), we manually classified the high-frequency words into destination image attributes and explored new destination image attributes.This method follows a deductive approach to build a tourism destination image.

The result of latent Dirichlet allocation analysis
(1) Extracting attributes of tourism destination image from UGC on travel e-commerce platform using the LDA model Table 1 presents the results obtained from applying the LDA model to UGC on travel e-commerce platforms.Figure 3 shows the visualization of these results.In Figure 3, there are seven groups of words, the words in the right represent the top 30 relevant terms in the data set.By clicking the "next topic" button, you can view the top 30 terms for each specific topic.The topics and corresponding feature words in Table 1 are derived from the most relevant terms identified in the visualization.The LDA model derived seven distinct topics from tourist reviews, primarily focused on aspects of hotel service in Sanya.
For example, the feature words for Topic 1 include breakfast, check-in, hotel, facility, environment, vacation, Marriott, Sheraton, complete, sea view room, etc.These words relate to the general perceptions of tourists to hotels.Therefore, we categorize Topic 1 as "General perceptions of hotels," and consider it a critical attribute of tourism perceived destination image of Sanya.The rest of topics are: • Service quality.It focuses on the quality of services provided by hotels, emphasizing reliability, helpfulness and the atmosphere in guest rooms and restaurants.• Tourist satisfaction.This topic centers on tourists' satisfaction, discussing value for money and their likelihood of revisiting.    4 provides a visualization of these findings.The topics and corresponding feature words in Table 2 are derived from the most relevant terms identified in the visualization.The LDA model identified seven topics within UGC on social media platforms.For example, the feature words of Topic 1 include Sanya International Duty Free Shopping Center, influencer hotspot, beach, seascape, photography, Nanshan, scenery, sea, tropics, speed boat, etc.These feature words basically describe the scenic spots of Sanya; Thus, Topic 1 is named as "Scenic spots."The rest topics are: • Social events and celebrity endorsements.It highlights the impact of social events and celebrity endorsements on tourism in Sanya.

The result of co-occurrence analysis with community detection
The co-occurrence analysis method, enhanced by community detection algorithms, provides a more in-depth understanding of keyword relationships within UGC.This approach goes beyond simple frequency analysis and LDA model, offering insights into the relational dynamics of terms within the data set (Liu et al., 2019;Zhang et al., 2019).By analyzing how keywords co-occur, we can identify key themes and trends that may otherwise remain hidden.The community detection algorithm clusters keywords into distinct groups based on their co-occurrence matrix, revealing the underlying structure and thematic composition of the data.This method complements the LDA results by capturing the context and interrelationships between words in a more nuanced way: (1) Extracting attributes of tourism destination image from UGC on travel e-commerce platforms using co-occurrence analysis with community detection.Table 3 presents the results from the co-occurrence analysis conducted on UGC from travel e-commerce platforms.Figure 5 illustrates the co-occurrence matrix-based topic model.The topics and corresponding feature words in Table 3 are summarized based on the most relevant terms identified from the cooccurrence matrix diagram in Figure 5.The analysis, visualized using Gephi, categorizes the travel e-commerce platforms data into seven topics: • natural and photogenic aspects of Sanya; • tourists' experiences with hotels; • quality of service provided by hotel staff; • cleanliness, facilities and overall comfort of guest rooms; • suitability for family-friendly activities; • tourists' experiences with food and dining; and • general satisfaction of tourists.
(2) Extracting attributes of tourism destination image from UGC on social media using co-occurrence analysis with community detection.
Table 4 presents the findings from the co-occurrence analysis conducted on UGC on social media platforms.Figure 6 illustrates the result of the co-occurrence matrix-based topic model.The topics and corresponding feature words in Table 4 are summarized based on the TRC most relevant terms identified from the co-occurrence matrix diagram in Figure 6.The analysis were visualized using Gephi, categorizing the social media data into 11 topics: (1) tourists' experiences and recommendations related to various activities and attractions; (2) services and infrastructure available to tourists; (3) natural beauty of Sanya; (4) tourists' recommendations and experiences with local cuisine; (5) tourists' recommendations of hotels; (6) cultural and historical attractions; (7) recommendations for photogenic locations and activities; (8) media coverage and news related to Sanya tourism; (9) theme parks in Sanya; (10) aspects of daily life and personal documentation; and (11) air transportation facilities.

The result of word frequency analysis
In the word frequency analysis, we examined 36,439 words from UGC on travel e-commerce platforms and 67,084 words from UGC on social media platforms.Words appearing 300 times or more were selected to shape the perceived destination image of Sanya (Jiang et al., 2021).These high-frequency words were classified into various dimensions and attributes using the perceived tourism image framework proposed by Beerli and Martin (2004) and Lee and Park (2023).Then, we calculated the weight of each dimension according to the following Format 1. Format 1: 1 Frequency of Word j In Format 1, weight refers to the weight of attribute a, i represents the high-frequency words that belong to attribute a, and n represents total number of words in attribute a.The Frequency of Word i is the number of times word i appears in the data set.j represents the high-frequency words on travel e-commerce platform and social media platform, and the Frequency of Word j is the number of times word j appears in the data set.m represents total number of frequency words used to construct the tourism perceived destination image.For the travel e-commerce platforms, m is 294, and for the social media platforms, m is 201.
The dimensions and attributes identified in the analysis are summarized in Tables 5 and 6, which list the representative high-frequency words associated with each attribute.The perceived destination image of Sanya, as derived from UGC on travel e-commerce platform,  (1) Extracting attributes of tourism destination image from UGC on travel e-commerce platforms with word frequency analysis From the word frequency analysis of UGC on travel e-commerce platforms, it was found that the tourist infrastructure (37.1%) and the atmosphere of the place (27.5%) are the two most influential dimensions shaping Sanya's perceived tourism image.Within the atmosphere of the place dimension, attributes such as "pleasant" and "satisfactory" are highly emphasized.In the tourist infrastructure dimension, "hotels" stands out as a critical attribute.Other significant dimensions include service (13.0%), natural environment (7.2%) and tourist leisure and recreation (6.3%).Although dimensions like social environment (3.9%), general infrastructure (3.2%) and price level (1.8%) are associated with fewer high-frequency words, they still contribute to the overall tourism perceived image of Sanya.
Notably, the study identified some new attributes that were not present in the previously established destination image frameworks.These include duty-free stores, ocean, special sports and theme parks, which underscore Sanya's unique advantages as a tourism destination.
(2) Extracting attributes of tourism destination image from UGC on social media platforms with word frequency analysis Notes: Attributes in italics refer to those attributes that are not from the mutual destination image framework but are newly identified by text mining analytics in this study.The numbers in the bracket behind dimensions and attributes represent their weights, and the numbers in the bracket behind keywords represent their frequencies Source: By authors Note: Attributes in italics refer to those attributes that are not from the mutual destination image framework but are newly identified by text mining analytics in this study.The numbers in the bracket behind dimensions and attributes represent their weights, and the numbers in the bracket behind keywords represent their frequencies.

Source: By authors
The perceived destination image of Sanya derived from UGC on social media platforms consists of nine dimensions and 21 attributes.The weight of each dimension is distributed as follows: 37.6% for tourist infrastructure, 18.4% for recommendation, 13.7% for natural environment, 9.8% for tourist leisure and recreation, 4.9% for general infrastructure, 3.0% for price level, 2.6% for culture, history and art, 0.6% for service.Some novel attributes emerged from the analysis of social media platforms, including recommendations, satisfactory, duty-free stores, special food, celebrity effect, ocean, theme parks and special activities.These attributes reflect the distinct nature of UGC on social media platforms, where users often share personal experiences, provide recommendations or seek advice about specific destinations.These findings highlight the varied dimensions that shape Sanya's perceived image and reveal how different platforms can offer unique insights into what tourists value in their travel experiences.
Table 7 provides a comprehensive overview of the data sets and the results obtained from the three methods.

Discussion and conclusion
Tourists' perceived destination image is crucial in destination management.Previous studies have emphasized the construction of destination images and their impacts on tourism behavior and destination development (Lee and Jeong, 2023;Molinillo et al., 2018).The importance of destination image in marketing strategy and tourism choice is well documented (Afshardoost and Eshaghi, 2020;Hallmann et al., 2015).However, accurately measuring the destination image remains a challenge for both scholars and destination managers (Beerli and Martin, 2004;Bui et al., 2022).With the development of information technology, leveraging UGC and machine learning has emerged as a comprehensive approach to measure destination images accurately (Buhalis and Law, 2008).

Main findings
Our study identified several key differences in the tourism destination image between travel e-commerce platforms and social media: • Travel e-commerce platforms provide a comprehensive and logistical view of tourism experiences, particularly focusing on hotel accommodations, service quality and facility amenities.These platforms emphasize practical elements like booking • The tourism destination image of Sanya on travel e-commerce platforms is centered around hotels.By contrast, the image derived from social media is richer, covering a broader range of attributes.• Both platforms highlight the importance of tourism infrastructure, indicating that travelers prioritize the completeness and diversity of infrastructure during their travels.This suggests that destinations should focus on developing robust tourism infrastructure.• Social media platforms offer a more comprehensive portrayal of cultural and environmental aspects than travel e-commerce platforms.Users on social media frequently highlight natural beauty, cultural attractions and social activities, reflecting a greater appreciation of these elements.• Our study identified new attributes such as duty-free stores, oceans, special sports, theme parks and recommendations.While the first four attributes are emerging on both platforms, recommendation are unique to social media.
These differences in destination images can be attributed to the distinct nature of travel e-commerce and social media platforms.The primary function of the tourism platform is the transaction of tourism product, with user reviews influencing trading decisions (Hua, 2016;Zhong et al., 2023).Only those who have booked tourism products are allowed to post reviews, which focuses the UGC on e-commerce platforms being more targeted and functional.By contrast, social media platforms are centered around communication, allowing users to freely post notes and comments (Akhtar et al., 2023;Zeng and Gerritsen, 2014).This results in more diverse UGC on social media, encompassing a wider range of the tourism perceived destination image.Additionally, our findings indicate that recommendations are a significant attribute in UGC on social media.UGC on these platforms may play different roles in the decision-making process of tourists.Information on social media may contributes primarily to the travel demand perception and information search stages (Leung et al., 2013), while information on travel e-commerce platforms is more relevant during the travel preparation and satisfaction evaluation stage (Xiang et al., 2017).These differences reflect deeper platform biases.
Specifically, the openness and interactivity of social media make it a primary channel for tourists to share personal experiences and opinions, leading to highly personalized and subjective information.By contrast, UGC on travel e-commerce platforms tends to be more objective, reflecting users' feedback on specific tourism products, though it may exhibit some selection bias.Moreover, social media UGC often includes rich emotional expressions and personal stories, whereas e-commerce UGC is inclined towards functional information.These biases impact how tourists perceive destinations and make decisions.
In summary, studying the differences in tourism destination images between these platforms not only highlights the characteristics and biases of UGC but also provides valuable insights into the tourist decision-making process.These findings can help guide tourism marketing strategies and platform optimization.

Theoretical implications
This study contributes to the literature on tourism destination image in several significant ways.
First, it pioneers the integration of multiple data sources from different online platforms, specifically travel e-commerce platforms and social media.While existing studies have recognized the benefits of using various data sources to measure destination image, they have predominantly relied on UGC from either travel e-commerce platforms (Guo et al., 2021) or social media platforms (Molinillo et al., 2018), but not both.By incorporating UGC from both platform types, our research fills this gap and provides a more comprehensive understanding of tourists' perceived destination image for Sanya.
Second, this study uses a unique combination of methodologies.We used three approaches: (1) the LDA model to extract topics from UGC, representing the tourism destination image; (2) word frequency analysis to identify high-frequency words, followed by manual classification into dimensions and attributes based on a mature tourism destination framework; and (3) the co-occurrence analysis method, enhanced by community detection algorithms, provides a more in-depth understanding of keyword relationships within UGC (Lee and Park, 2023;Qian et al., 2022;Zhong et al., 2023).
This multiple approach enhances the robustness and depth of the analysis, demonstrating the value of combining quantitative and qualitative analyses.
Finally, this study identifies new attributes of Sanya's perceived tourism destination image, including duty-free stores, oceans, special sports, theme parks and recommendations.These attributes, especially those unique to social media, reflect the evolving nature of tourism destination images in the digital age.The findings further suggest that a combination of information technology and manual analysis can yield richer insights, ultimately advancing our understanding of how tourism images are constructed and perceived in the digital era.
In summary, our study makes significant theoretical contributions by broadening the scope of data sources, using a robust methodological framework and identifying new attributes of destination images, all of which enrich the existing literature on tourism destination image.

Practical implications
This study offers important practical implications by highlighting the distinct roles of social media and travel e-commerce platforms in shaping tourism destination images.It provides valuable insights for both destination managers and tourists.
For destination managers, the study suggests two key implications.First, by analyzing feedback from tourists on both platforms, managers can gain a more holistic understanding of tourists' perceptions and feelings about the destination.This comprehensive insight allows for more informed decision-making in destination management.Second, recognizing the differences between social media and travel e-commerce platforms enables managers to tailor their promotional strategies effectively.By understanding the unique characteristics and user behaviors of each platform, they can create targeted marketing campaigns that resonate with each platform's specific audience, thereby enhancing the effectiveness and impacts of their promotional efforts.
For tourists, the study underscores the importance of consulting both travel e-commerce platforms and social media platforms when making travel decisions.Utilizing information from both sources provides a more comprehensive view of the destination, helping tourists make more informed and well-rounded decisions regarding their travel plans.Furthermore, the findings can be generalized to other contexts by applying similar analytical approaches to different destinations.An important practical implication of this study is to consider the differences between platforms when devising destination marketing strategies and tourist communication strategies.Additionally, the potential long-term impacts on destination marketing strategies and tourism behavior include more personalized and effective marketing campaigns, improving tourist satisfaction and destination management practices.

Limitations and future research direction
This study demonstrates the existence of differences in the tourism perceived image of a destination that is derived from UGC on different kinds of platforms (social media and travel e-commerce platforms).Future research may explore the effect of these differences on tourists' travel decisions and destination marketing organizations' decisions.

Figure 1 .
Figure 1.The perplexity value of LDA models

Figure 2 .
Figure 2. The coherence value of LDA models

•
Comfort and cleanliness.It addresses the comfort and cleanliness of hotel rooms, as well as the quality of room service and facilities.•Convenience and accessibility.This topic highlights the eases of access to key locations, such as beaches and attractions like Haitang Bay.• General hotel experiences.It encompasses tourists' enjoyment of amenities like swimming pools, dining options and overall cleanliness.• Family-friendly activities.This topic refers to facilities and activities suitable for families, including playgrounds, water parks and other amusement facilities.•Extracting attributes of tourism destination image from UGC on social media using the LDA model

Figure 3 .
Figure 3.The result of the LDA model on travel e-commerce platform data

Figure 5 .
Figure 5.The co-occurrence matrix diagram of travel e-commerce platform

Figure 6 .
Figure 6.The co-occurrence matrix diagram of social media

Table 1 .
Topics and feature words of LDA model of UGC on travel e-commerce platform Source: By authors

Table 2
details the results of the LDA model applied to UGC on social media platforms, while Figure

Table 2 .
Topics and feature words of LDA model of UGC on social media The result of the LDA model on social media data Hotel recommendations and experiences.The topic covers recommendations and experiences related to hotel accommodations in Sanya, particularly in well-known hotel clusters in Haitang Bay.• Shared experiences and tips.It emphasizes vacation activities, shopping and notable sites like Wuzhizhou Island, as shared by tourists on social media platforms.• Photo-worthy locations.This topic includes recommendations for scenic spots and photo worthy locations in Sanya, such as Yalong Bay, and consider weather conditions.• Relaxation spots.It highlights popular relaxation spots in Sanya, including beaches, islands and comfortable accommodations.• Local foods and special activities.This topic focuses on tourists' recommendations of local foods and special sports activities.

Table 3 .
Topics and feature words of co-occurrence matrix diagram on travel e-commerce platform

Table 4 .
Topics and feature words of co-occurrence matrix diagram on social media Source: By authors consists of eight dimensions and 20 attributes.Meanwhile, UGC on social media platforms reflects nine dimensions and 21 attributes.

Table 5 .
Dimensions and attributes of the perceived destination image of Sanya derived from UGC on travel e-commerce platforms

Table 6
Dimensions and attributes of the perceived destination image of Sanya derived from UGC on social media platforms

Table 7
Summary of data sets and results of data analysis , check-in and check-out processes and detailed service reviews.By contrast, social media platforms highlight overall enjoyment, distinctive experiences and personal recommendation. procedures