Next Article in Journal
Concentrations of Atmospheric Culturable Bioaerosols at Mountain and Seashore Sites
Previous Article in Journal
Survey Results of a Pilot Sleep Study Near Atlanta International Airport
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Network Evolution of a Large Online MSM Dating Community: 2005–2018

1
College of Systems Engineering, National University of Defense Technology, Changsha 410073, China
2
School of Software Engineering, Shenzhen Institute of Information Technology, Shenzhen 518172, China
3
School of Business, Central South University, Changsha 410083, China
*
Author to whom correspondence should be addressed.
Int. J. Environ. Res. Public Health 2019, 16(22), 4322; https://doi.org/10.3390/ijerph16224322
Submission received: 12 October 2019 / Revised: 2 November 2019 / Accepted: 4 November 2019 / Published: 6 November 2019
(This article belongs to the Section Health Behavior, Chronic Disease and Health Promotion)

Abstract

:
Due to multiple sexual partners and low rates of condom use, the HIV infection rate among MSM (men who have sex with men) is much higher than that of the general population. In order to analyze the characteristics of online activities of MSM, and to understand the evolution of their social networks, in this study we collect a comprehensive dataset, covering the period from January 2005 to June 2018, from the largest Chinese online community, Baidu Tieba. We build an online dating network for MSM-related individuals in the gay-bar community, and analyze the network from static and dynamic aspects. It is found that there is a strong homophily regarding the cities where users reside when developing interactions with others, and that most network measurements tend to be stable at the later stages of evolution, while the size of the largest community fluctuates. This is an indication that the network is formed of rapidly flexible interactions which changes quickly. In comparison with studies on heterosexual networks, we find that the MSM dating network shows differences in many aspects, such as the positive degree-degree correlation and high clustering coefficient, suggesting different thinking and measures should be taken in the policy making of public health management towards the MSM population.

1. Introduction

By 2019, 32 countries and regions around the world had legalized same-sex marriage [1,2]. However, same-sex marriage is still not legally recognized in many countries, including China. Influenced by longstanding traditions such as ‘carrying on the family name’, homosexuality is still a sensitive issue in many countries or regions. In the Asia Pacific region, the sexual minority population is often one of the most marginal and vulnerable groups in society [3,4]. The MSM population refers to men who have sex with men [5,6,7], gay men, and some male bisexuals who have exhibit homosexual behavior, who currently account for approximately 5% of the total population in China [8,9]. Because MSM have multiple sexual partners and a low rate of condom use, the HIV epidemic and incidence among MSM are much higher than those among the general population [10]. HIV epidemics among MSM have continued to rise at an alarming rate in recent years [11,12]. Although recent statistics are not available, it was reported that the proportion of MSM with yearly reported new HIV/AIDS cases increased by more than 10 times, that is, from 2.50% in 2006 up to 28.25% in 2015 [13,14]. If the incidence of HIV cases is not properly controlled, the MSM population could easily become a medium of HIV transmission, prompting the spread of the virus more widely among the general population. Therefore, analyzing the characteristics of the dating activities of MSM, and understanding the social relationships in them, plays a crucially guiding role to establish and improve the health management mechanism for the MSM population.
Because of concerns about sensitivity and privacy, research data on hidden populations is generally difficult to access [15,16]. Moreover, the authenticity and accuracy of reporting data also challenge the validity and reliability of studies [17,18]. With the development of the internet and big data technology, people’s lives are increasingly inseparable from the internet. Individuals are able to carry out various social activities in internet communities, generating a huge amount of activity data and providing a new breakthrough for studies with traditional challenging issues. Massive amounts of internet open resource data have been increasingly used in the analysis and research of hidden populations [19,20,21,22]. Due to the existence of social discrimination, MSM lack reliable channels for making friends in the real world, and they are more dependent on social networking tools to find suitable friends or partners [23,24,25]. The anonymity of social networks has brought a good sense of security to this hidden population. Statistics in many studies have shown that the MSM population is very active online, far more than heterosexuals in seeking relationships [26,27,28]. Therefore, analyzing the dating characteristics and romantic relationships of MSM on the internet is an important perspective for understanding the MSM population.
In recent years, many studies have attempted to combine complex network technology with research in the online community of the internet to solve new problems [29]. For example, by using the community detection method of complex networks, Bruch and Newman studied the structure of heterosexual markets in the United States [30]. Holme et al. used the time series network analysis method to find out the structure and time evolution of a Swedish internet community, which is primarily intended for romantic communication [31]. Liu and Lu analyzed a popular HIV-themed online community in China, and found that the average topic similarity among members in HIV communities is positively correlated to network efficiency [32]. On the internet, MSM connect with each other every day in related communities, causing new interactions to be built, and there are new users joining. As a result, analyzing the online dating characteristics of MSM can be transformed into research on the user interaction network. However, there are few studies on the online interacting network of MSM, most of which analyzed only static characteristics [33,34,35]. Research on the long-term evolutionary characteristics of MSM dating networks is rare.
To fill in this gap of knowledge, we collected a comprehensive dataset in this study, covering the period from January 2005 to June 2018, from the largest sub-community related to MSM topics, gay-bar, in the world’s largest Chinese community (i.e., Baidu Tieba). We built the online interacting network for users on gay-bar, to analyze the romantic relationships of a Chinese MSM population on the internet. We carried out the analysis from static and dynamic aspects (Figure 1). We analyzed the static features of an online dating network for MSM—including nodal degree, edge weight, network communities, homophily, etc.—to investigate the social characteristics of the MSM population in the internet community. We also analyzed the evolution pattern [36,37] of the MSM online dating network, to develop an in-depth understanding of the changes in the online interaction networks of the MSM population in China over a 13-year period. In this study, we find out the interaction features as well as the evolution characteristics of the online MSM group, which is helpful to understand their social behavior, and essential for establishing and improving the health management mechanism on the MSM population.

2. Data and Methods

With more than 1 billion registered users and over 300 million monthly active users, Baidu Tieba [38] is the largest online Chinese community in the world. It is a user communication platform based on users’ interest in millions of different topics, gathering massive numbers of users with common interests. These users who share the same hobby or focus on the same topic join subcommunities (called ‘bars’) for discussion and networking. To date, there are 22,736,986 subcommunities for different topics on Baidu Tieba, covering entertainment, games, novels, regions, life, and other various aspects. Because the bars attract a large number of accurate and segmented user groups, many popular bars have a great research value for analysis of a specific population, especially hidden populations with strong concealment, such as MSM, people with AIDs, sex workers, etc.
Gay-bar is the largest sub-community related to MSM on Baidu Tieba, on which millions of MSM-related users post their self-portraits, daily lives, viewpoints, etc., and communicate with each other. As of June 14, 2018, gay-bar had 4.67 million followers, and the cumulative number of posts had reached 300 million. The massive user data generated in this open community is of great significance for analyzing the characteristics of the MSM population in China. In this study, we developed a web crawler to collect from the latest to the oldest possible posts, as well as all following comments and replies under these posts, for a total of 13,012,892 records. All the data was distributed from January 29, 2005, to June 14, 2018.
Users’ activity data consists of the main posts, following comments, and replies. Posting on gay-bar refers to the user creating a new post with a specific title and specific content on a topic, called the main post. After the main post is created, its title and brief content are posted on the gay-bar homepage and are available for everyone to see. The gay-bar homepage refreshes and tops the latest posts. Then users can click on the links in the main posts to enter the detail pages, on which users can follow the post by commenting or replying to others. User data is related to users who have been active on gay-bar, mainly including the personal information that users fill in to register, as well as users’ activity information collected by Baidu Tieba, including the user’s name, gender, city, longitude, latitude, favorite community, etc.
There are mainly three types of social behaviors on gay-bar: posting, commenting, and replying. With the users as nodes, and the interactions between users as edges, we can construct the social network of this platform. Because the main purpose for users of gay-bar is making friends and seeking partners, this social network is actually a dating network. Since the first user joined gay-bar in 2005, new users and interactions have been added to the network accumulatively. By the end of the data collection period, the network contained 807,591 nodes and 12,537,280 edges. Each edge represents a unique contact between two users, with a unique ID and a timestamp indicating the day the contact was generated [39]. All edges are directed. With such data, we can focus on the evolution of the dating network on a daily basis and record the changes. Based on the initial network on the first day (January 29, 2005), new nodes and new edges of each day can be added to the initial network, in turn forming the next day’s dating network. Finally, we construct a total of 4,885 accumulated daily networks.

2.1. Ethics Approval and Consent to Participate

The study and Liu (2018) [32] were both supported by the Natural Science Foundation of China (91546203, 71771213) and approved by the Medical Ethical Committee of the Institutional Review Board (IRB) at Peking University (IRB00001052–16016). The study does not involve any physical, social or legal risks to the participants, and the data is anonymous, confidentiality of the participants’ information is strictly protected.

2.2. Data Availability

All data analyzed in this study are publicly available, all posts in the datasets can be collected on the website of Baidu Tieba (https://tieba.baidu.com/f?kw=gay). Other data that support the findings in this study are available from the corresponding author on reasonable request.

3. Results

3.1. Static Network Analysis

3.1.1. Basic Network Statistics

We build the MSM dating network on gay-bar with users as nodes, and interactions between users as edges. In addition, we can use the number of interactions as the weight of the edges between each pair of users. A total of 5,132,933 weighted edges were found in the final dating network (network in the last day of the data collection), that is, 5,132,933 unique node pairs in which one node has interacted with another. In other words, the static analyses are all based on this directed weighted network. With calculations, we find the diameter of this dating network is equal to 12; that is, any two nodes on gay-bar need to go through 12 nodes at most to connect with each other. Moreover, the average shortest-path length is calculated as well, and we find the length of the gay-bar network is 4.23, which is consistent with the result that the average shortest-path lengths of many real-world networks are small [40,41]. Summaries of network statistics are presented in Table 1.

3.1.2. Degree Distribution and Number of Posts

The accumulated network has an average degree of 6.36; that is, on average, users in the gay-bar community interact with 6.36 others during the 13-year period. The degree distribution shows high heterogeneity, as has been found in many other large human social networks. It follows a truncated power law distribution (Figure 2A). It can be found that there are only 3 nodes with a degree greater than 10,000 in the dating network, and only 16,863 nodes with a degree greater than 100, accounting for approximately 2.1% of all nodes in this network. That is, there are only a very few nodes with relatively large degrees, and most nodes’ degrees are small, which means that interactions with other users are not frequent.
This finding corresponds to the posting characteristics of gay-bar users. The distribution of the number of user posts is shown in Figure 2B. As can be seen, because there are some ‘celebrity’ nodes on gay-bar, a few users post many times; e.g., user 1144404839 made 15,875 postings. Thus, this user’s chance of interacting with other users is greatly increased, causing them to be the nodes with large in-degrees in the network. In addition, some nodes are particularly positive in communications with other users, resulting in their large out-degrees in the network, such as user 910803194. All these ‘celebrity’ nodes and active nodes are responsible for the polarization of the nodes’ degrees. In contrast, most users do not post a lot (more than 53.8% of users posted fewer than 1,000 times, but the maximum number of user postings is larger than 1,000,000). We find that is because most users hang out on gay-bar only sometimes when they are free, and participate only in topics in which they are interested. This phenomenon also explains why the degrees of most nodes are very small in the dating network. Furthermore, we analyze the correlation between the degree and the number of posts, and find that the Spearman correlation coefficient (as s ) between these two indexes is 0.714.

3.1.3. Distribution of the Edge Weights

Figure 2C shows the distribution of the edge weights, which basically obeys a power law. The weight on the edges spans from 1 to 9,887, with the majority of edges containing only one interaction time (58.4%), and those with weights of less than 10 account for 98.1% of all edges. That is, most of the interactions between users are casual and infrequent, because most users interact occasionally with others only when they show great interest in the same topic at the same time, and have not established long-term stable friendships and frequent interactions. There is a positive correlation between the degree of the node and the sum of the weights on all his connected edges (the Pearson correlation coefficient, r = 0.939), indicating that users with more friends tend to be more active in interactions.

3.1.4. Homophily

In early studies, it was discovered that most people are more inclined to make friends with those who are similar [44,45]. Moreover, in the recent exploration of network homophily, studies also illustrated that people prefer to make friends with people who have the same interests or hobbies [46,47,48]. It is believed that the same geographic location is an important factor for people’s social relationships, such as making friends or seeking partners [30,49].
We extract the user’s location information (either from the profile or the opt-to-public GPS location containing the latitude and longitude) as a new attribute of nodes in the network, named loc. We analyze whether the loc of the nodes has an effect on the formation of links between two different nodes. That is, we examine whether the nodes linked by the same edge are more likely to have the same loc rather than randomly. We use homophily to measure this probability, which is quantified as [50,51]
H l o c i = S l o c i l o c i P l o c i 1 P l o c i ,
where H l o c i is the homophily for nodes with property l o c i , S l o c i l o c i is the proportion of type l o c i l o c i links among all links originating from type l o c i nodes, and P l o c i is the proportion of type l o c i nodes in the network. Consequently, H l o c i measures the exceeded likelihood of establishing interactions with users with the same location attribute ( l o c i ) for nodes of type l o c i . H l o c i = 0 means that all nodes interact with other nodes randomly and regardless of the l o c i attribute. H l o c i =1 implies that all nodes with the l o c i attribute are connected to friends with the same property. To measure the global homophily of the network, we use the average of all H l o c i , represented as
H = i = 1 k H l o c i k ,
where k is the number of all unique loc.
With the definitions, we then measure the homophily of the MSM dating network in terms of the cities and provinces in which the users reside. The homophily for city is 0.356, indicating that, in addition to interacting with other users randomly, there is a 35.6% surplus of tendency for users to build interactions with people from their own city. The homophily for province is 0.478, greater than that of city, implying there is a 47.8% surplus of tendency for users to build interactions with people from their own province except random connections. These results suggest that although the internet has made people closer and brings a lot of convenience to daily communication, in the male homosexual dating network, similar geographic location is still an important condition considered by users for making friends and seeking partners. In addition, geographic homophily is more likely to promote the formation of interactions between users in the MSM community, which is consistent with the situation we observed in the content of user posts on gay-bar. We find that many posts are related to making friends and looking for a partner, where most users specifically mention their location, and put forward the strong desire to find male friends in the same region. For example, a popular post recently published on gay-bar states: “I’m 18 years old and in Harbin (capital city of Heilong Jiang Province), I’m here to look for a partner of type zero (the relatively feminine role in the MSM relationship).” Many users commented on his post, some indicating that their location is also Harbin. After the author interacted frequently with a commenter in the same location, both expressed their clear intention to become friends. We speculate that it is because the same geographic location is more conducive to meeting and dating internet pals for MSM, which is helpful for developing relationships on the internet that become romantic relationships in off-line real life, and more in line with users’ original intention and expectations for gay-bar. We also find in many posts many users made it clear that they did not want to maintain a virtual online relationship.
Furthermore, this discovery illustrates that users with the same geographic location are more likely to maintain close communication, thus forming a ‘friendship’, than maintain a long-term frequent interaction. However, only a few users can develop into this stable relationship, which explains why the number of edges with large weights is small. In most cases, in the MSM dating network, most users follow, comment, or reply to others only on certain topics (or posts) for the same interests; these social behaviors do not establish stable attention and interactions between users. Only a small number of users form in-depth relationships based on frequent interactions, and vice versa, close relationships create longer-term and frequent interactions.

3.1.5. Community Analysis of the MSM Dating Network

In this study, we use the Infomap algorithm [52,53], which is one of the most efficient, reliable, and accurate community detection methods [54,55], to detect the community structure in the MSM dating network. A total of 44,076 communities are found. The distribution of the communities’ sizes (the number of nodes in each community) is shown in Figure 3A. There are 812 communities (1.9%) with more than 100 nodes, and only 5 communities with more than 1,000 nodes. The number of nodes in 53.5% of the communities is less than 10. This result shows that the social groups in the MSM dating network are relatively small, and the relationships built by online interactions are relatively weak and non-stable. On gay-bar, users mainly pay attention to the posts they are interested in, and communicate only with others with the same purpose or characteristics, which cause most interactions in the network to be sparse and inconsistent.
To further understand the communities in the gay-bar network, text analysis is performed on the contents posted by users in each community. We use the latent Dirichlet allocation (LDA) model [56,57] to analyze the topics in the communities. We set each community to contain 10 main topics, each of which is represented by the top 20 keywords with maximum weights. Most keywords in these communities tend to be categorized into four classes: First, we find that the keywords in many communities include words referring to location (such as "Hunan", "Wuhan", "Chongqing", etc.), and number strings indicating users’ contact information (most are accounts on QQ or WeChat, and phone numbers). In addition, some common words that express personal information (such as “height”, “weight”, “photo”, “body”, etc.), as well as specific words that suggest users’ sexual orientation and roles, such as “top” and “bottom” are also frequent in the communities. According to the characteristics of the users’ postings on gay-bar, most of these words are used in the scene of MSM seeking friends and partners. Second, many keywords in the communities are related to games, stars, and sexual behaviors, mainly in daily chats among users. In addition, in many communities, numerous topic words are related to going to school, such as “university”, “student”, “school”, “high school”, etc. For example, “Hainan University” and “Donghua University” have high weights in community 792 (the ID of the community is 792), and “Xiamen University” in community 831 also shows a high weight, which indicates that a large portion of the MSM who hang out on gay-bar are students in school. This is consistent with the statistics published by recent studies that in China, 8.5% of male university students are MSM [58]. In this population, an increasing number of new HIV cases are reported: In 2016, 8 out of 10 HIV-infected Chinese university students were MSM [59]. Last, in the topic keywords of some communities, the frequency of “AIDS” and “HIV” is high as well, such as community 1126. This result suggests that HIV is still a common and anxious concern of MSM in China. Furthermore, it is also possible that some MSM on gay-bar have faced this condition or contracted the syndrome. The topic keywords in community 792 and community 1126 are shown in Figure 3B. As we can see, analysis of users’ contents in different communities helps us better understand the characteristics of the communities in the dating network, and discover the topics that are the daily focus and are discussed by this MSM online network.
The left graphic is keywords in community 792, and the keywords (translated into English) sorted by weight from large to small are warm, reply, poster, panties, wanna, live broadcast, Nanchang (city name), happy, naive in relationships, short night, family, hard, south, the more the better, do one’s best, college of engineering, video, Yang Mi (a popular actress), romantic, soul, white, university, the public, romantic novel, handsome guy, etc.
The right graphic is keywords in community 1216, and the keywords (translated into English) sorted by weight from large to small are hiv, reply, how long, poster, qq (a popular Chinese instant messaging software), Gansu (province name), silently, Pingliang (city name), same job, boy, lesbian, afraid, forever, draw eyebrows, eyeliner, close, self-sacrifice, bottom, college, Aids, hide, Lanzhou (city name), clavicle, bed, dentist, WeChat (a social application), make love, lymph, happy birthday, onlookers, Mid-Autumn Festival, etc.

3.2. Temporal Evolution Pattern of the MSM Dating Network

3.2.1. Dynamics of Nodes and Edges

As time passes, the number of nodes and edges in the network increase, as shown in Figure 4. New users gradually joined the network at the beginning of stages: From April 27, 2005, to December 31, 2014, the number of users and edges reached 212,522 and 1,557,572, respectively. However, since 2015, the community has expanded rapidly with an average increase of 472 new nodes and 8707 edges each month. This upward trend is similar to the dynamics of the average degree (Figure 4C). However, we can see that there is a considerable decrease in the average degree after a sharp climb at the beginning of the network evolution, because increasing nodes make the density of the edges in the network smaller. The initial increase is perhaps because of the new users’ curiosity, making them engage in many social activities on gay-bar. Since then, the average degree of the network gradually increased, followed by a steep rise after 2015. Until the second half of 2016, the network average degree reaches around 14, and then continues to maintain a slow upward trend.
To figure out the change pattern in the number of friends of nodes, the number of interactive friends per year (i.e., the degrees in the separate networks by year, not the cumulative network) for all users is calculated. We can see that the degree growth rates of most users are always greater than 0, shown in Figure 5A, which illustrates that the number of friends for most users keeps growing throughout the period. A positive correlation between the average of the degree growth rates (Figure 5B) and the node number growth rate in separate networks is discovered (r = 0.939), suggesting that the faster the number of nodes grows, the faster the number of friends for users grows. In addition, the annual change in user friends is analyzed. The proportion of different friends from the previous year for the same user is used to measure the rate that friends change. We find that the changing friend rates of most users are very high, approaching 1.0, indicating that for the majority, their communication objects tend to change by the next year. Most communications do not establish stable and continuous interacting relationships. However, this situation changed in the last few years. More and more users keep their friend lists with a smaller rate of change. As we can see from Figure 5C, the average changing friend rate generally maintains a downward trend, from 0.978 in 2006 to 0.757 in 2018. We find that there is a negative correlation between the average changing friend rate and the number of nodes (s = −0.901). The more nodes in the network, the smaller the probability that the users change the friends with which they interact.

3.2.2. Dynamics of the Giant Strongly and Weakly Connected Components

Figure 6 shows the evolution of the giant strongly connected component (GSCC, in which there is a directed path from each node to all the other nodes) and the giant weakly connected component (GWCC, in which each node can either reach, or be reached by nodes from the GSCC) [60] in the MSM dating network, respectively. The GSCC and the GWCC grow slowly during the early stage, while the development trends of the proportions of the two components to the network show differences. The proportion of the GSCC quickly drops to around 0 and then maintains a gradual upward trend, while that of the GWCC climbs steeply to around 0.80, then falls to 0.2, followed by a rapid increase (consistent with the trend of the average degree at the early stage). After June 2012, the growth rates of both sizes of the GSCC and the GWCC begin to improve, the same as the proportion of the GSCC. Finally, the GSCC’s proportion stays at around 0.2, and the GWCC stays at around 1.0, while the sizes of the two components show faster growth, which is consistent with the increase trend of the number of the nodes and edges.

3.2.3. Shortest-Path Length

The average shortest-path length rises sharply before January 2006 (Figure 7A), and since then, it exhibits a ladder-type growth when the number of nodes and edges in the network is still relatively small. In April 2009, the length reaches the peak of the shortest-path length, that is, 6.73. The proportion of the giant weakly connected component is just in the fastest-growing period at this time. Thereafter, the average shortest-path length decreases gradually to about 4.20, and tends to be relatively steady, in which the number of nodes and edges in the network starts to soar. This result could suggest that there are denser interactions promoting users to be closer, although new nodes are still being added continuously. When the MSM dating network becomes more robust after 2015, the addition of new nodes and edges has less impact on the connectivity of the network, and the average shortest-path length remains at a small and stable value. This result is in line with the conclusion that the average shortest-path lengths of various human social networks are short.

3.2.4. Clustering

The transitivity, or clustering coefficient [61], measures the probability that two neighbors of a vertex are connected, that is, the ratio of the triangles and connected triplets in the network,
C = c ( 3 ) p ( 3 ) ,
where c(3) represents the number of triangles actually present in the network, and p(3) represents the number of the largest possible triangles in the network.
For the MSM dating networks formed on different days, when the number of nodes and edges is small during the initial stage of the evolution, the clustering coefficient rapidly rises from 0 to the maximum value, 0.12. However, as the size of the network continues to increase, the clustering coefficient of the network begins to decrease, and finally reaches a stable value of 0.01. This indicates that even with the continuous increase in the nodes and edges, the ratio of triangles in the network remains basically stable to a certain value, and the clustering degree of the network does not change much during the later period. In other words, in the MSM dating network, the actual triangle accounts for 1% of all possible triangles. However, in a community populated largely by heterosexual individuals [31], the number of triangles would be much lower. Therefore, it can be claimed that the transitivity of a homosexual dating network, such as the dating network on gay-bar, is higher than that of a heterosexual network.

3.2.5. Assortativity

For understanding the connection preference among MSM-related users in different periods, and further interpreting the tendency in seeking friends when Chinese MSM hang out online, we analyze the evolution pattern of the assortativity [62] of the MSM dating network. We calculate all the degree-degree correlations of the daily networks, and explore whether there is always considerable assortativity or disassortativity in different stages. In this study, the degree-degree correlation coefficient (R) proposed by Newman [63] is used to measure the network assortativity, quantified as:
R = k t o k f r o m k t o k f r o m k t o 2 k t o 2   k f r o m 2 k f r o m 2 ,
in which . denotes the average over arcs, k t o is the degree of the vertex from which the edge leads, and k f r o m is the degree of the vertex from which the edge starts. When R is greater than 0, it means that the MSM dating network is assortative, indicating nodes with large degrees in the network are more likely to generate connections with each other, and vice versa. That is, ‘celebrity’ or popular users on gay-bar tend to interact with users who are also famous or popular. Moreover, the absolute value of the coefficient represents the degree of the assortativity or disassortativity of the network: the more positive the coefficient, the more significant the assortativity; and the more negative the coefficient, the more significant the disassortativity.
The assortativity coefficients of the dating networks at different time are shown in Figure 7C. The coefficients are always greater than 0 with the network evolution. That is, the MSM dating network is always assortative, and nodes with large degrees tend to produce links with nodes that also have large degrees. At the beginning of the evolution, the network’s assortativity coefficient is equal to 1. It is because the number of users on gay-bar is small at that time, and the nodes are connected to each other, which lead to similar degrees among nodes. Then, with the continuous evolution, the number of nodes in the network continues to increase, and the assortativity coefficient gradually declines, and finally tends to a stable value, 0.22. In addition, the assortativity coefficients in different periods are always greater than 0; that is, the network is always assortative. The links in the network have a tendency to be created between nodes with a similar degree, which also suggests that the users on gay-bar are more inclined to communicate with users who are similar to them. Furthermore, MSM always prefer to seek similar friends or partners.
This assortativity of the MSM dating network is consistent with the classic scientific cooperation network and the actor cooperation network, but different from the online social networks for heterosexual romantic relationships [31]. Heterosexual social networks display a significant disassortative mixing for all types of degree-degree correlations [64,65]. This indicates that although all the networks are designed for romantic relationships on the internet, the online homosexual network and the heterosexual network show different characteristics: MSM have a tendency to look for friends and partners with similar degrees (popularity or personality) in an online dating network. However, in heterosexual relationships, there is disassortativity shown in the network, and nodes with larger degrees tend to establish connections with nodes that have fewer degrees; that is, popular users tend to look for heterosexual friends and partners with less popularity. Furthermore, it can be speculated that in homosexual romantic relationships, similarity is more important, but the difference or complementarity is more attractive in heterosexual relationships.

3.2.6. Community

Analyzing the communities in the MSM dating network with time evolution is helpful to understand the social structure of MSM-related groups in different periods. For the 3696 networks formed on different days, the Infomap algorithm [52,53] is used to detect communities, and the number of communities in each network can be calculated. The number of communities in the daily networks is shown in Figure 8A. Over time, the number of communities maintains an upward trend, and the growth rate in 2015 is obviously improved, which is similar to the growth pattern in the number of nodes and edges.
We also analyze the size of the largest community in each daily network, as shown in Figure 8B. Surprisingly, it does not increase continuously, which is considerably different from other network indexes, such as the number of communities and the number of nodes. The size of the largest community fluctuates considerably after 2015, while the number of communities and nodes keeps climbing rapidly. This result suggests that during the later evolution period, the largest community continues to experience decomposition, integration, and reconstruction, which causes the size of the largest community to rise and fall. Through analyzing the relationship between the size of the largest community and the size of the network, we find that there is no direct correlation. More nodes in the network do not mean more or fewer nodes in the largest community. Furthermore, we try to explore the member flow in communities during the evolution period. Figure 8C describes the flow of nodes in the 10 largest communities. We can see that the node flows between different communities are frequent, and members in communities always change, indicating that the social circles of most users are not stable. This is also evidence of the flexible, rapidly changing, casual, and non-stable interactions in the MSM dating network.

4. Discussion and Conclusions

In this study, we collect the oldest to the newest possible activity data from the largest sub-community for MSM, gay-bar, in the largest Chinese community (i.e., Baidu Tieba), and construct an MSM dating network by figuring out the online interaction relationships between users. We also analyze the static characteristics of the MSM dating network, and the dynamic evolution pattern over the past 13 years, to explore the social features of the online MSM in China, as well as the changes.
We find that the nodal degree and the number of posts obey a power law-like heterogenous distribution, and that there is a positive correlation between them (s = 0.714). The weights on most edges are small, suggesting interactions between most users are casual and infrequent. The sum of the weights on connected edges also shows a positive correlation with the degree of the node (r = 0.939). To figure out whether the location is an important factor that affects online MSM making friends and seeking partners, the geographic homophily in the MSM dating network is calculated. We find that there is a strong homophily regarding the regions where the users reside when developing interactions with others (the homophily for province is 0.478). About communities in the network, it is found that most communities are small, indicating online MSM in China prefer small-scale social relationships. Most topic words in these communities can be categorized into four classes: personal information (including location and contact information), daily charts, school, and HIV (or AIDS).
To further explore the dynamic characteristics of MSM over time, we analyze the time evolution pattern of the MSM dating network, and construct the daily cumulative networks from January 29, 2005, to June 14, 2018. This network changes rapidly; the nodes and links are not stable over time. For most users, the number of interacting friends per year keeps growing. The change rate of their friends is very high, showing a negative correlation with the number of nodes in annual networks (s = −0.901). This reveals that the more nodes in the network, the more stable the friends of users. In addition, after a few changes during the previous period, the clustering coefficient levels off at around 0.01, which is higher than the transitivity of the online heterosexual social network. However, the assortativity of the MSM dating network also is quite different from that of the online heterosexual social network. The assortativity coefficient of the MSM dating network is always a large positive value, and remains at around 0.22 at the last period of evolution, while that of the latter is always less than 0, showing obvious disassortativity. Finally, we find that the number of communities increases over time, while the size of the largest community always fluctuates in the later period. Member flows between communities are frequent, which is an indication that the network is formed of rapidly flexible interactions that change quickly.
By analyzing the online dating network of the MSM-related population in China, this paper tries to figure out the static characteristics, as well as the dynamic evolution patterns, of the MSM dating network, to understand the social features of online MSM and the changes in the past 13 years. We found that the male homosexual social network shows differences with online heterosexual social networks in many aspects, such as the transitivity and assortativity. This result reveals the distinct evolving patterns of homosexual social networks in comparison to heterosexual social networks, and implies that decision makers need to adopt different perspectives and methods for development of public health policies targeting MSM. In addition, HIV is still a common and anxious concern of MSM in China. There should be more intervention strategies [66,67] in the HIV transmitted infections among the MSM population, such as identifying and training the most influential individuals (or opinion leaders) [68] within the MSM interacting network, or facilitating peer influence [69] to promote target health behaviors.

Author Contributions

Methodology, X.L.; Data analysis, C.L.; Funding acquisition, X.L.; Writing—original draft, C.L.; Writing—review and editing, X.L.

Funding

Xin Lu acknowledges the National Natural Science Foundation of China (71771213, 71790615, and 91846301) and the Hunan Science and Technology Plan Project (2017RS3040, 2018JJ1034). Chuchu Liu is partially supported by the National Natural Science Foundation of China (71690233, 91546203, 71774168).

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Ofosu, E.K.; Chambers, M.K.; Chen, J.M.; Hehman, E. Same-sex marriage legalization associated with reduced implicit and explicit antigay bias. Proc. Natl. Acad. Sci. USA 2019, 116, 8846–8851. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  2. Erickson, T. Legalization of Gay Marriage-United States Supreme Court Declares It Unconstitutional for States to Deny Same-Sex Couples the Right to Marry: Immediate Impact of Gay Marriage in North Dakota. NDL Rev. 2015, 91, 219. [Google Scholar]
  3. Veronese, V.; Clouse, E.; Wirtz, A.L.; Thu, K.H.; Naing, S.; Baral, S.D.; Stoové, M.; Beyrer, C. “We are not gays… don’t tell me those things”: engaging ‘hidden’ men who have sex with men and transgender women in HIV prevention in Myanmar. BMC public health. 2019, 19, 63. [Google Scholar] [CrossRef] [PubMed]
  4. Cao, H.; Zhou, N.; Fine, M.; Liang, Y.; Li, J.; Mills-Koonce, W.R. Sexual minority stress and same-sex relationship well-being: A meta-analysis of research prior to the uS Nationwide legalization of same-sex marriage. J. Marriage Fam. 2017, 79, 1258–1277. [Google Scholar] [CrossRef]
  5. Chu, Z.; Xu, J.; Zhang, Y.; Zhang, J.; Hu, Q.; Yun, k.; Wang, H.; Jiang, Y.; Geng, W.; Shang, H. Poppers use and Sexual Partner Concurrency Increase the HIV Incidence of MSM: a 24-month Prospective Cohort Survey in Shenyang, China. Sci. Rep. 2018, 8, 24. [Google Scholar] [CrossRef]
  6. Jie, W.; Ciyong, L.; Xueqing, D.; Hui, W.; Lingyao, H. A syndemic of psychosocial problems places the MSM (men who have sex with men) population at greater risk of HIV infection. PLoS ONE 2012, 7, e32312. [Google Scholar] [CrossRef]
  7. Berghe, W.V.; Nöstlinger, C.; Hospers, H.; Laga, M. International mobility, sexual behaviour and HIV-related characteristics of men who have sex with men residing in Belgium. BMC Public Health. 2013, 13, 968. [Google Scholar]
  8. Guanghua, L.; Yi, C.; Shuai, T.; Zhiyong, S.; Zhenzhu, T.; Yuhua, R.; Yousuf, M.A.; Wensheng, F. HIV, syphilis and behavioral risk factors among men who have sex with men in a drug-using area of southwestern China: Results of 3 cross-sectional surveys from 2013 to 2015. Medicine. 2018, 97, e0404. [Google Scholar] [CrossRef]
  9. Chow, E.P.F.; Wilson, D.P.; Zhang, J.; Jing, J.; Zhang, L. Human immunodeficiency virus prevalence is increasing among men who have sex with men in China: findings from a review and meta-analysis. STI 2011, 38, 845–857. [Google Scholar] [CrossRef]
  10. Beyrer, C.; Baral, S.D.; Van Griensven, F.; Goodreau, S.M.; Chariyalertsak, S.; Wirtz, A.L.; Brookmeyer, R. Global epidemiology of HIV infection in men who have sex with men. Lancet 2012, 380, 367–377. [Google Scholar] [CrossRef] [Green Version]
  11. A Joint Assessment of HIV/AIDS Prevention, Treatment and Care in China. Available online: http://data.unaids.org/una-docs/china_joint_assessment_2003_en.pdf (accessed on 22 July 2019).
  12. Qi, J.; Zhang, D.; Fu, X.; Li, C.; Meng, S.; Dai, M.; Liu, H.; Sun, J. High risks of HIV transmission for men who have sex with men—a comparison of risk factors of HIV infection among MSM associated with recruitment channels in 15 cities of China. PLoS ONE 2015, 10, e0121267. [Google Scholar] [CrossRef] [PubMed]
  13. Wong, H.T.H.; Tam, H.Y.; Chan, D.P.C.; Lee, S.S. Usage and acceptability of HIV self-testing in men who have sex with men in Hong Kong. AIDS Behav. 2015, 19, 505–515. [Google Scholar] [CrossRef] [PubMed]
  14. Ncaids, N. Update on the AIDS/STD epidemic in China and main response in control and prevention in December, 2015. Chin. J. AIDS STD. 2016, 22, 69. [Google Scholar]
  15. Lu, X. Respondent-Driven Sampling: Theory, Limitations & Improvements; Inst för folkhälsovetenskap/Dept. of Public Health Sciences: Stockholm, Sweden, 2013. [Google Scholar]
  16. Lu, X.; Malmros, J.; Liljeros, F.; Britton, T. Respondent-driven sampling on directed networks. Electron. J. Statist. 2013, 7, 292–322. [Google Scholar] [CrossRef] [Green Version]
  17. Lu, X.; Brelsford, C. Network structure and community evolution on twitter: human behavior change in response to the 2011 Japanese earthquake and tsunami. Sci. Rep. 2014, 4, 6773. [Google Scholar] [CrossRef]
  18. Lu, X.; Bengtsson, L.; Britton, T.; Camitz, M.; Kim, B.J.; Thorson, A.; Liljeros, F. The sensitivity of respondent-driven sampling. J. R. Stat. Soc. Ser. A. Stat. Soc. 2012, 175, 191–216. [Google Scholar] [CrossRef]
  19. Pilon, R.; Leonard, L.; Kim, J.; Vallee, D.; Rubeis, E.D.; Jolly, A.M.; Wylie, j.; Peludel, .; Sandstrom, P. Transmission patterns of HIV and hepatitis C virus among networks of people who inject drugs. PLoS ONE 2011, 6, e22245. [Google Scholar] [CrossRef]
  20. Baltar, F.; Brunet, I. Social research 2.0: virtual snowball sampling method using Facebook. Internet Res. 2012, 22, 57–74. [Google Scholar] [CrossRef]
  21. Coursaris, C.K.; Liu, M. An analysis of social support exchanges in online HIV/AIDS self-help groups. Comput. Human Behav. 2009, 25, 911–918. [Google Scholar] [CrossRef]
  22. Macapagal, K.; Coventry, R.; Puckett, J.A.; Phillips, G. II; Mustanski, B. Geosocial networking app use among men who have sex with men in serious romantic relationships. Arch. Sex. Behav. 2016, 45, 1513–1524. [Google Scholar] [CrossRef]
  23. Bien, C.H.; Best, J.M.; Muessig, K.E.; Wei, C.; Han, L.; Tucker, J.D. Gay apps for seeking sex partners in China: implications for MSM sexual health. AIDS Behav. 2015, 19, 941–946. [Google Scholar] [CrossRef] [PubMed]
  24. Young, L.E.; Michaels, S.; Jonas, A.; Khanna, A.S.; Skaathun, B.; Morgan, E.; Schneider, J.A. Sex behaviors as social cues motivating social venue patronage among young black men who have sex with men. AIDS Behav. 2017, 21, 2924–2934. [Google Scholar] [CrossRef]
  25. Rice, E.; Holloway, I.; Winetrobe, H.; Rhoades, H.; Barman-Adhikari, A.; Gibbs, J.; Carranza, A.; Dent, D.; Dunlap, S. Sex risk among young men who have sex with men who use Grindr, a smartphone geosocial networking application. J. AIDS Clinic. Res. 2012. [Google Scholar] [CrossRef]
  26. Bolding, G.; Davis, M.; Hart, G.; Sherr, L.; Elford, J. Gay men who look for sex on the Internet: is there more HIV/STI risk with online partners? Aids 2005, 19, 961–968. [Google Scholar] [CrossRef] [PubMed]
  27. Shiyao, X. Analysis of HIV high-risk population characteristics with Baidu Tieba data. Big Data Res. 2019, 5, 98–108. [Google Scholar]
  28. Wu, Z.; Xu, j.; Liu, E.; Mao, Y.; Xiao, Y.; Sun, X.; Liu, Y.; Jiang, Y.; McGoogan, J.M.; Dou, Z.; et al. HIV and syphilis prevalence among men who have sex with men: a cross-sectional survey of 61 cities in China. Clin. Infect. Dis. 2013, 57, 298–309. [Google Scholar] [CrossRef]
  29. Valente, T.W.; Pitts, S.R. An appraisal of social network theory and analysis as applied to public health: challenges and opportunities. Annu. Rev. Public Health 2017, 38, 103–118. [Google Scholar] [CrossRef]
  30. Bruch, E.E.; Newman, M.E.J. Structure of online dating markets in US cities. Sociol. Sci. 2019, 6, 219–234. [Google Scholar] [CrossRef]
  31. Holme, P.; Edling, C.R.; Liljeros, F. Structure and time evolution of an Internet dating community. Soc. Netw. 2004, 26, 155–174. [Google Scholar] [CrossRef] [Green Version]
  32. Liu, C.; Lu, X. Analyzing hidden populations online: topic, emotion, and social network of HIV-related users in the largest Chinese online community. BMC Med. Inform. Decis. Mak. 2018, 18, 2. [Google Scholar] [CrossRef]
  33. Huang, G.; Cai, M.; Lu, X. Inferring Opinions and Behavioral Characteristics of Gay Men with Large Scale Multilingual Text from Blued. Int. J. Environ. Res. Public Health 2019, 16, 3597. [Google Scholar] [CrossRef] [PubMed]
  34. Chen, X.; Li, X.; Zheng, J.; Zhao, J.; He, J.; Zhang, G.; Tang, X. Club drugs and HIV/STD infection: an exploratory analysis among men who have sex with men in Changsha, China. PLoS ONE 2015, 10, e0126320. [Google Scholar] [CrossRef] [PubMed]
  35. Prestage, G.; Down, I.; Grulich, A.; Zablotska, I. Sex partying among gay men in Sydney, Melbourne and Brisbane, Australia. AIDS Behav. 2011, 15, 298–304. [Google Scholar] [CrossRef] [PubMed]
  36. Kossinets, G.; Watts, D.J. Empirical analysis of an evolving social network. science. 2006, 311, 88–90. [Google Scholar] [CrossRef]
  37. Gallos, L.K.; Rybski, D.; Liljeros, F.; Havlin, S.; Makse, H.A. How people interact in evolving online affiliation networks. Phys. Rev. X 2012, 2, 031014. [Google Scholar] [CrossRef]
  38. The Introduction of Baidu Tieba. Available online: https://en.wikipedia.org/wiki/Baidu_Tieba (accessed on 22 July 2019).
  39. Bu, Z.; Wang, Y.; Li, H.J.; Jiang, J.; Wu, Z.; Cao, J. Link prediction in temporal networks: Integrating survival analysis and game theory. Inf. Sci. 2019, 498, 41–61. [Google Scholar] [CrossRef]
  40. Wandelt, S.; Sun, X.; Zhang, J. Evolution of domestic airport networks: a review and comparative analysis. Transp. B Transp. Dyn. 2019, 7, 1–17. [Google Scholar] [CrossRef]
  41. Holme, P.; Kim, B.J.; Yoon, C.N.; Han, S.K. Attack vulnerability of complex networks. Phys. Rev. E 2002, 65, 056109. [Google Scholar] [CrossRef] [Green Version]
  42. Kim, H.J.; Lee, Y.; Kahng, B.; Kim, I. Weighted scale-free network in financial correlations. J. Phys. Soc. Jpn. 2002, 71, 2133–2136. [Google Scholar] [CrossRef]
  43. Montgomery, D.R. Slope distributions, threshold hillslopes, and steady-state topography. Am. J. Sci. 2001, 301, 432–454. [Google Scholar] [CrossRef]
  44. Glass, D. Social Mobility in Britain; Routledge: London, UK, 1954. [Google Scholar]
  45. Kalmijn, M. Status homogamy in the United States. Am. J. Sci. 1991, 97, 496–523. [Google Scholar] [CrossRef]
  46. Kalmijn, M. Intermarriage and homogamy: Causes, patterns, trends. Annu. Rev. Sociol. 1998, 24, 395–421. [Google Scholar] [CrossRef] [PubMed]
  47. Schwartz, C.R. Trends and variation in assortative mating: Causes and consequences. Annu. Rev. Sociol. 2013, 39, 451–470. [Google Scholar] [CrossRef]
  48. Laumann, E.O.; Ellingson, S.; Mahay, J.; Paik, A.; Youm, Y. The Sexual Organization of the City; University of Chicago Press: Chicago, MA, USA, 2005. [Google Scholar]
  49. Bailey, M.; Cao, R.; Kuchler, T.; Stroebel, J.; Wong, A. Social connectedness: measurement, determinants, and effects. J. Econ. Perspect. 2018, 32, 259–280. [Google Scholar] [CrossRef] [PubMed]
  50. Lu, X. Linked ego networks: improving estimate reliability and validity with respondent-driven sampling. Soc. Netw. 2013, 35, 669–685. [Google Scholar] [CrossRef]
  51. McPherson, M.; Smith-Lovin, L.; Cook, J.M. Birds of a feather: Homophily in social networks. Annu. Rev. Sociol. 2001, 27, 415–444. [Google Scholar] [CrossRef]
  52. Rosvall, M.; Axelsson, D.; Bergstrom, C.T. The map equation. Eur. Phys. J. 2009, 178, 13–23. [Google Scholar] [CrossRef]
  53. Rosvall, M.; Bergstrom, C.T. Maps of random walks on complex networks reveal community structure. Proc. Natl. Acad. Sci. USA 2008, 105, 1118–1123. [Google Scholar] [CrossRef] [Green Version]
  54. Bu, Z.; Li, H.J.; Zhang, C.; Cao, j.; Li, A.; Shi, Y. Graph k-means based on leader identification, dynamic game and opinion dynamics. IEEE Trans. Knowl. Data Eng. 2019. [Google Scholar] [CrossRef]
  55. Li, H.J.; Daniels, J.J. Social significance of community structure: Statistical view. Phys. Rev. E 2015, 91, 012801. [Google Scholar] [CrossRef]
  56. Blei, D.M.; Ng, A.Y.; Jordan, M.I. Latent dirichlet allocation. J. Mach. Learn. Res. 2003, 3, 993–1022. [Google Scholar]
  57. Mixing Dirichlet Topic Models and Word Embeddings to Make lda2vec. Available online: https://arxiv.org/pdf/1605.02019.pdf (accessed on 22 July 2019).
  58. Zhang, P.; Chi, X.; Wu, M. Status and influencing factors of attitude toward LGBT among college students. Chin. J. Public Health 2012, 28, 921–923. [Google Scholar]
  59. Zou, H.; Tucker, J.D.; Fan, S.; Xu, J.; Yu, M.; Luo, Z.; Cai, W.; Grulich, A.E. Learning about HIV the hard way: HIV among Chinese MSM attending university. Lancet Infect. Dis. 2018, 18, 16–18. [Google Scholar] [CrossRef]
  60. Dorogovtsev, S.N.; Mendes, J.F.F.; Samukhin, A.N. Giant strongly connected component of directed networks. Phys. Rev. E 2001, 64, 025101. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  61. Watts, D.J.; Strogatz, S.H. Collective dynamics of ‘small-world’networks. Nature 1998, 393, 440. [Google Scholar] [CrossRef]
  62. Colizza, V.; Flammini, A.; Serrano, M.A.; Vespignani, A. Detecting rich-club ordering in complex networks. Nat. Phys. 2006, 2, 110. [Google Scholar] [CrossRef]
  63. Newman, M.E.J. Assortative mixing in networks. Phys. Rev. Lett. 2002, 89, 208701. [Google Scholar] [CrossRef]
  64. Peel, L.; Delvenne, J.C.; Lambiotte, R. Multiscale mixing patterns in networks. Proc. Natl. Acad. Sci. USA 2018, 115, 4057–4062. [Google Scholar] [CrossRef] [Green Version]
  65. Newman, M.E.J. Mixing patterns in networks. Phys. Rev. E. 2003, 67, 026126. [Google Scholar] [CrossRef] [Green Version]
  66. Valente, T.W. Network interventions. Science 2012, 337, 49–53. [Google Scholar] [CrossRef]
  67. Valente, T.W. Putting the network in network interventions. Proc. Natl. Acad. Sci. USA 2017, 114, 9500–9501. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  68. Valente, T.W.; Pumpuang, P. Identifying opinion leaders to promote behavior change. Health Educ. Behav. 2007, 34, 881–896. [Google Scholar] [CrossRef] [PubMed]
  69. Van Woudenberg, T.J.; Bevelander, K.E.; Burk, W.J.; Smit, C.R.; Buijs, L.; Buijzen, M. A randomized controlled trial testing a social network intervention to promote physical activity among adolescents. BMC public health. 2018, 18, 542. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Overview of the research framework.
Figure 1. Overview of the research framework.
Ijerph 16 04322 g001
Figure 2. (A) The degree distribution of the MSM dating network. The horizontal axis represents the in-degree or out-degree, and the vertical axis represents the proportion of nodes with corresponding in-degree or out-degree in the network. (B) The distribution of the number of users’ posts on gay-bar. The horizontal axis represents the number of posts, and the vertical axis represents the proportion of corresponding users. (C) The distribution of the weights on the edges in the MSM dating network. The horizontal axis represents the edge weight, and the vertical axis represents the proportion of edges with the weight. All figures are in double-logarithmic (e) scales using log-bin [42,43].
Figure 2. (A) The degree distribution of the MSM dating network. The horizontal axis represents the in-degree or out-degree, and the vertical axis represents the proportion of nodes with corresponding in-degree or out-degree in the network. (B) The distribution of the number of users’ posts on gay-bar. The horizontal axis represents the number of posts, and the vertical axis represents the proportion of corresponding users. (C) The distribution of the weights on the edges in the MSM dating network. The horizontal axis represents the edge weight, and the vertical axis represents the proportion of edges with the weight. All figures are in double-logarithmic (e) scales using log-bin [42,43].
Ijerph 16 04322 g002
Figure 3. (A) The size of different communities in the gay-bar network. Each bubble represents a community, and the number in the bubble represents the community’s ID. The larger the bubble, the larger the corresponding community. (B) The word-cloud of topic keywords in community 792 and community 1126. The font size corresponds to the word frequency, and the English translation is presented below each of the word-cloud.
Figure 3. (A) The size of different communities in the gay-bar network. Each bubble represents a community, and the number in the bubble represents the community’s ID. The larger the bubble, the larger the corresponding community. (B) The word-cloud of topic keywords in community 792 and community 1126. The font size corresponds to the word frequency, and the English translation is presented below each of the word-cloud.
Ijerph 16 04322 g003
Figure 4. (A) The time evolution pattern of the number of nodes in the MSM dating network. (B) The time evolution pattern of the number of edges in the MSM dating network. (C) The time evolution pattern of the average degree in the MSM dating network. All horizontal axes represent the time in which the network was formed, and the vertical axes represent the value of indexes in the network for the corresponding time.
Figure 4. (A) The time evolution pattern of the number of nodes in the MSM dating network. (B) The time evolution pattern of the number of edges in the MSM dating network. (C) The time evolution pattern of the average degree in the MSM dating network. All horizontal axes represent the time in which the network was formed, and the vertical axes represent the value of indexes in the network for the corresponding time.
Ijerph 16 04322 g004
Figure 5. (A) The distribution of the degree growth rates of nodes each year. (B) The time evolution pattern of the average of the degree growth rates. (C) The time evolution pattern of the average of the changing friend rates. All these indexes are based on the annual separate networks, not the cumulative network (the MSM dating network).
Figure 5. (A) The distribution of the degree growth rates of nodes each year. (B) The time evolution pattern of the average of the degree growth rates. (C) The time evolution pattern of the average of the changing friend rates. All these indexes are based on the annual separate networks, not the cumulative network (the MSM dating network).
Ijerph 16 04322 g005
Figure 6. The size (A) and the proportion (B) of the GSCC and the GWCC in the MSM dating network. The horizontal axis represents the time when the network was generated. The vertical axis represents the size or the proportion of the GSCC and the GWCC, respectively.
Figure 6. The size (A) and the proportion (B) of the GSCC and the GWCC in the MSM dating network. The horizontal axis represents the time when the network was generated. The vertical axis represents the size or the proportion of the GSCC and the GWCC, respectively.
Ijerph 16 04322 g006
Figure 7. (A) The time evolution pattern of the average shortest-path length, L, in the MSM dating network. (B) The time evolution pattern of the clustering coefficient, C, in the MSM dating network. (C) The time evolution pattern of the assortativity, A, in the MSM dating network. a, b, and c are the networks corresponding to the marked points.
Figure 7. (A) The time evolution pattern of the average shortest-path length, L, in the MSM dating network. (B) The time evolution pattern of the clustering coefficient, C, in the MSM dating network. (C) The time evolution pattern of the assortativity, A, in the MSM dating network. a, b, and c are the networks corresponding to the marked points.
Ijerph 16 04322 g007
Figure 8. (A) The time evolution pattern of the number of communities in the MSM dating network. (B) The time evolution pattern of the size of the largest community in the MSM dating network. (C) The member flows in the 10 largest communities. The thickness of the stripe represents the proportion of the flow members; the thicker the line, the higher the flow ratio (between 0 and 1).
Figure 8. (A) The time evolution pattern of the number of communities in the MSM dating network. (B) The time evolution pattern of the size of the largest community in the MSM dating network. (C) The member flows in the 10 largest communities. The thickness of the stripe represents the proportion of the flow members; the thicker the line, the higher the flow ratio (between 0 and 1).
Ijerph 16 04322 g008
Table 1. Static network measurements of the MSM dating network.
Table 1. Static network measurements of the MSM dating network.
Number of NodesNumber of Weighted EdgesDiameterAverage Shortest-Path LengthAverage DegreeEdge WeightsHomophily for City (Province)Number of Communities
807,5915,132,933124.236.36(0,9887)0.356 (0.478)44,076

Share and Cite

MDPI and ACS Style

Liu, C.; Lu, X. Network Evolution of a Large Online MSM Dating Community: 2005–2018. Int. J. Environ. Res. Public Health 2019, 16, 4322. https://doi.org/10.3390/ijerph16224322

AMA Style

Liu C, Lu X. Network Evolution of a Large Online MSM Dating Community: 2005–2018. International Journal of Environmental Research and Public Health. 2019; 16(22):4322. https://doi.org/10.3390/ijerph16224322

Chicago/Turabian Style

Liu, Chuchu, and Xin Lu. 2019. "Network Evolution of a Large Online MSM Dating Community: 2005–2018" International Journal of Environmental Research and Public Health 16, no. 22: 4322. https://doi.org/10.3390/ijerph16224322

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop