Profiles of a Health Information–Seeking Population and the Current Digital Divide: Cross-Sectional Analysis of the 2015-2016 California Health Interview Survey

Background: Internet use for health information is important, given the rise of electronic health (eHealth) that integrates technology into health care. Despite the perceived widespread use of the internet, a persistent “digital divide” exists in which many individuals have ready access to the internet and others do not. To date, most published reports have compared characteristics of internet users seeking health information vs nonusers. However, there is little understanding of the differences between internet users seeking health information online and users who do not seek such information online. Understanding these differences could enable targeted outreach for health interventions and promotion of eHealth technologies. Objective: This study aims to assess population-level characteristics associated with different types of internet use, particularly for seeking online health information. Methods: The 2015-2016 California Health Interview Survey datasets were used for this study. Internet use was classified as never used the internet ( Never use ), ever used the internet but not to search for health information in the last 12 months ( Use not for health ), and ever used the internet and have used it to search for health information in the last


Introduction
The internet has become a widespread tool of social communication, economic opportunity, and health care information and access [1]. Since 2000, use of the internet has increased nationally from 52% to 89% [2]. In the realm of health and well-being, access to and use of the internet are central to the rise of electronic health (eHealth), which is the integration of technology into health care to improve and facilitate wellness and health maintenance [3][4][5]. The Affordable Care Act, in particular, has expanded the integration of the internet in health care by funding and incentivizing health information technology in the form of patient portals and electronic health records [6]. Although these advances are intended to improve patient wellness, a digital divide exists in which certain individuals have ready access to and can use the internet, whereas others do not [7]. This divide has the potential to exacerbate existing health disparities among vulnerable populations.
Although the digital divide was historically characterized by access to the internet, more recently, the divide has highlighted differences in use and skills [4]. Use of the internet for health information has real-world benefit, including application of information learned online to manage and monitor one's health [4,8,9]. However, internet access and use vary by individual characteristics including race and ethnicity. Prior studies report that individuals of older age, lower income, male gender, and rural locality tend to be less likely to report internet access or use compared to their counterparts [7,10]. Latinos, African Americans, and individuals with low English proficiency are more likely to report no internet use compared to non-Hispanic Whites and English-proficient individuals [7,11]. Assessing characteristics of those who do and those who do not use the internet for health information may be important within the context of the digital divide, because it has implications on the potential impact of eHealth. Specifically, groups least likely to report internet use are also less likely to report use of patient portals and access online health information [12,13].
Published reports about internet use, specifically for seeking health information, have primarily compared individuals who do not use the internet in any capacity to those who use the internet specifically for seeking health information [7,10]. This type of dichotomy tends to focus primarily on internet use in the context of access (ie, those who have access to the internet for seeking health information are users and those who do not are nonusers). This overlooks a third group that exists-individuals who use the internet but not to seek health information. To our knowledge, only one other study explored characteristics of these individuals in comparison to internet users who seek health information online [7]. However, characterization of groups by internet use should include nonusers of the internet as well as other types of users. By delineating group characteristics according to the type of use, findings may reveal new opportunities for intervention. Therefore, this study aims to assess population-level characteristics associated with internet use, particularly for seeking health information online. Sociodemographic and health characteristics will be explored among those who report no use of the internet, those who use the internet for seeking health information, and those who use the internet but not for seeking health information. Varied profiles of use and nonuse may indicate areas of the digital divide that are changing and may benefit from intervention and targeted resources.

Data Source and Participants
Data from the 2015-2016 California Health Interview Survey (CHIS) were used for this study. CHIS is a statewide study conducted by the University of California Los Angeles Center for Health Policy and Research in collaboration with the California Department of Public Health and the Department of Health Care Services. Data are collected annually through surveys administered by random-digit dialing to both landline phones and cellphones. CHIS data yield a representative sample of the noninstitutionalized population of California to explore health, wellness, and access by race and region across the state.
This study used publicly available data files that allow for estimation of state-level outcomes. The 2015-2016 CHIS data files are provided separately for each year and require pooling to aggregate data. The first wave of data was collected from May 2015 to February 2016 and the second, from January 2016 to December 2016. Participants were recruited through landline and cell phone sampling. All individuals who participated in the adult (age≥18 years) 2015-2016 surveys were included in this study.

Types of Internet Use
The CHIS assesses types of internet use by asking participants "Have you ever used the Internet?" and "In the past 12 months, did you use the Internet to look for health or medical information?" [14]. Responses to questions were "yes" or "no." Only those who responded "yes" to having used the internet were asked about searching for health or medical information online. Internet use could be on any computer or electronic device and encompassed a variety of activities like emailing and searching the Web. Health and medical information was broadly defined as searching for health and wellness advice (eg, nutrition and physical activity), disease symptoms, health plans, and other related topics [14]. These questions were combined to create the main outcomes of this study: never used the internet (Never use), ever used the internet but have not used it to search for health information in the last 12 months (Use not for health), and ever used the internet and have used it to search for health information in the last 12 months (Use for health).

Measures
Participant sociodemographic and health information characteristics were included to assess association with the types of internet use. Variables were selected based on previous literature indicating an association of internet access with type of use and online health seeking. Sociodemographic variables included age, gender, marital status, education, primary language used at home, nativity, household size, federal poverty level, and race/ethnicity. Age was categorized as <60 years and ≥60 years based on past research, which highlights a disparity in internet use around this age [10]. Race and ethnicity are reported in CHIS based on the Office of Management and Budget classification and included six categories: Hispanic, Non-Hispanic (NH) white, African-American (NH), Asian (NH), American Indian/Alaskan Native (NH), and Other races. Rural and urban locality is a CHIS-generated variable based on zip code data.
Health information included the type of health insurance (private, public, or no insurance), presence of usual care ("yes" was classified as access to a doctor or clinic/health center and "no" was classified as no reported usual care, emergency department listed as usual care, or no one place used for usual care), current health status (excellent, very good, good, fair, or poor), and visit to a medical provider in the last 12 months ("yes" or "no").

Weighting of Data
As recommended by the CHIS, this study utilized weights for replication methods. The CHIS uses a complex sampling design that involves oversampling by geographic area and group characteristics (eg, race/ethnicity). Weights are thus needed to adjust for the different probabilities of selection and to most accurately reflect the population of California. Design characteristics that require weighted variables include multiple people interviewed in one household (maximum 3), oversampling from certain geographic areas, and random-digit phone dialing for participant recruitment. Weights are designed for the jackknife method of variance and bias estimation.

Statistical Analyses
Descriptive statistics on internet use and variables of interest were performed using design-adjusted or weighted frequencies.
Correlations between variables were assessed to avoid multi-collinearity; closely identified variables (rho≤0.5) were reduced to one variable for inclusion in the final model [15]. Bivariate analyses were conducted between each variable and internet use in order to determine inclusion into the final model at a significance of P<.05. To evaluate the association between the variables of interest and internet use, weighted multinomial logistic regression analysis was conducted to model internet use outcomes, using the NeverUse group assigned as the reference group. Findings are reported as odds ratios (OR) with 95% CIs. The difference of association between Use not for health and Use for health groups was assessed by evaluating overlapping CIs from the weighted multinomial logistic regression. Evaluating CIs is a conservative method of assessing significant differences as compared to other post-hoc analyses/comparisons of groups [16,17]. Assessment of CIs focused on CIs that did not overlap, thus indicating a significant difference between groups [16,17]. All analyses were conducted using SAS OnDemand for Academics (SAS Institute Inc, Cary, NC).

Study Population
The

Internet Use by Population Characteristics
The distribution of population characteristics across levels of internet use is reported in Table 1 Living in an urban location was significant for Use for Health compared to Never Use (OR: 0.7 95% CI 0.6-0.9 for rural vs urban); however, there was no significant difference in geographic locality between Never Use and Use not for health.Use for health individuals were also more likely to be employed (OR: 0.5, 95% CI 0.4-0.7 for unemployed vs employed) and privately insured (OR: 1.7, 95% CI 1.4-2.2 compared to Never Use; Table 2).  [18]. c "Employed" includes those with full-and part-time employment; "Unemployed" includes those looking for work and those not looking for work. d Determined by zip codes. e Public: categorized as only Medicare, only Medicaid, combination of the two, and the combination of one with a classified other insurance. Private: categorized as employment-based insurance and privately purchased insurance. f "Some college" includes vocational school, Associates of Arts, Associates of Science, and some years of college. g Usual care includes doctor or clinic/health center. No usual care includes no reported usual care, emergency department listed as usual care, or no one place used for usual care. h "Other" includes widowed, separated, divorced, or living with partner.
i Asian languages include Chinese, Vietnamese, and Korean.    The Use not for health group had similar characteristics as the Use for health group, except in the likelihood of being female, likelihood of being African American (non-Hispanic), levels of education, and likelihood of not having visited a physician in the last year. Compared to Use for health individuals, Use not for health individuals were significantly less likely to be female but significantly more likely to be African American (NH). Compared to Never Use individuals, more Use not for health individuals had college level or higher education (OR: 4.5, 95% CI 3.2-6.3 for Bachelor of Arts/Bachelor of Science and OR: 7.1, 95% CI 4.5-11.4 for master's degree or higher). These CIs did not overlap when compared to Use for health (OR: 15.3, 95% CI 10.8-21.7 for Bachelor of Arts/Bachelor of Science and OR: 32.1, 95% CI 19.6-52.4 for master's degree or higher), indicating a larger proportion of people with higher education among the Use for health group. Not visiting a physician in the last year was not significantly different between the Never Use and Use not for health groups (OR: 1.2, 95% CI 1.0-1.6) but was different when compared to the Use for health group (OR: 0.6, 95% CI 0.5-0.8).
Overall, the Use not for health and Use for health groups shared more similarities in sociodemographic characteristics than the Never Use group (Figure 1). Only the usual source of care did not significantly differ between the groups. Characteristics of groups by types of internet use. Never use: never used the internet; Use not for health: ever used the internet but not for seeking health information in the last 12 months; Use for health: used internet ever and for health information in the last 12 months.

Principal Results
This study assessed population characteristics associated with types of internet use for seeking health information among a population representative sample in California. The study adds to the literature by updating the assessment of the digital divide across three categories of use.

Limitations
Strengths of this study include use of a population representative sample of California and assessment of internet use, specifically for seeking health information online. The size of the sample and subsequent weighting allowed for nuanced evaluation of small but key groups within California like racial minority and rural residing groups. Limitations of the study include lack of specificity in terms of what type of health information respondents sought online and with what frequency respondents sought this information. Additionally, CHIS does not report on other aspects considered important for internet use including health literacy and technical skills to navigate online searching. As a population-based study specific to California, the CHIS results are not generalizable to other states. Additionally, CHIS is based on self-report and may be subject to recall bias.

Comparison to Prior Work
Compared to the study by Nguyen et al [7], the proportion of Californians who are internet users and who searched for health information has grown from 81.5% to 84.2% and from 64.5% to 65.4%, respectively [7]. This study shows that compared to Use for health individuals, Never Use individuals tend to be older, male, Hispanic, or African American (vs NH white); be of a lower socioeconomic status; and report poorer health (compared to excellent health). These distinctions mirror what is already understood about the digital divide, specifically with regard to age disparities (older adults vs adults) and racial disparities (African American/Hispanic vs NH white) [7,19]. Although general internet use is growing among older adults (67%) and racial/ethnic minorities (87%), these groups still report less internet use and access [2]. Older adults are a vulnerable group, given their high health needs, and may benefit from internet use for health. Barriers to internet use among older adults include technical difficulties, confusion with the amount of information online, disability in terms of psychomotor function, health literacy, costs, and distrust in internet sources [10,20]. Of note, distrust and low health literacy (eg, understanding health and medical information) have been significantly associated with less internet use for seeking health information even when older adults report using the internet [10,20].
Among African American and Hispanic individuals, internet access, defined by desktop, laptop, or handheld computer (ie, smartphone) ownership or broadband subscription, is low (63.5% and 69.6%, respectively) as compared to NH white individuals (78.8%) [21]. Smartphones have helped increase access to the internet for minority groups. Compared to only 9% of NH white individuals, 22% of Hispanic and 15% of African American individuals are smartphone-only internet users [22]. Among these groups, activities such as health information seeking are more likely to be performed on smartphones than on a traditional computer. Among Hispanic individuals, nonusers tend to be less proficient in English, and Spanish is the primary language in which media is consumed [11]. Language and literacy barriers are significant, as searching, reading, and comprehending health information online can be complex. Use of a patient portal may be more complicated than searching for health information online, as the former involves logging in, password creation, and more specific medical information. Thus, language and literacy are significant barriers to internet use generally and in the context of eHealth.
Unique to our study is the classification of Use not for health and comparing this group to both Never Use and Use for health groups. Given the saturation of internet use in our sample (>80% reported internet use), we were able to subcategorize this group into use but not for health and use for health seeking. Differences in educational attainment, gender, race (specifically of African American race), and physician visits in the last year differentiate the Use not for health group from the Use for health group. Nguyen et al [7] reported similar findings when only characterizing internet users who search for health information online. Specifically, they found that individuals who use the internet to search for health information were more likely to have seen a physician in the past year, to have higher educational attainment, to be of an ethnic minority (ie, African American or Latino), and to be female [7]. Not having visited a physician in the last 12 months may explain the low use of the internet for seeking health information, because it may indicate no serious diagnosis and thus no concerns or interests in seeking health information online. A serious diagnosis, or risk of one, is associated with health information seeking, as evidenced by a national US sample in which health information seeking was more prevalent among cancer survivors (69.8%) and individuals with a family history of cancer (51.2%) than those who had no history of cancer (29.6%) [23]. In the context of eHealth and health technology integration, it is important to explore why individuals who have access to the internet in some capacity do not use it for health-seeking activities (eg, searching for health information).
Types of internet use may indicate the extent to which eHealth expansion and integrated online health tools may be missing key populations. Health systems and policy makers may consider the characteristics of populations identified in this study by Never Use and Use not for health groups, because these populations may be lacking in the intended benefit of patient portals and eHealth resources. In particular, if Never Use and Use not for health groups do not frequent medical providers, accessing these groups may require other techniques besides, for example, designating a patient portal during a clinic visit. Types of use may also reflect existing disparities in health care access continuing into a new era of health technology. As evidenced by a study of a representative sample in the Dutch population, disparities in internet usage reflected real-life disparities experienced within the Dutch population according to gender, age, and level of education [24]. Careful consideration and purposeful design of health and technology integration may have the potential to eliminate disparities [25]. Patient portals have already been shown to improve patient outcomes and engagement and reduce health care costs, especially among patients with chronic conditions [26]. However, this only applies to patients who are able to access, use, and comprehend the information being shared through these portals. Healthy People 2020 has incorporated internet access and use into its objectives to target these disparities [27]. Particularly relevant are objectives to increase the proportion of online health information seekers who report easy access to information to increase health literacy and the proportion of health-related websites that are simple and usable [27].

Conclusions
Advances in health care and management have promoted internet integration and its use to support health and wellness. Although our study found a high prevalence of internet use across the study population, findings suggest that a digital divide continues to exist. The widest divide still remains between nonusers and users, and a lesser divide exists among users who search for health information and users who do not. Disparities identified in both internet use and health information seeking reflect a lack of health equity in a new era of technological advances in society and health care. The internet is one important tool for the development of an empowered patient and individual. Underutilization of the internet as a tool of health information leaves behind vulnerable populations and may have an adverse impact on health care for these individuals. Future studies may explore specific barriers to internet use and online health information seeking among Never Users and Use but not for health to inform and shape targeted and tailored interventions. Targeting interventions and educational materials to nonusers and users of the internet may improve internet and health literacy and support the ultimate goal of developing a more informed and empowered individual.