Dataset on waste management behaviors of urban citizens in large cities of Indonesia

This study used a statistical approach to measure how urban citizens in certain provinces of Indonesia handle their waste. It illustrates how the desirable habits related to environmental consciousness differ across urban citizens among different regions and economic classes. A wide disparity was found in people's understanding of a healthy and clean environment across provinces and cities. Waste management ignorance was also found to be prevalent. Inculcating personal awareness of the local environment was found to be a good start toward keeping the environment clean. The observed positive correlation between the overall living conditions and littering behavior indicates that households that exhibit littering behavior also tend to score higher on living conditions. This significant positive correlation is indicative of self-interest and ignorance. The study also suggests that a higher level of household economic prosperity correlates with a more desirable behavior toward maintaining a clean and healthy environment; such behaviors are also adopted by citizens living in clean neighborhoods. Furthermore, a clean and healthy lifestyle is also supported by environmental consciousness in conjunction with hygienic environmental conditions.


a b s t r a c t
This study used a statistical approach to measure how urban citizens in certain provinces of Indonesia handle their waste. It illustrates how the desirable habits related to environmental consciousness differ across urban citizens among different regions and economic classes. A wide disparity was found in people's understanding of a healthy and clean environment across provinces and cities. Waste management ignorance was also found to be prevalent. Inculcating personal awareness of the local environment was found to be a good start toward keeping the environment clean. The observed positive correlation between the overall living conditions and littering behavior indicates that households that exhibit littering behavior also tend to score higher on living conditions. This significant positive correlation is indicative of self-interest and ignorance. The study also suggests that a higher level of household economic prosperity correlates with a more desirable behavior toward maintaining a clean and healthy environment; such behaviors are also adopted by citizens living in clean neighborhoods. Furthermore, a clean and healthy lifestyle is also supported by environmental consciousness in conjunction with hygienic environmental conditions.
© 2020 The Authors. Published by Elsevier Inc. This is an open access article under the CC BY license.
( http://creativecommons.org/licenses/by/4.0/ ) Specifications Table   Subject Environmental Science Specific subject area Waste Management Behavior Type of data Table  How data were acquired Data were acquired from a primary data survey across Indonesia's large cities in six provinces. The survey utilized a questionnaire, which is attached as Supplementary File S1. The compiled answers related to the questionnaire are provided as Supplementary File S2. Data format Raw Analyzed Parameters for data collection The questionnaire was developed to measure urban citizens in certain provinces of Indonesia handle their waste Description of data collection The data used in this study were obtained from members of 600 households (respondents) living in 6 different provinces in Indonesia. The data from nearly 100 households were gathered from the cities within each of these provinces. Data source location Provinces: Special Region of Jakarta, Jambi, West Sumatra, West Java, East Java, MalukuCities: West Jakarta, East Jakarta, Central Jakarta, North Jakarta, South Jakarta, Jambi, Muaro Jambi, Padang, Surabaya, Tasikmalaya, AmbonCountry: IndonesiaLatitude and longitude: -

Value of the Data
• These inferential statistical data and analyses are useful to understand how lifestyle and littering behavior affect the self-assessed living conditions of Indonesia's urban citizens. • Indonesia's urban citizens as well as policymakers can benefit from these data. The data show that environmental conditions can be improved by enhancing the economic prosperity and focusing on the education of the urban citizens. • The study data can aid policymakers at the city-and province-level in reshaping citizen behavior regarding waste management. • The study's analysis of representative cities indicates a degree of ignorance regarding the environment in Indonesia's urban citizens; in these cities, a positive relationship is observed between desirable "environmental habits" and living conditions. Hence, to aid in solving the waste-handling problem, the issue of ignorance must be addressed. • This analysis also suggests that correlations between certain waste-handling behaviors, socioeconomic factors, and other external factors affect the self-assessed living conditions.

Data description
The data used in this study were acquired from over 600 households in 6 different provinces in Indonesia. Data were gathered from 100 households in different cities within these provinces. The same family ID was used across all provinces (that is, Family ID 01 exists for each province). However, each family ID was unique within the province; only one family was coded 01 in each province, and no two families were assigned the same code within the same province. The number of observations by provinces and by cities are shown in Figs. 1 and 2 , respectively. The complete questionnaire used to gather the data is provided as Supplementary File S1. The  questionnaire consisted of seven sets of questions covering the descriptions of spatial and household recognition, household summary, household members, self-regulated garbage disposal routines/habits, knowledge and contribution to waste-handling, perception and attitudes on waste handling, and surveyor-guided self-assessment conditions. The answers to each question in the questionnaire were compiled and are presented in another supplementary file.
The provinces corresponding to these data include the Special Administrative Region of Jakarta, Jambi, West Java, East Java, Maluku, and West Sumatra. Sampled cities include Jakarta, Ambon, Tasikmalaya, Jambi, Muaro Jambi, Padang, and Surabaya. These cities were chosen as they are characterized by relatively high urban populations within the provinces of interest. Meanwhile, provinces were chosen based on their population density in conjunction with the magnitude of the waste-handling problem. The cities were also chosen as representative major cities in Indonesia with urban waste management problems. Here, in the following discussion, the words "respondents" and "urban citizens" are used interchangeably, and both refer to the urban citizens who participated in this research.
In the study, the questions on waste-handling activities were classified into certain categories. These categories cover self-regulated routines/habits with regard to trash disposal, waste disposal facilities, waste-handling awareness and perceptions, knowledge and contribution, and "self-assessment" of lifestyle [1] .
Moreover, citizens across the economic strata were chosen for random sampling. The respondents were queried on behavioral information related to waste-handling activities. To ensure greater depth of analysis, the respondents were also asked about their income and expenditure per capita per day. Table 1 lists the general information on the waste-handling behavior of the respondents in each province.
From Table 1 , it can be observed that the Maluku Province has both the most income as well as expenditure per capita relative to other provinces. West Sumatra has the lowest expenditure per capita, followed by Jakarta. Citizens of Maluku, West Sumatra, and Jambi tend to be least concerned about their living environment (as indicated by the low number of households that suitably disposed garbage the last time they did). The most prevalent reasons why the inmates of some households disposed garbage correctly are self-initiative to maintain a clean environment or exemplary behavior for others to emulate. These desirable habits are most likely formed before these citizens entered school, as indicated by respondent answers on having such habits before beginning school. Additionally, more than 80% of them were taught not to litter during their schooling years. This is an indication of how investment through education influences not only human productivity in the labor market, but also the ability to make good choices [2] , including those that affect investments in their wellbeing, as in the case of waste management. An understanding of good health can encourage individuals to follow precautionary behaviors such as maintaining a clean environment through responsible actions related to proper waste management. These choices are lacking for those with no or little knowledge of environmental cleanliness as the key to their health and wellbeing.
According to the summarized statistics described previously, there is a significant difference between the means of the per capita expenditure among provinces and cities. The significant differences among geographical factors show that the data are normally distributed across locations in terms of socioeconomic conditions.
The scores of the overall living conditions and living environments also differ significantly across the expenditure groups. A low score for the living conditions is observed in the case of low-expenditure households and regions with residents with low daily expenditure [ 3 , 4 , 5 ]. The significant differences among citizens grouped by expenditure per capita also show that the living conditions are normally distributed across income groups.
In the study, the first part of the analysis focused on detecting whether the self-assessment scores significantly differed among the groups of respondents. In addition, the analysis focused on detecting the existence of interactions between the tested variables [ 6 , 7 ].
The first stage involved the evaluation of the score of the overall living conditions. This score was measured on a scale of 1 to 5, with 1 denoting unsatisfactory and 5 denoting very satisfactory and comfortable. The score categorization was explained by the surveyors under certain criteria. From Table 2 , it can be observed that this score differs significantly across cities and income groups. However, the geographical location and income-group categories exhibit no correlation; there is no interaction between these variables that affects the living-conditions score [ 8 , 9 , 10 ]. The next stage involved the evaluation of the environmental hygiene/cleanliness score. Environmental cleanliness was measured from scale 1 to 5, with a score of 5 denoting the highest level of hygiene. The ANOVA test results in Table 3 indicate that there exist mean differences in environmental hygiene by cities and expenditure groups (as indicate by statistical significance). However, there are no significant interaction factors or tendencies attributable to living in certain cities, and citizens in certain income/expenditure groups exhibit significantly deteriorating  or improving hygiene behaviors. In other words, while the score variable and the choice of city may be correlated, they appear as independent variables [11] .
As regards the score of the knowledge/awareness of the need for a healthy and clean environment ( Table 4 ), the observed trend is similar to the previous two cases. A statistical significance of the mean differences is observed based on the city and income/expenditure. People living in better economic conditions are more knowledgeable. Furthermore, there is no interaction factor between the city of living and the score of environmental consciousness, which indicates that there are no tendencies of variable interactions that affect the score [3] . These two parameters may also be correlated; however, they appear as independent variables.
As regards the score of environmental cleanliness/hygiene ( Table 5 ), the results are similar to those of the previous cases. There is a statistical significance of the mean differences based on cities and income/expenditure. People living in poorer economic conditions tend to live in less cleaner environments. However, there is no interaction between the city of living and the cleanliness score; there are no tendencies of variable interactions that affect the score [12] . These two variables again appear independent of each other. The last stage of the analysis focuses on how the chosen variables affect each other and the significance of the correlation. The confidence level in the following analysis was set to 95%, with the alpha value (significance level) set to 5%. Table 6 summarizes the Spearman correlation matrix.
In Table 6 , the coefficient correlation is indicated by enlarged numbers in gray boxes, and the numbers in bold (accompanied by an asterisk) indicate strong and significant correlations. The variables in blue-colored fonts indicate that their values were acquired via self-assessment of respondents [2] .
The abovementioned findings are convergent with those of other studies [ 13 , 14 , 15 ]; these studies also reported that a clean and healthy lifestyle in conjunction with education generally correlated with clean environmental conditions. A clean and healthy lifestyle is not usually replaced by an unhygienic lifestyle. In addition, households greater economic prosperity (indicated by a positive correlation with expenditure or income per capita) usually have better living conditions. Better economic conditions, in this case, also aid people in keeping the environment clean. This finding indicates why slums often appear unkempt and unhygienic [5] . A surprising finding refers to the dummy variable of littering behavior. A positive correlation between the overall living conditions and littering behavior indicates that households with littering behavior also tend to score higher on the living conditions. This significant positive relationship is an indicator of self-interest and ignorance. In general, social behavior and lifestyle should suitably conform with government policies [16] . This trend also shows the need for participation in waste management at the community level [17] .
The major factor toward creating a clean and healthy lifestyle among urban citizens is environmental consciousness, as indicated by the coefficient correlation of 0.526, meaning that the overall environmental cleanliness score will increase by 0.526 when the environmental consciousness score is increased by 1. Here, it is noted that such a correlation is not an accurate reflection of how urban citizens handle waste; however, it reveals how each of the variables correlates and the anomalies occurring in society.

Experimental design, materials, and methods
The experimental design also aimed at evaluating how economic conditions correlate with the littering behavior of the households surveyed. The analysis also measured the significance of the mean differences among groups based on the survey data. Here, it is noted that the survey covered 600 households from 6 different provinces, with 100 samples being acquired from different cities within these provinces. Importantly, the respondents were residents and not migrants. The dataset containing the respondent records was suitably filtered by data processing   software. In particular, qualitative records were encoded, and quantitative ones were cleared from string inputs. The survey also elaborates the socioeconomic dimensions of waste-handling behavior by focusing on the economic status, which affects the quality of living and waste-handling behavior of urban citizens. These indicators were embedded in the questionnaire form, which consisted of eight sets of questions. The details and description of the survey are listed in Table 7 . The respondent economic conditions were measured via the approximate expenditure and income for each household. The income and expenditure were stated in Rupiah, and they were converted to USD assuming that 1 USD equals IDR 14,071 (average exchange rate in October 2019). The approximate income and expenditure per capita were calculated by dividing the approximate household income and expenditure by the number of household members.
The following variables were scored based on self-assessment (from session 7): (i) living state/conditions, (ii) knowledge/awareness of healthy environment and lifestyle, (iii) cleanliness and hygienic living conditions, and (iv) hygiene and lifestyle relative to socioeconomic conditions (approximated by decile group of expenditure per capita), city of living, and the dummy variable of littering behavior. The self-assessment scores are designed such that higher scores correspond to better conditions and vice versa . The self-assessment scores were not completely subjective as the surveyor explained the classification criteria for each score to the respondents. Hence, the scores can be considered as guided self-assessment scores.
The variables were analyzed via various possible analysis of variance (ANOVA) approaches in conjunction with the non-parametric Spearman's correlation test. The tests were conducted to check whether these variables exhibit a significant mean among the categories of cities of living, decile group of incomes, and households with littering habits.
In addition, the Spearman's rank correlation matrix (including significance levels and asterisk) was used to analyze the correlation tendencies between variables. The significance levels and asterisk under the coefficient correlation mark important significant tendencies between two variables, and this can enable focus on the factors of interest. The recorded significance levels can also guide researchers and policymakers on what factors or variables they should focus on to raise awareness of the need for a clean and healthy lifestyle.
In inferential statistical analysis, the definitions of groups (categories) or the division of respondent groups are important. In this study, respondents (households) were classified into several types of groups: First, the domicile-based group categorized respondents at the city or province level. Second, respondents were categorized based on waste-handling behaviors or littering. Here, it must be noted that only some households were surveyed on their littering behavior. Thirdly, respondent data were classified into deciles based on their expenditure per capita. The 1st decile indicated the 10% of respondents with the lowest expenditure per capita, while the 10th decile indicated the 10% of respondents with the highest expenditure per capita. For the sake of simplicity, the summarized statistics are provided based on domicile-based groups at the province level.