Datasets of demographic profile and perpetrator experience in committing crime among young offenders in Malaysia

The datasets in this article provides supplementary information related to: (1) demographic profile of young offenders and (2) perpetrator experience in committing a crime. A quantitative approach based on a cross-sectional survey design was employed to collect data among 306 young offenders undergoing Community Service Order initiated by the Malaysian Social Welfare Department. The resultant data were analysed descriptively using Statistical Package for the Social Sciences (SPSS). The result stipulates that the majority of respondents are consist of male young offenders aged 20 years old, Malays, single in marital status, and unemployed. Based on the crime involvement aspect, the result indicates that young offenders involved in stealing (26.1%), does not carry any weapons while committing a crime (50.0%), and entangled in criminal activity due to peer influence (40.0%). Moreover, unfavorable luck contributes to the failure in executing crime (52.6%) which subsequently leads them to be arrested by the police (52.0%).


Value of the data
• The data can serve as an indication to the Malaysian Social Welfare Department to understand the crime pattern among young offenders in Malaysia. • The data is valuable to improvise the existing prevention program thus the crime rate among the younger generation can be reduced in the near future. • The data can be useful for the stakeholders and policymakers working in the fields of crime and social welfare by imposing proper measures to reduce the crime rate among the younger generation in Malaysia.

Data
The dataset in this article is obtained through a survey conducted among 306 young offenders undergoing Community Service Order. The dataset is divided into two Tables. Table 1 stipules the demographic profile of young offenders whereas Table 2 depicts the perpetrator experience in committing a crime. The raw data file is included as supplementary material in this article.

Experimental design
A quantitative approach based on a cross-sectional survey design was employed to collect data among 306 young offenders under going Community Service Order. Nine survey questions were developed based on previous studies in the field of crime [1 , 2] . Upon developing the instrument, face validity and content validity were executed to ensure that the developed items in the instrument represent the measured phenomena. In general, face validity refers to the researcher's subjective assessment to verify whether the items in the instrument appear to be relevant, clear and reasonable [3] . Correspondingly, according to Anastasi and Urbina (1997)  [4] content validation plays a primary role to test the accuracy of the domain that is aimed to be measured. Face validity was employed by getting feedback from the subject matter expert (panel) to review and validate all the items (question) within the instrument. Five panels were selected based on their expertise in the field of psychology, crime, community development, social work, and statistical data analysis. Specific guidelines were also used for selecting the experts including; (i) experienced academicians (more than 5 years) and (ii) familiar with evidenced-based practice (teach or publish articles in their field of expertise) [5] . Table 3 shows the expertise and years of experience of the panels.
The criteria for face validity assessment for this study is based on Oluwatayo (2012) [3] guidelines that focus on six main aspects namely; (i) unambiguity items, (ii) appropriate grammar, (iii) correct sentence structure, (iv) correct spelling, (v) proper format and structure of the instrument, and (vi) appropriate font size. Moreover, the panel was also requested to provide additional suggestions and comments to improvise the instrument. The summary of the panel's comments for face validity is shown in Table 4 .
Amendments to the instrument were done after obtaining feedback from the panels. Following this, content validity was carried out to provide evidence about the degree to which the developed instrument is relevant to the targeted construct. The content validity of the instrument was established based on the Content Validity Index (CVI) where an item is considered not relevant if the CVI score is less than 0.78 [5] . In addition, a dichotomous rating of favorable or unfavorable was also used to quantify the content validity [6 , 7] . Favorable denotes that an item is relevant and concise [8] . As a result, these items are assigned a score of + 1.0 [7] . On the contrary, unfavorable denotes that an item is irrelevant or negligible [8] . Hence, these items were given a score of + 0.00 [7] .
For this study, a favorable rating by three or more members of the expert panel and a CVI greater than 78% = 0.78 indicates that the items (questions) are considered relevant/related to the topic of study. Table 5 stipulates the content validity index of the study.    What is the weapon that was used while commiting the crime? 5 1.00 3.
What is the main factor that leads you to commit the crime? 5 1.00 4.
What is the main factor/reason that leads to the failure in commiting the crime?

Research design
A cross-sectional survey design was used to complete the data collection process. According to Malhotra et al. (1996) [9] , a cross-sectional survey design is a method that involves data collection from a selected population within a specific time based on the attribution of the current respondent.

Population
In this study, the population refers to all the young offenders undergoing Community Service Order initiated by the Malaysian Social Welfare Department. A report obtained from the Malaysian Social Welfare Department disclosed that currently, a total number of 540 young offenders are actively undergoing the Community Service Order.

Sample and location of study
A sample refers to a smaller and manageable version of a larger group. According to Sangoseni et al. (2013) [7] , a sample is a subset containing the characteristics of a larger population. The sample size in this study was determined based on Sample Size Calculator developed by Cohen et al. (2001) [10] whilst taking into consideration the significant level at p < .05 (significant level = 95%). Based on Cohen's Sample Size Calculator, if the population of the study is 540 and the level of significance required is .05 thus the number of respondents needed for the study is 278 respondents. Taking into consideration aspects such as dropout rate and errors in filling up the survey by the respondents, the researchers agree to increase the sample size up to 10%. Therefore, the sample size for this study is 306 respondents. Assuredly, Abdul Ghaffar (1999) [11] have supported that enlarging the sample size will help to elevate the reliability and validity scores of a particular study.
Stratified random sampling was used to select the young offenders from four different zones in Malaysia namely; (i) North Zone, (ii) Central Zone, (iii) East Zone, and (iv) Southern Zone. According to Hayes (2020) [12] , stratified random sampling allows a researcher to obtain a sample that best represents the entire population that is being studied. In the context of this study, stratified random sampling was employed in order to create equitable representation from the total population since the number of young offenders within each zone was different.
Two institutions with the highest number of young offenders within each zone was selected as the location of study including; North Zone -Kedah and Pulau Pinang, (ii) Central Zone -Selangor and Federal Territory of Kuala Lumpur, (iii) East Zone -Pahang and Kelantan, and (iv) Southern Zone -Melaka and Johor. The cut-off number for an institution to be selected as the location of the study is at least by having a minimum number of 35 young offenders who are actively undergoing Community Service Order. These criteria were included since it is cost-effective to focus on zones with a higher number of young offenders. Table 6 depicts the location of the study.

Ethical considerations
High values and norms were upheld throughout the data collection process. The participation of the respondent in this study is strictly voluntary. Prior to participation, the researcher's explained to the respondents regarding the purpose of the study. After consent was given, respondents were assured that all their responses will be recorded confidentially and reported anonymously. Moreover, respondents were also informed that they could withdraw at any stage of the study without repercussions. Furthermore, no incentives were provided to encourage participation.

Procedure
The survey questions were disseminated by the researcher to the respondents after getting permission from Malaysian Department of Social Welfare (JKMM 100/12/2/2:2016/013). During the data collection process, the researcher's assist and clarify all the questions asked by the respondents regarding the survey questions. Moreover, respondents were also informed about their rights to confidentiality. Thus, all the respondents were reminded not to write their names or other personal information on the given materials. There was no time limit for the respondents to answer the survey questions. Approximately, respondents took about 15-20 minutes to complete the questionnaire.

Data analysis
Descriptive analyses were used to obtain information related to frequency and percentage. Data were analysed using Statistical Package for the Social Sciences (SPSS).

Declaration of Competing Interest
There is no conflict of interest regarding the research, publication, and authorship of this article.