Data Descriptor of Snakebites in Brazil from 2007 to 2020

: Snakebite envenomations (SBE) are a significant global public health threat due to their morbidity and mortality. This is a neglected public health issue in many tropical and subtropical countries. Brazil is in the top ten countries affected by SBE, with 32,160 cases reported only in 2020, posing a high burden for this population. In this paper, we describe the data structure of snakebite records from 2007 to 2020 in the Notifiable Disease Information System (SINAN), made available by the Brazilian Ministry of Health (MoH). In addition, we also provide R scripts that allow a quick and automatic updating of data from the SINAN according to its availability. The data presented in this work are related to clinical and demographic information on SBE cases. Also, data on outcomes, laboratory results, and treatment are available. The dataset is available and freely accessible; however, preprocessing, adjustments, and standardization are necessary due to incompleteness and inconsistencies. Regardless of these limitations, it provides a solid basis for assessing different aspects and the national burden of envenoming.

Envenomation by snakebites is a significant global public health threat, especially in tropical countries, due to their morbidity and mortality.A total of 4.5-5.4 million people get bitten by snakes annually; 1.8-2.7 million develop clinical illnesses; and 81,000-138,000 die from complications [1,2].Despite this high mortality, the World Health Organization (WHO) still classifies envenomations as being neglected due to low investments in research, control, and elimination [3,4].Brazil reports the highest number of snakebite cases in Latin America [5] and, unsurprisingly, is one of the countries with the most significant experience in diagnosing and treating snake envenomation.In 2020, 32,160 cases and 138 deaths by snake envenomation were recorded [6].
Data 2024, 9, 91 2 of 8 Envenomation from snakes, spiders, scorpions, caterpillars, bees, fishes, beetles, and ants requires compulsory notification in Brazil [5].Consequently, epidemiological data on envenomation, including clinical, laboratory, treatment, and demographic information, are available in the Information System of Notifiable Diseases (SINAN) from the Brazilian Ministry of Health (MoH) platform through the Department of Informatics of the Unified Health System (DATASUS) [7][8][9].SINAN is available for health units across Brazil to both enter and query data.The system allows continuous data consolidation and supports health surveillance and prevention efforts, identifying public health concerns, providing valuable morbidity and risk assessment information, and prioritizing and evaluating control action impact [9].The microdata are made publicly available with no sensitive data [10].Therefore, it can be widely used in several epidemiological studies without submitting it to ethical boards [1,11].Despite the high availability, the data quality must be assessed for inconsistencies and completeness before any analysis [12,13].The data descriptor presented here aims to improve the availability of a high-quality, comprehensive national snakebite dataset, thus allowing for a greater standardization and reproducibility of epidemiological studies.

Data Description
All data and R scripts associated with the dataset are stored in the GitHub repository [14].
The final dataset consists of 74 attributes grouped into socio-demographic and clinical/laboratory variables detailed in Tables S1 and S2, respectively.The socio-demographic and clinical/laboratory characteristics are presented in Table S3.Severity was also proportionally higher among patients who died (836/1535 [54%]) compared to the discharged group (23,003/332,814 [6.9%]), as shown in Table S4.
Slight variations can be seen along the timeline, with an increase in the last two years 2019 and 2020 (Figure 1).Also, most envenomations occurred among adults and were caused by the Bothrops genus, although significant numbers of other genera and non-venomous bites have been reported.Many cases without information on the specific snake were reported (Figure 2).Most cases were reported in the north and central west.As previously stated, most reports are from Bothrops and Lachesis genera (Figure 3).Most cases were reported in the north and central west.As previously stated, most reports are from Bothrops and Lachesis genera (Figure 3).Most cases were reported in the north and central west.As previously stated, most reports are from Bothrops and Lachesis genera (Figure 3).
There is little change in severity over time.There are increasingly fewer reports of severe cases for Micrurus across the whole timeline; however, for Lachesis, Crotalus, and Bothrops, there is an increase in the proportion of severity over time (Figure 4).The most severe cases with bad prognoses can progress to death.In this regard, most of the deaths were caused by Bothrops, with an increase in frequency and proportion in the last two years (Figure 5).There is little change in severity over time.There are increasingly fewer reports of severe cases for Micrurus across the whole timeline; however, for Lachesis, Crotalus, and Bothrops, there is an increase in the proportion of severity over time (Figure 4).The most severe cases with bad prognoses can progress to death.In this regard, most of the deaths were caused by Bothrops, with an increase in frequency and proportion in the last two years (Figure 5).

Methods
The data were collected from the SINAN; original files are available at https://datasus.saude.gov.br/transferencia-de-arquivos/ in DBC format, a compressed version of DBF files, accessed on 23 September 2021.
To download the files, the following procedures were performed: (1) in the "Fonte" (source) option, "SINAN-Sistema de Informações de Agravos de Notificação" was selected; (2) in "Modalidade" (modality), the option "Dados" (data) was selected; (3) in "Tipo de Arquivo" (File type), the option "ANIM-Acidente por animais peçonhentos" (Venomous animals accidents) was chosen; (4) in "Ano" (year), the period from 2007 to 2020 was selected; and (5) in "UF" (acronym for Brazilian states), all the options were selected.The data sets obtained comprise 378 organized files (representing the 27 states and 14 years of study).No individually identifiable information is made available in the dataset.
All data processing was performed using the "R 4.3.2"language in its integrated development environment (IDE) "RStudio 2023.09.1-494" [15].We imported and decompressed the datasets through the "read.dbc"library and then merged them into one single dataset using the "tidyverse" library, function "bind_row".This resulted in a dataset with 2,422,825 records.After filtering for snakebites, a dataset of 400,848 was produced.This second dataset was processed to create labels and standardize the variables through the "Hmisc" library, resulting in the final database.For the graphics, the "ggplot2" library was used.The maps required the use of municipal and state boundaries provided by the "rgdal" library (Figure 6).
For the risk factors analysis, the outcome considered was death due to snake envenomation.The variable "EVOLUCAO" was adjusted to exclude "Death from other causes" from the study.Additionally, all other fields filled with "unknown" were also transformed into NA values, thus allowing the right tests to be applied.The analysis was made with the R 4.3.2environment in its IDE RStudio 2023.09.1-494 [15].dataset using the "tidyverse" library, function "bind_row".This resulted in a dataset with 2,422,825 records.After filtering for snakebites, a dataset of 400,848 was produced.This second dataset was processed to create labels and standardize the variables through the "Hmisc" library, resulting in the final database.For the graphics, the "ggplot2" library was used.The maps required the use of municipal and state boundaries provided by the "rgdal" library (Figure 6).For the risk factors analysis, the outcome considered was death due to snake envenomation.The variable "EVOLUCAO" was adjusted to exclude "Death from other causes" from the study.Additionally, all other fields filled with "unknown" were also transformed into NA values, thus allowing the right tests to be applied.The analysis was made with the R 4.3.2environment in its IDE RStudio 2023.09.1-494 [15].
The SINAN undergoes constant changes related to system update processes, so it is important to provide a processed database.For this reason, we made the originals available on GitHub in a compressed folder called "ANIMAC.zip"[14].Tables S1 and S2 describe each variable in the database according to the codebook provided by the Ministry The SINAN undergoes constant changes related to system update processes, so it is important to provide a processed database.For this reason, we made the originals available on GitHub in a compressed folder called "ANIMAC.zip"[14].Tables S1 and S2 describe each variable in the database according to the codebook provided by the Ministry of Health (https://portalsinan.saude.gov.br/acidente-por-animais-peconhentos,accessed on 8 January 2024).The information in Table S3 was made from the database processed using the "R_script_for_Silva-Neto_et.al.05.07.2022.zip"script [14].Table S3 and the final dataset in CSV format, named "DATA_BASE.csv",along with Table S4 containing the data analysis are also available on GitHub [14].Previous publications corroborate these findings [16,17].
The variables removed from the raw dataset were: • TP_ACIDENT-filtering for snakebites, only a single value remained.
• CLI_LOCA_1 and CLI_OUTR_3-open field variables that present (i) a lack of standard filling procedures, (ii) subjectivity in the filling, and (iii) the presence of special characters that can cause loss of records when exporting data to other formats.• ANI_TIPO_1, ANI_ARANHA, and ANI_LAGART-removed due to the low percentage of completeness caused by the filter and the presence of records related to envenomation from sources other than snakes.

User Notes
Snakebites are responsible for significant social and economic impacts associated with sequelae and deaths [18,19].In Brazil, between 2007 and 2020, 400,848 cases of snakebites were reported to SINAN across the country, a number considered relatively high when compared to other Latin American countries [20].The wide distribution of snakes in Brazil is responsible for many snakebite cases [21].To meet the needs of epidemiologists and health managers, we offer this dataset with a script in open "R" language that allows standardization for future studies generating evidence for decision-making in public health.Knowing the vulnerability to snakebites, information about clinics, treatment, and access to snakebite serum is essential for improving services and reducing morbidity and mortality.

8 -Figure 1 .
Figure 1.The number of records in the dataset by type of snakebite reported in Brazil from 2007 to 2020.Bothrops envenomations were frequent in all years.In 2020, the highest number of total cases (32,160) was recorded; the lowest number of total cases during the 14 years was recorded in 2014.The high frequency of ignored snake envenomation cases is noteworthy.

Figure 2 .
Figure 2. The age structure of reported cases in this dataset, divided into three categories: young (up to 18 years of age), adults (aged between 20 and 59 years), and elderly (60 years and over).

Figure 1 .Figure 1 .
Figure 1.The number of records in the dataset by type of snakebite reported in Brazil from 2007 to 2020.Bothrops envenomations were frequent in all years.In 2020, the highest number of total cases (32,160) was recorded; the lowest number of total cases during the 14 years was recorded in 2014.The high frequency of ignored snake envenomation cases is noteworthy.

Figure 2 .
Figure 2. The age structure of reported cases in this dataset, divided into three categories: young (up to 18 years of age), adults (aged between 20 and 59 years), and elderly (60 years and over).

Figure 2 .
Figure 2. The age structure of reported cases in this dataset, divided into three categories: young (up to 18 years of age), adults (aged between 20 and 59 years), and elderly (60 years and over).

8 Figure 3 .
Figure 3.The number of reported snakebite cases by snake genus and by municipality in Brazil.Color intensity is related to the frequency of cases.The northern region has the highest number of cases, mainly caused by Bothrops and Lachesis genera.

Figure 3 .
Figure 3.The number of reported snakebite cases by snake genus and by municipality in Brazil.Color intensity is related to the frequency of cases.The northern region has the highest number of cases, mainly caused by Bothrops and Lachesis genera.

Data 2024, 9 , 8 Figure 4 .
Figure 4.The distribution of severity by type of snakebite.Non-venomous envenomations are classified as mild, although those from Micrurus have a higher proportion of severity.Lachesis and Cortalus cases are responsible for moderate cases.Bothrops are associated with cases with symptoms ranging between mild and moderate.

Figure 4 .
Figure 4.The distribution of severity by type of snakebite.Non-venomous envenomations are classified as mild, although those from Micrurus have a higher proportion of severity.Lachesis and Cortalus cases are responsible for moderate cases.Bothrops are associated with cases with symptoms ranging between mild and moderate.

Figure 4 .
Figure 4.The distribution of severity by type of snakebite.Non-venomous envenomations are classified as mild, although those from Micrurus have a higher proportion of severity.Lachesis and Cortalus cases are responsible for moderate cases.Bothrops are associated with cases with symptoms ranging between mild and moderate.

Figure 5 .
Figure 5.The number of deaths from envenomations over the focal period.The highest and lowest frequencies of envenomations were reported in 2019 (145 deaths) and 2014 (97 deaths), respectively.The primary cause of death is envenomation by Bothrops sp.

Figure 5 .
Figure 5.The number of deaths from envenomations over the focal period.The highest and lowest frequencies of envenomations were reported in 2019 (145 deaths) and 2014 (97 deaths), respectively.The primary cause of death is envenomation by Bothrops sp.