Personal name in Igbo Culture: A dataset on randomly selected personal names and their statistical analysis

This data article contains the statistical analysis of Igbo personal names and a sample of randomly selected of such names. This was presented as the following: 1). A simple random sampling of some Igbo personal names and their respective gender associated with each name. 2). The distribution of the vowels, consonants and letters of alphabets of the personal names. 3). The distribution of name length. 4). The distribution of initial and terminal letters of Igbo personal names. The significance of the data was discussed.


Specifications
Computational Linguistics, pattern analysis in naming Type of data Table and MS Excel How data was acquired The data was obtained from freely available textbooks, online baby name websites, oral interview, published articles and online discussion forum. Data format Raw, partial analyzed Experimental factors Simple random sampling of some selected Igbo personal names. The alphabets were presented in their written form (the way they are written in English). Experimental features Statistical analysis of the distribution of the following: characters for each name, consonants, vowels, initial letters, terminal letters and total or word length. Comparative ranking of frequency of occurrence. Data source location N/A Data accessibility All the data are in this data article

Value of the data
The datasets can serve as a reference for Igbo baby names. Similar statistical analysis can be applied to other identified names in other languages. The dataset can be helpful to the following fields, linguistics, Igbo language studies and lexicology, Anthroponymy, Onomastics, etymology, Igbo name neologism, semantics and morphology of identified Igbo names and so on. See [1] and [2] for research on patterns of writing language texts.
The data can be used to study the effects of God called "Chi" or "Chukwu" in Igbo personal naming.
This can be achieved by studying the occurrence of such names compared with others.
The data can provide insight on the effect of Christianity and Pentecostalism in Igbo personal naming.
The reference section can serve as useful resources for researchers in this area.

Data
The data contained in this article are listed as follows. The dataset of randomly selected Igbo personal names and their respective gender associated with each name, and the distribution of the name length. This data can be assessed as Supplementary data 1. Secondly, the distribution of the vowels, consonants and alphabets of the Igbo personal names was included in this data article.
This data can be assessed as Supplementary data 2. Lastly, this data article contains the distribution of initial and terminal letters of Igbo personal names. In addition, tables showing the statistical analysis of the above listed datasets were also included.

Detailed data description
Personal names can be classified as given name, first name, middle name, forename, Christian name, local name or adopted name. These are opposite of last name, surname, family name or clan name. Igbo is one of the major tribes in Nigeria and the language is spoken by over 25 million and characterized by dialects. The Igbo people are originally from the eastern part of Nigeria but can be found in virtually every country of the world. Similar to any other ethnic groups in Africa, naming in Igbo is premeditated venture that is designed to speak to the future of the newly born child. Igbo people are not careless in naming because of their belief that names are tied to destinies and as such have religious, philosophical, psychological, historical, social and linguistic interpretations. Personal names in Igbo land are characterized by the following: 1). Names are clustered along the lines of dialects, largely because of geographical proximity, migration and historical ties. 2). Sentential names are heavily been replaced with Pentecostal names. 3). The influence of God called "Chi" or "Chukwu" is very strong in Igbo personal names. 4). Superstitious beliefs also influence the naming system. 5). Sociological effects such as procreation and the importance of children over barrenness, wealth, status, riches for example "Nwako", caste system like "Osu", "Umeh", traditional post or monarchial lineage, for example; "Adaeze", "Ezedinobi", innuendo or response to mockery, childlessness or taunting for example "Iroahushi", superiority of their siblings, clan or kingship or kinsmen over others, or their wealth, beauty, riches, sexual or intellectual abilities for example "Akubuilo", "Ofunneka". 6). Igbo personal names are gender sensitive because of the patriarchal nature of Igbo people. The males are often named based on issues such as: gods or deities, physical and spiritual objects, intellectual prowess and dexterity in trade or agriculture, natural or mysterious phenomena, sportsmanship and craftsmanship, animals and so on. On the other hand, female names are often associated with good lineage, fruitfulness, beauty and intelligent, moral responsibility, favor, good luck and tidings, joy, happiness, wealth, purity and so on. 7). Maternal lineage or descent can influence the personal names given to Igbo people. 8). Historical or geographical events for example waterfall, market days, birth of prince or princess, disease outbreak, war, famine, draught, great harvest, fruitful period and so on. Numerous investigators have worked on various aspects of personal names and naming in Igbo but the actual distribution and frequency of the alphabets that made up each name have not been reported to the best of the knowledge of the authors.

Experimental design, materials and methods
This research was as a result of rigorous research gaps observed from the works of numerous authors. Few of which are listed .

The random sample of Igbo personal names
The limitations of accessing the target population is compensated with a well-defined sample which must be a true representative of the studied population. See [49][50][51][52][53][54][55][56][57][58][59][60][61][62][63][64][65][66][67] for some selected survey research done to study some observed population attributes. Simple random sampling of some selected Igbo personal names yielded 965 names which are subsets of larger population. The samples were collected in such a way as to reflect the dialectal classification of Igbo people. The data was obtained from freely available textbooks, online baby name websites, oral interview, published articles and online discussion forums.

Distribution of name length of Igbo personal names
Statistical analysis of the personal name (word) length of Igbo people are summarized in Table 1. This was done using simple statistical tools.
On the average, a randomly selected Igbo personal name will have a word length of eight. The description can be done using histogram as shown in Fig. 1.
It is most likely that the word length will be greater than eight as seen in the histogram (skewness).  Table 2 Lower case letters of the alphabets of Igbo language. a, b, ch, d, e, f, g, gb, gh, gw, h, i, ị, j, k, kp, kw, l, m, n, ṅ, nw, ny, o, ọ, p, r, s, sh, t, u, ụ, v, w, y, z Table 3 Lower case letters used for this article (written form in English form). a, b, c, d, e, f, g, h, i, j, k, l, m, n, ṅ, o, p, r, s, t, u, v, w, y, z Surprisingly none of the 965 names contain the letter "v".

Distribution of letters of alphabets and their comparative ranking in Igbo personal names
Igbo language is made up of 36 letters of the alphabets comprising of 8 vowels and 28 consonants. This is shown in Table 2.
The research was restricted to 25 letters of the alphabets of the written form of Igbo language (Anglo-Igbo) version which is currently used in child registry, school registration, international passport and national identification and so on. The form is shown in Table 3.
Excel command was used to determine the frequency of letters of alphabets of Igbo personal names. The command is: ¼ SUMPRODUCT(LEN(A2) -LEN(SUBSTITUTE(A2, "letter", ""))). The result was presented along with their corresponding comparative ranks. This is shown in Table 4. The rank is from the most frequent to the least.

Distribution of double letter consonants in Igbo personal names
Igbo language comprises of nine double letter consonants. These are shown in Table 5. Excel command was used to determine the frequency of double letter consonants of alphabets of Igbo personal names. The command is: ¼SUMPRODUCT(LEN(A2) -LEN(SUBSTITUTE(A2, "letters", "")))/2. The result was presented along with their corresponding comparative ranks. This is shown in Table 6.  The high frequency of occurrence of "ch" is a pointer to the influence of God in the naming systems of Igbo people. This is because of the presence of "Chi", Chukwu, Chuku in almost 50% of Igbo personal names.

Distribution of consonants and vowels in Igbo personal names
It should observed from Table 4 that the 5 vowels are rank first to fifth and the consonants are ranked after that. However this can be clearly seen in the histogram. The histogram of the   distributions of consonants and vowels of 965 randomly selected Igbo personal names are shown in Figs. 2 and 3. The total number of vowels and consonant and their respective percentages were shown in Table 7. There are a total of 8049 letters. On the average any random selection of Igbo personal name would likely comprised of 48% vowel and 52% consonant.
2.6. Distribution of initial and terminal letters in Igbo personal names and their comparative ranking Initial and terminal letters constitute a major component of the study of words, nouns, proper nouns and personal names. Excel command was used to determine the frequency of initial and terminal letters of Igbo personal names. The command for the initial letter is: ¼COUNTIF(A2: A966, "letter*"). The command for the terminal letter is: ¼ COUNTIF(A2: A966, "*letter"). The result was presented along with their corresponding comparative ranks. This is shown in Table 8.
Areas of similarity and differences and relationship of the initial and terminal letters can be obtained by further analysis and use of statistical methods like correlation and chi-square.