Adaptation and norm determination of the Boston Naming Test for healthy Lebanese adults aged between 50 and 88 years

The Boston Naming Test is a well-known neuropsychological test widely used to evaluate linguistic abilities, encompassing object naming and word retrieval in subjects representing various clinical pathologies. Our study has two main stages: (1) a pilot study aimed at adapting the BNT to the linguistic and cultural particularities of Lebanese society and (2) norm determination for the Lebanese version of the BNT through the analysis of participants’ responses. The primary goal of this study is to develop a Lebanese version of the BNT comprising 60 images adapted to the Lebanese language and culture. This version is based on normative data derived from healthy Lebanese adults aged between 50 and 88 years. The study seeks to assess the influence of age, gender, and education level on the naming performance of participants. In the pilot study, 103 Lebanese volunteers participated, while the normative study involved 280 healthy volunteers aged between 50 and 88 years. Three screening tests—Montreal Cognitive Assessment (MOCA), Language Experience and Proficiency Questionnaire (LEAP-Q), and Geriatric Depression Scale 15-item (GDS)—were administered to select participants meeting inclusion criteria. The findings revealed a statistically significant effect of age and education level on the BNT (Lebanese version) total score. The total score decreased with age and increased with education. However, the effect of gender was not significant, a result confirmed by the generalized linear model. This study successfully produced a Lebanese version of the BNT comparable to the original English version. Additionally, it provided normative data crucial for evaluating naming ability


Introduction
The Boston Naming Test (BNT) serves as a widely employed neuropsychological assessment tool specifically designed for evaluating language proficiency, encompassing object naming, and word retrieval capabilities (Kaplan et al., 1983;Soylu & Cangöz, 2018).Its widespread application in clinical settings and scientific investigations is attributed to its simplicity and heightened sensitivity in identifying language impairments across diverse age groups, including children, adults, and the elderly, who present with various clinical conditions such as communication disorders, aphasia, dementia, and brain lesions (Soylu & Cangöz, 2018;Thomas et al., 2019;Vestito et al., 2021).
Various countries have employed several methods to translate and adapt the BNT for use in their respective linguistic and cultural contexts.Some have implemented minor modifications, such as changing object names to align with the target culture, while others have undergone more significant changes (Roberts & Doucet, 2011;Thomas, 2019).In the Swedish version, for instance, the drawings remained unchanged, but the sequence in which they were presented was modified (Tallberg, 2005).Other adaptations involve replacing original test items with culturally relevant alternatives belonging to the same semantic category; for instance, the substitution of the American-centric "pretzel" with a culturally familiar food in some versions (Patricacou, 2007;Cruise, Worrall, & Hickson, 2000).Notably, the Brazilian adaptation of the BNT involved replacing 20 of the original 60 items with culturally relevant alternatives.These replacements were carefully chosen based on their frequency, ambiguity, familiarity, and similarity to the original picture.For instance, "pretzel" was replaced with "bolo" (cake), a more familiar food, while "rhinoceros" became "elefante" (elephant), a more commonly known animal, and "Pão de Açúcar" (Sugarloaf ) replaced the mythical "sphinx" (Miotto et al., 2010).The Greek adaptation of the BNT replaced four items: "pretzel" with "a kind of cake, " "door knocker" with "mailbox, " "stethoscope" with "blood pressure instrument, " and "scroll" with "ancient Greek column" to better reflect the cultural context (Patricacou, 2007).Finally, the Korean adaptation of the BNT by Kim and Na (1999) involved replacing 49 of the 60 pictures, which can again be categorized as a substantial change.
Demographic variables such as age, gender, and education have been investigated due to their potential impact on the performance of the BNT, irrespective of neurological conditions (Neil et al., 1995;Grima & Franklin, 2016).Research on the relationship between age and Boston Naming Test (BNT) scores reveals inconsistent findings; while numerous studies suggest a modest decrease in naming ability with advanced age, particularly in those aged 80 and above, some studies show no significant age effects or even better performance by older adults (Soylu & Cangöz, 2018;Thomas, 2019;Tallberg, 2005).These discrepancies are attributed to variations in research design and sample characteristics (Grima & Franklin, 2016).Similar inconsistent findings exist regarding the effect of gender on naming performance.Some studies suggest that men perform slightly better than women, while others find no difference between the sexes (Patricacou, 2007;Tallberg, 2005).This discrepancy has been explained in some studies by suggesting that items presented in the BNT are more familiar to men than women (Thomas, 2019;Olabarrieta-Landa et al., 2015).Education has also been shown to affect naming performance (Nour et al., 2021).Many studies have found a positive correlation between naming ability and the number of years of education attained.People with fewer years of education tend to have lower scores, especially those with less than 12 years of education (Hawkins & Bender, 2002;Neils et al., 1995).
Naming disorders are prevalent and extend beyond individuals with brain injuries or advanced age; they manifest in young adults without overt pathological indications.Anomia, characterized by the inability to name objects and the phenomenon of having a "word on the tip of the tongue" represent common facets of these disorders (Burke & Shafto, 2004).This cognitive deficit can instigate frustration and discontent among affected individuals, potentially serving as an early indicator of more severe neurodegenerative conditions, including Alzheimer's disease, aphasia, and dementia (Burke & Shafto, 2004).Therefore, it is imperative to employ a diagnostic test that assesses objectnaming proficiency in both normative adults and those with pathological disorders.
Lebanon exhibits high dementia rates among individuals aged 60 and above (Phung et al., 2017).The United Nations projects that the Lebanese population aged 65 and older will constitute 31.2% by 2050 (United Nations, 2017).The prevalence of naming disorders in Lebanon aligns with global patterns, emphasizing the necessity of employing a validated tool like the BNT, commonly used for identifying such disorders.With the cultural and linguistic variations, caution is required in the indiscriminate application of such instruments.Given the absence of standardized scoring norms for the BNT in Lebanon, our focus is on establishing its applicability for the adult Arabic-speaking Lebanese population.
The aims of the current study were (1) to describe the development of an adapted version of the BNT suitable for Lebanese Arabic speakers using a sample of healthy elderly Lebanese individuals and (2) to investigate the effects of age, education, and gender on the scores of the adapted version of the BNT as applied to Lebanese Arabic speakers.

Method
The adaptation and standardization of the BNT for the Lebanese language and culture were carried out through a three-phase approach, utilizing methodologies previously employed in other studies (Grima & Franklin, 2016;Soylu & Cangöz, 2018).It received approval from the Ethical Committee at the Hospital of Notre Dame de Secours in Byblos, Lebanon, on December 10, 2020.It consists of three main phases:

The first phase: translation
The first phase consists of the translation of the 60 items of the original BNT from English to Arabic by using two standard dictionaries (Ba'labakki, 2004;Reverso Dictionary n.d.).This translation allowed us to determine the Modern Standard Arabic (MSA) naming for each item.Based on Classical Arabic, MSA is the official written language used in government affairs, news, broadcast media, books, and education (Ibrahim, 2009;Kwaik et al., 2018).However, Arabic languages are characterized by diglossia: while MSA is their common means of formal communication, Arabic dialects are the medium of oral communication within each community (Kwaik et al., 2018).MSA shares with Lebanese Arabic a considerable number of lexical, semantic, syntactic, and morphological features, but several differences emerge as well (Boudelaa & Marslen-Wilson, 2013).Therefore, it was essential to check in the second phase that the Lebanese speakers agree on these translations and use them frequently.

Second phase: pilot study
Different populations may use more than one word to refer to the same object depending on the particular cultural characteristics.Hence, all the possible Lebanese responses for each item of the BNT were collected even if they differed from the translated ones.
A pilot study was conducted on a sample of volunteers from a Lebanese population to help determine the name agreement for each image from the original BNT and, from that, to elaborate an adapted version that takes into consideration the Lebanese dialect and culture.

Participants of the pilot study
This study was conducted among a sample of Lebanese volunteers of different ages, levels of education, and gender in alignment with previous studies (Patricacou et al., 2007;Miotto et al., 2010;Grima & Franklin, 2016;Soylu & Cangöz, 2018).All participants reside in Lebanon and speak Lebanese Arabic as their native language.
A total of 103 volunteers participated in the study, of which there were 63 women (61.2%) and 40 men (38.8%).The range of age was between 18 and 86 years (mean = 35.54;standard deviation = 1.8).The level of education was divided into four groups: primary level (12.6%) with basic education, supplementary education till grade 9, secondary level (16.5%) with high school education from grade 10 to grade 12, university level (67%) with a bachelor's or master's degree, and doctorate level (3.9%).

Procedure
The pilot study was conducted by asking participants to name each of the 60 pictures of the BNT in Arabic as quickly as possible (with a limit of 20 s) using the word that most spontaneously came to their mind.No semantic or phonemic cues were given in the pilot study as there is no current Lebanese version to base the cueing upon.
Responses for each image of the BNT were entered in Excel to be analyzed.Name agreements, possible alternative synonyms, and lexical variations were determined for each image based on the responses given by the participants (see Table 1).

Development of the Lebanese version of the BNT
In order to analyze and adapt the BNT to Lebanese-Arabic, all responses provided by the participants were subdivided into two categories, and each of these categories was further subdivided into several more subcategories.
The first category contains all the responses that are accepted based on their alignment with the following: -The name agreement (NA): a picture that prompts a consistent name from the majority of participants is considered to have high NA, whereas a picture that elicits multiple different names is considered to have low NA (Boukadi et al., 2016).For example, more than 80% of the participants agreed on naming « ‫»تخت‬ the image of "bed".-Synonyms or alternatives are defined as different correct names given to the same image that share the same meaning.For example, the image "bed" received another valid common name, which is ‫."سرير"‬The latter is the word found in the dictionary when translated in its MSA form.-Answers that were correct according to the image but were named using their foreign origin (Arabicized) can be attributed to the characteristic of bilingualism, which is prevalent in Lebanese society.For example, the word "helicopter" was named "helicopter" by 78% of the participants (80 out of 103).
The second category contains all responses with errors, where the response was nonconcordant or nonspecific to the target name of the image that had been presented.The incorrect responses or the responses with errors were divided into five categories.The five categories were inspired by previous studies, particularly the Swedish, Turkish, and Dutch versions of the test (Mariën et al., 1998;Soylu & Cangöz, 2018;Tallberg, 2005).1-Omission: no response or "I don't know." 2-Visuo-perspective errors or visual misperception: for example, the picture of a "pretzel" was misperceived as a "rope" ‫)حبلة(‬ or "snake" ‫.)حية(‬ 3-Intra-categorial errors or side-ordinated words: for example, giving the answer of "squirrel" ‫)سنجاب(‬ or "rat" ‫)فارا(‬ for the image "Beaver." 4-Superordinated word errors: for example, instead of "harp, " they said, "musical instrument" ‫موسيقية(‬ ‫,)الة‬ or for "sea horse, " they said, "animal" ‫.)حيوان(‬ 5-Periphrases: for example, instead of "muzzle, " participants described the image using periphrases such as "a thing to prevent the dog from biting" « ‫ما‬ ‫حتى‬ ‫للكلب‬ ‫بحطوا‬ ‫شي‬ ‫».يعض‬ The analysis was based on the frequency rate of responses given by the 103 participants to each of the 60 original items of the BNT.This was done by calculating the total score of the correct answers for each image (the target word or Name agreement, synonym answers, and responses named using their foreign origin (Arabicized names)) and the total of answers with errors (visual misperception, intra-category errors, superordinated errors, periphrases responses, wrong part, no responses, or "don't know") (see Table 1).
To perform a careful selection of the items to be included in the Arabic version of the BNT, the selection was based on the analyses of the difficulty index (p).The difficulty index (p) is equal to the sum of correct responses divided by the total number of participants.It indicates how much each image was successfully recognized, in other words, correctly named.It varies from 0 to 1; 1 meaning that the image was named correctly in 100% of cases and 0.5 meaning it was correctly named in 50% of cases.If the image took a value smaller than 0.5, it was eliminated and replaced by another image more accessible to the Lebanese population according to previous studies who used the same criterion for screening (Pedraza et al., 2011;Soylu & Cangöz, 2018) (see Table 1).
The image that had the most visual-perceptual error was the image of "pretzel" which is a familiar food in the USA, unlike Lebanon.The participants saw this image as a snake or a rope due to the shape and lack of exposure to this concept.The image that had the most intra-categorical errors was the image of an "abacus".The participants mistook the object for a toy.In addition, the word "abacus" does not have an equivalent in Arabic, so participants had to describe it in order to name it.Also, the "beaver" had a high rate of intra-categorial error, as the participants mistook this object for a "squirrel" or a "mouse".The naming of this item respects the semantic category of the concept presented but remains far from the correct name.The image that had the most periphrastic responses was "yoke".The participants recognized the object and its mode of use, but the majority of them were unable to find its name.As a result, they had to use periphrases in order to name it.The image that had the most superordinated responses was "wreath".Participants had difficulty identifying the exact word, and instead referred to the image with general terms such as "decoration, " ‫,"زينة"‬ or "a decorative object".Also, for "harp" and "harmonica, " the participants gave these images the generic name of "musical instrument." This is probably due to the fact that these two instruments are known in the West more than in the East where their use is very rare.These two instruments do not have equivalent common names in Arabic.
In total, 18 pictures were eliminated because they did not receive enough name agreement (NA) from the participants, with a very high error rate.In order to replace these words with semantically similar ones with the same level of complexity and frequency, a number of Lebanese words were selected from books, newspapers, and television, a procedure used before (Heaton et al., 2004;MacKay et al., 2005;Storms, Saerens, &De Deyn 2004;Tsang HL & Lee TM, 2003;).Afterwards, the images were replaced by others from the same semantic category ensuring that the word length, number of syllables, and phonemes were as similar as possible to the original item (see Tables 2 and 3).The substitution of images was as follows: pretzel to manouche ‫,"منقوشة"‬ sea horse to an alligator ‫,"تمساح"‬ wreath to chandelier ‫,"تريا"‬ beaver to hedgehog ‫,"قنفذ"‬ harmonica to flute ‫,"مزمار"‬ igloo to grotto" ‫,"مغارة‬ stilts to clown ‫,"مهرج"‬ harp to zither ‫,"القانون"‬ knocker to kebbe pestle ‫الكبة"‬ ‫,"مدقة‬ muzzle to cage ‫,"قفص"‬ dominos to chess" ‫,"شطرنج‬ unicorn to snowman ‫ثلج"‬ ‫,"رجل‬ tripod to binocular ‫,"ناضور"‬ yoke to saddle ‫,"سرج"‬ trellis to windmill ‫,"طاحون"‬ palette to paintbrush ‫,"ريشة"‬ protractor to compass" ‫,"بوصلة‬ and abacus to pulley ‫.»بكرة«‬To gather lexical and semantic norms for the new items, we enlisted 37 participants aged between 20 and 35 years.The cohort comprised 19 men and 18 women, all possessing a minimum of 13 years of education.Following the methodology outlined by Patricacou et al. (2007), these individuals were selected based on their age and educational background, deemed conducive to proficient task performance.For the familiarity and subjective frequency norms, we followed the same procedure of Chedid et al. (2022) who collected normative Lebanese Arabic data for 380 pictures.In the familiarity task, participants were asked to rate the familiarity of the concept depicted by each picture based on how common an object was in the language speaker's realm of experience (Bonin et al., 2003;Boukadi et al., 2016;Chedid et al., 2018).In the subjective frequency task, participants were asked to estimate the frequency of use of each word in the written or spoken language (Desrochers & Thompson, 2009).The results obtained allowed us to reorganize the items of the original BNT, as well as the newly replaced images.Adhering to the American Standard order and psycholinguistic variables, items were organized from the most familiar, highest-frequency object to the least familiar, lowest-frequency counterpart.This method mirrors the procedure implemented for the Portuguese version of the BNT (Patricacou, 2007).Thus, the Lebanese version of the BNT was created (see Table 4) and was used for the norm determination study to investigate the effects of age, gender, and level of education on naming performance.

Participants of this study
The sample of the norm determination study consists of 280 participants, all volunteers, aged between 50 and 88 years (mean age = 68.57;SD = 10.77).They were recruited from different Lebanese villages covering different regions of Lebanon.All participants were informed about the objectives of this study and signed a prior consent form.The sample consists of 138 women (49.3%) and 142 men (50.7%) in total, divided into four age groups (group 1 = 50 to 59 years, group 2 = 60 to 69 years, group 3 = 70 to 79 years, and group 4 = 80 years and over) (Grima & Franklin, 2016;Mariën et al., 1998;Patricacou , 2007;Roberts & Doucet, 2011;Slegers et al., 2018).They were recruited based on three levels of education (primary, secondary, and university), and all participants had to meet the inclusion criteria established by prior studies (Soylu & Cangöz, 2018;Slegers et al., 2018;Grima & Franklin, 2016;Baerecke, 2013;Miotto et al., 2010;Patricacou et al., 2007;Tallberg, 2005;Mariën et al., 1998;Tombaugh & Hubley, 1997Welch et al.,1996): (1) born and residing in Lebanon; (2) speak Lebanese Arabic as their native language; (3) have no particular cognitive impairment, nor any visual, auditory, neurological disorder, or other problem that may disrupt naming abilities; (4) no current antidepressant, antiepileptic, or anxiolytic treatment; (5) no excessive alcohol consumption or history of withdrawal; and (6) no drug consumption.Moreover, three screening tests were applied to determine whether the participants were elderly adults with healthy cognitive functions and fit to participate in the study.The first one was the Language Experience and Proficiency Questionnaire (LEAP-Q) in its standardized Arabic version.It collected data on the knowledge and the degree of mastery of our participants of the Arabic language as well as other languages and allowed us to divide them into groups (Kaushanskaya et al., 2020).Participants who did not report using Arabic for at least 70% of their daily life activities were excluded from our study.The second one is the Montreal Cognitive Assessment (MoCA).This test was to assess mild cognitive dysfunctions: attention, concentration, executive functions, memory, language, visuo-constructive abilities, abstraction skills or conceptual thinking, calculation, and orientation (Nasreddine  , 2005).The maximum score is 30 points.Participants who scored below 21 were excluded.The third test was the Geriatrics Depression Scale 15-item (GDS).This scale was designed to screen older adults for depressive feelings and suicidal intentions (Chaaya et al., 2008).Participants who received scores of 14 and above from GDS were not included in the study.The scores of MOCA (27.16 ± 1.34) and GDS (2.70 ± 1.22) varied between 23 and 30, as well as 0 and 5, respectively.Sociodemographic characteristics along with the mean and standard deviation values for the screening test scores have been described in Table 5.

Administration procedure
All participants were tested individually and were administered all 60 items of the BNT in order of increasing difficulty.Participants were asked to give the names of all the images presented to them in Lebanese Arabic.Each image was presented for twenty seconds.If an answer is not provided after this time, a semantic cue is given to make it easier to find the name.A semantic cue provides information related to the meaning of the target word, offering hints like superordinate categories ("vegetable" for "asparagus"), actions associated with the word ("you ring it" for "bell"), or definitions and sentence completions ("stethoscope" a medical instrument to listen to the heart and internal sounds) (Python et al., 2021).If, however, the answer is still not found, another phonemic cue was provided to facilitate word retrieval.A phonemic cue assists word retrieval by focusing on its sound, offering hints like the first phoneme(s) (it starts with "c" for "cactus"), the first syllable for longer words, or a rhyming word (it rhymes with "nail" for "snail") (Python et al., 2021).This process was suspended if the participant was in distress or refused to continue.The response for each item was recorded via a digital audio recorder and transcribed.The total score of the test represents the number of correct spontaneous responses and all correct responses obtained only after a semantic cue (Olabarrieta-Landa, 2015).All accepted responses of the BNT Lebanese version can be found in the Appendix 1.We organized participants' responses using our instruction grid, which covers all possible types of responses (see Appendix 2).This grid was designed based on methods used in previous studies by Slegers et al. (2018) and Nicholas et al. (1989).This process sorted responses into two scores: one from responses given without semantic or phonemic cues and another from those given with cues.The total score combined correct spontaneous responses and correct responses with semantic cues (Cronbach's alpha = 0.754).This score gauges overall performance on the BNT.

Statistical analysis
Data analysis for the norm determination study was conducted using SPSS software version 26.We used the following information in our study: -The sociodemographic characteristics of the participants -The total score of the BNT (score of spontaneous responses and after a semantic cue) -The four sub-scores (the numbers of the semantic and phonological cues given and the numbers of the correct responses following the semantic and phonological cues), -error scores (no response, visual misperception, intra-categorical errors, superordinate errors, phonological errors, and periphrastic responses) -The means, standard deviations (m ± SD), medians, interquartile deviations (me ± IQR), and minimum and maximum were reported for continuous variables, while frequencies (n) and percentages (%) were reported for categorical variables.Cronbach's alpha assessed the total score.
The normality of continuous variables was assessed using the Shapiro-Wilk test.Since the distribution of scores was skewed, as well as the variance of these scores not being equal between the groups studied, the generalized linear model (GzLM) was used to quantify the relationship between age (50-59; 60-69; 70-79; and 80 and above), education level (primary, secondary, university, and above), and gender (male; female) with the total Lebanese BNT score.Thus, the β coefficient and the 95% confidence interval (95% CI) were reported.The age × gender × education interaction was tested; in addition to the Akaike information criterion (AIC), the Bayesian information criterion (BIC) and the corrected Akaike information criterion (AICC) were used to choose the best model (the smallest value corresponds to a better model) (Neil et al., 1995).The correlation between BNT sub-scores and the correlation between sociodemographic variables with BNT subscores and error scores was tested.The Spearman correlation coefficient was reported when there was a correlation between two continuous variables, the point-biserial correlation coefficient when there was a correlation between a continuous variable and a dichotomous variable, and the Kendall correlation coefficient when a correlation between a continuous variable and an ordinal variable was reported.Finally, a P-value below 0.05 was considered statistically significant (Tallberg, 2005;Tombaugh & Hubley., 1997).

Results of the norm determination study for the Lebanese version of the BNT
The mean (standard deviation), median (interquartile deviation), minimum, and maximum of the total score of the BNT (Lebanese version), as well as its sub-scores and the scores of errors made by the participants, were all described in (Table 6).The results showed that the total score of the BNT (Lebanese version) varied between 31 and 60, with a mean of 51 ± 6.09 and a median of 52 ± 8. The mean number of phonological cues (phonemic cue) (9.16 ± 6.08) provided was higher than the average number of semantic cues (4 ± 5.15).The intra-categorical errors had the highest mean (4.16 ± 3.44) among all error types, while the phonological errors had the lowest mean (0.27 ± 0.60).A good internal consistency was found for the Lebanese version of the BNT as indicated by Cronbach's alpha at 0.754 (Pedraza et al., 2011;Tombaugh & Hubley, 1997).
The generalized linear model (GzLM) was applied to investigate the effect of age, gender, and education on BNT (Lebanese version) performance (Peña-Casanova et al., 2009;Tsapkini et al., 2011).The results show that the total score of the BNT (Lebanese version) decreases with age and increases with education.It has been observed that the BNT score decreases by 2.3 in people aged between 60 and 69 years (p = 0.001), by 7.2 in those aged between 70 and 79 years (p < 0.001), and by 11.4 in those aged over 80 years (p < 0.001) when compared to those aged between 50 and 59 years.
On the other hand, the BNT score increases by 2.12 in people with secondary education (p = 0.001) and by 3.50 in those with a university education (p < 0.001) when compared to those with primary education.
There were no significant differences found between men and women (p = 0.07).It is worth noting that the interaction between age × gender × education was tested and found to be non-significant.Additionally, the model not including interaction (Table 7) has the lowest indices of AIC, BIC, andAICC (1615 vs. 1595;1621vs. 1595;and1706 vs. 1624 respectively) when compared to the model with interaction.
The norms for sub-scores and errors of the BNT (Lebanese version) by age, gender, and education were also analyzed.The results are presented in Tables 8 and 9.
The study found that the number of semantic and phonological cues provided, as well as the number of errors, increase with age and decrease with education.The results showed that there are significant positive correlations between age and   sub-scores, as well as error scores, on the Lebanese version of the BNT.However, a negative correlation was found between education and these scores, as shown in Table 10.
Additionally, it is worth noting that there was a strong linear correlation between the number of semantic cues given and the number of correct responses following the semantic cues (r = 0.834; p < 0.001), as well as between the number of phonological cues and the number of correct responses following the phonological cues (r = 0.962; p < 0.001), as evidenced by the statistically significant correlations (Fig. 1).

Discussion
This research facilitated the development of a Lebanese iteration of the Boston Naming Test (BNT) that aligns comparably with the original American version.Additionally, normative data were collected based on a cohort of healthy Lebanese adults, stratified according to age, gender, and educational level.
The response analysis results guided the selection of an optimal Lebanese translation aligning with the typical naming tendencies of the Lebanese population.A noteworthy distinction lies in the fact that the American version of the BNT suggests a single correct name for each image, while Lebanon exhibits linguistic and cultural diversity, allowing for multiple correct names for the same image due to regional and cultural variations.For instance, the Arabic term for "dart" ("sahem"; ‫)"سهم"‬ can be denoted by diverse names such as "nablat, " "nashabat, " and "sanka" ‫"سنكة"(‬ ‫"نشابة",‬ ‫.)"نبلة",‬To ensure precision, a comprehensive list of accurate responses has been compiled, encompassing the target word and all its synonyms.
An examination of correct responses within our sample of participants revealed a mean score and standard deviation of correct responses (51 ± 6.09), closely aligned with international norms observed in various studies: 52.76 in Worrall et al. (1995); 49.6 in Mariën et al., 1998;53.0 in Neils et al. (1995); 53.04 in Tallberg (2005); and 54.5  (1989).Furthermore, the reliability of the Lebanese Boston Naming Test (BNT) demonstrated acceptability and comparability to the original version.The internal consistency, as measured by Cronbach's alpha, was determined to be 0.75 in our study, closely resembling the reported value of 0.78 in healthy individuals for the original BNT (Tombaugh & Hubley, 1997).
While it is true that the average age of participants differed between the pilot study and the normative study, it is important to note that the primary reason for this variation stems from the distinct objectives of each study.The pilot study aimed to assess initial feasibility and gather preliminary data; hence, participants were recruited with a focus on diversity rather than homogeneity in age (Patricacou et al., 2007;Miotto et al., 2010;Grima & Franklin, 2016;Soylu & Cangöz, 2018).In contrast, the normative study aimed to establish standardized benchmarks for elderly individuals by recruiting participants aged between 50 and 88 years, thereby ensuring representation of the target population.Therefore, the difference in participant age reflects the deliberate divergence in study objectives rather than a flaw in methodology (Li et al., 2022;Mariën et al., 1998;Roberts & Doucet, 2011).The pilot study not only served to devise scoring and administration rules but also provided guidelines tailored to the psycholinguistic and cultural variables inherent in the Lebanese population for the BNT's Lebanese version.Accordingly, we restructured the item list maintaining conformity with the original version's order.Consequently, certain items underwent repositioning in the Lebanese version compared to the American version.Notably, "tongs" was elevated from its original position at number 55 to number 10 due to its heightened frequency in Lebanon, as affirmed by our pilot study.Conversely, "octopus" was relocated from number 23 in the American version to number 47 in our version.The reordering also allowed us to demonstrate that the difficulty of the new items was similar to those they replaced.
The derived Lebanese version of the Boston Naming Test (BNT) enabled an examination of the impact of age, gender, and educational attainment on the cognitive performance of healthy adult participants within our cohort.Our investigation revealed significant influences of age and educational level on accurate response rates, aligning with existing literature (Neils et al., 1995;Tombaugh & Hubley, 1997;Welch et al., 1996).Specifically, older adults exhibited diminished naming performance compared to their younger counterparts.Higher error rates among older individuals correlated with an increased reliance on semantic or phonemic cues to compensate for cognitive deficits.Conversely, participants with elevated education levels demonstrated superior BNT scores, reduced dependence on cues, and fewer errors than those with lower educational levels.Notably, adults aged over 70 with more than 12 years of education outperformed peers with primary education, indicating a clear association between age, education, and declining performance.However, variations in performance decline were evident among those aged over 70, with individuals possessing higher education levels exhibiting better naming proficiency.Importantly, all healthy elderly participants aged over 70, regardless of education, retained lexical-semantic knowledge and naming ability with aid cues.Retrieval following phonemic cues yielded notably high correct response rates.Participants with higher education levels exhibited shorter response times after receiving aid cues compared to those with lower education levels.
In examining the impact of gender on naming performance, our study found no statistically significant difference between men and women, aligning with findings from previous studies (LaBarge et al., 1986;Ross et al., 1995).However, our results contradict studies suggesting significant gender effects on naming (Mariën et al., 1998;Welch et al., 1996).A possible explanation for this inconsistency is proposed by certain studies, positing that men and women have divergent interests, particularly concerning images exclusively associated with men.Such images represent specific points of interest for men but are less engaging for women, given that they fall outside their spheres of occupation or interest.The lack of statistical significance in the gender effect within the Lebanese version may be attributed to nearly equal familiarity with the replaced images among men and women in our study.
These findings underscore the significance of conducting normative studies tailored to specific cultural and linguistic contexts, employing extensive samples that encompass individuals of diverse ages and educational backgrounds.The present outcomes imply that the adapted BNT may exhibit enhanced suitability for clinical utilization within the Lebanese Arabic-speaking population.However, the clinical validity of this adapted version awaits confirmation through future investigations across diverse patient cohorts.
This study aimed to standardize the Boston Naming Test (BNT), a widely utilized neuropsychological assessment for evaluating linguistic capabilities in naming and word retrieval, especially applicable to various clinical pathologies such as communication disorders, aphasia, and language impairments resulting from conditions like stroke,

Fig. 1
Fig. 1 Correlation matrix between the given cues and the generated correct responses.1 semantic cue given, 2 number of the correct responses following the semantic cues, 3 phonological cue given, 4 numbers of the correct responses following the phonological cues

Table 1
Analysis of the frequency rate of responses to the 60 items of the original BNT

Table 1 (continued) Correct responses Incorrect responses Items No NA Syn NAR Omi WP VM ICE SOE P Total correct response Total errors Difficulty index (p)
NO number of responses, NA name agreement or target word, Syn synonyms, NAR non-Arabic responses, Omi omission, WP wrong part, VM visual misperception, ICE intra-categorial errors, SOE superordinated errors, P periphrases, total correct responses, total errors, p difficulty index ** next to the items designates the words that have been substituted a Target word is Arabized

Table 2
Normative data for psycholinguistic variables in Lebanese Arabic for the 60 items of the original BNT

Table 2
(continued) LA intended name: Lebanese Arabic intended name from dictionary translation; LA model name: the Lebanese Arabic model is the actual name given by the majority of participants to the item presented in Lebanese Arabic; Name agreement (%) refers to the consistency with which different people agree on the name of an object depicted in an image indicating how often a specific name is chosen for the image compared to all responses et al.

Table 3
Normative data for psycholinguistic variables in Lebanese Arabic for the newly replaced items LA intended name: Lebanese Arabic intended name from dictionary translation; LA model name: the Lebanese Arabic name given by most participants in Lebanese Arabic

Table 4
The Lebanese version of the BNT with the sum of frequency and familiarity for each item

Table 5
Description of sociodemographic characteristics along with mean and standard deviations of screening test scores of the participants (N = 280) MOCA Montreal Cognitive Assessment, GDS Geriatrics Depression Scale

Table 6
Description of the BNT (Lebanese version) scores/sub scores and errors (N = 280)

Table 7
The results of the generalized linear model (GzLM) for the total score of the BNT (Lebanese version)β the estimated effect size of age, gender, and education on the BNT performance, IC the range of values

Table 8
Norms for the sub-scores of the BNT (Lebanese version) by age, gender, and education (N

Table 9
Norms for the error scores of the BNT (Lebanese version) by age, gender, and education (N

Table 10
Correlation between sub-scores and errors of the BNT (Lebanese version) with age, gender, and education inNicholas et al.