The development and assessment of temperament tests for adult companion dogs

doi:10.1016/j.jveb.2006.09.002

Journal of Veterinary Behavior

Volume 1, Issue 3, November–December 2006, Pages 94-108


Abstract

Temperament tests have been created by a range of organizations and individuals in order to assess useful, predictable behavioral tendencies in working dogs and, increasingly, in companion dogs. For the latter group, such tests may help to select suitable pets from rescue centers or to identify those already in the population that are, or are likely to be, unsuitable as pets (e.g., those with behavior problems involving aggression). Unfortunately, many of these tests seem to have been developed without a systematic scientific approach. Perhaps as a result there are few reports of these tests in the scientific literature and even fewer that fully report their reliability and specific aspects of validity. This pattern is unfortunate, because the outcome of tests for companion dogs may have the potential to affect their welfare and survival. This paper attempts to encourage a more scientific approach to the development, conduct, and evaluation of temperament tests for adult companion dogs. Five key measures of the quality of a temperament test (purpose, standardization, reliability, validity, and practicality) are identified and explained in detail. Methods for the assessment of these qualities are given together with discussion of their limitations.

Introduction

The ability to select a dog for a particular role, particularly from a very young age, is an attractive idea for breeders and trainers. What might make this a feasible endeavor is the idea that individuals possess stable behavioral tendencies, i.e., they have what has been called “temperament.” Temperament is defined as differences in behavior between individuals that are relatively consistently displayed when tested under similar situations (Diederich and Giffroy, 2006). Using this definition, these differences are considered to be the product of both genetically determined and acquired behavioral traits (Stur, 1987), and therefore the age at which they can be considered to be stable is still debatable. Terms such as “personality” (Gosling and John 1999, Svartberg 2005), “character” (Ruefenacht et al., 2002) and “emotional predispositions” (Sheppard and Mills, 2002) have also been used in the same context. Temperament is made up of traits that are “correlations of internal factors that cause consistent individual differences in behavior” (Eysenck, 1994). In an attempt to identify these traits, interested parties have developed behavioral tests that take multiple measures of the dog’s behavior during a series of shorter tests, or subtests (Ledger, 1997). Often these measures are subjected to factor or principal component analysis, which are data reduction techniques that statistically identify consistently correlated measures within a data set and place them into factors (Goodloe and Borchelt, 1998). The composition of these factors can be used to describe the various behavioral traits exposed by the test and to predict the dog’s behavior in another, similar situation.
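
As a minimal illustration of the data reduction step described above, the following Python sketch extracts the first principal component from the correlation matrix of three subtest measures. The measure names and scores are hypothetical and are not taken from any cited study. Power iteration is used only to keep the example free of dependencies; in practice a statistical package would normally be used, and a full analysis would extract and interpret more than one component.

```python
import math

# Hypothetical scores for six dogs on three subtest measures (illustrative only).
measures = {
    "approach_latency": [1, 2, 3, 4, 5, 6],
    "startle_score": [2, 4, 6, 8, 10, 12],
    "play_interest": [3, 1, 2, 2, 1, 3],
}


def correlation_matrix(columns):
    """Pearson correlations between every pair of measures."""
    n = len(columns[0])
    devs = []
    for col in columns:
        mean = sum(col) / n
        devs.append([x - mean for x in col])
    norms = [math.sqrt(sum(d * d for d in dev)) for dev in devs]
    size = len(columns)
    return [
        [sum(a * b for a, b in zip(devs[i], devs[j])) / (norms[i] * norms[j])
         for j in range(size)]
        for i in range(size)
    ]


def first_component(matrix, iterations=200):
    """Power iteration: dominant eigenvalue and unit eigenvector of a symmetric matrix."""
    vec = [1.0] * len(matrix)
    for _ in range(iterations):
        product = [sum(m * v for m, v in zip(row, vec)) for row in matrix]
        norm = math.sqrt(sum(x * x for x in product))
        vec = [x / norm for x in product]
    product = [sum(m * v for m, v in zip(row, vec)) for row in matrix]
    eigenvalue = sum(p * v for p, v in zip(product, vec))
    return eigenvalue, vec


corr = correlation_matrix(list(measures.values()))
eigenvalue, weights = first_component(corr)
explained = eigenvalue / len(corr)  # share of the total standardized variance

for name, weight in zip(measures, weights):
    print(f"{name}: weight {weight:.2f}")
print(f"variance explained by first component: {explained:.2f}")
```

In this constructed data set the first two measures are perfectly correlated and the third is uncorrelated with both, so the first component weights the first two measures equally and gives the third a weight of approximately zero. Measures that group together in this way are what would be interpreted as a single factor or trait.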

Tests to identify particular characteristics of interest, such as “sharpness” and “courage” have been a common feature of working dog associations and breed groups (Willis 1995, Wilsson and Sundgren 1997, Brenoe et al 2002, Ruefenacht et al 2002, Svartberg and Forkman 2002, Courreau and Langlois 2005, Fuchs et al 2005). These might include assessment of the dog’s hunting, tracking, or aggressive ability. Tests have also been developed for the assessment of the suitability of dogs as police (Slabbert and Odendaal, 1999), guide (Pfaffenberger et al 1976, Goddard and Beilharz 1983, Goddard and Beilharz 1984, Goddard and Beilharz 1985, Knol et al 1988, Murphy 1995), therapy (Fredrickson 1993, Schaffer and Phillips 1994), or assistance dogs (Weiss and Greenberg 1997, Weiss 2002, Lucidi et al 2005). Over the past 15 years, interest has increased in the development of tests to specifically determine the suitability of dogs as pets. Many of these tests have focused on the assessment of problem behaviors, particularly those involving aggression, which may be associated with an increasing trend toward legislation to ban supposedly dangerous breeds. The possibility of assessing both undesirable or negative and desirable or positive behavioral traits has also been of particular interest to rescue and re-homing groups (Sternberg, 2002). It is hoped that behavioral assessments conducted in the shelter may then help staff match dogs to potential owners (Ledger, 1997) and/or predict behavior that might be problematic in the new home. The results of such assessments have the potential to directly affect the welfare of the dog, because problem behaviors can result in punishment (Hsu and Serpell, 2003), euthanasia, or (repeated) relinquishment to shelters (Arkow 1994, Miller et al 1996, Salman et al 1998). Similarly, the welfare implications of an inaccurate assessment of potential aggressiveness can be disastrous for humans who encounter the dog.

For these reasons, if not for reasons of academic integrity, it is important that published temperament tests be accompanied by appropriate statistical evidence to support their specific claims, something highlighted by Goodloe (1996). Martin and Bateson (1993) have identified three specific measures (reliability, validity, and feasibility) that determine the quality of a behavior test. These measures determine whether a test is a good measure, the right measure, and a useful measure (Appendix A).

Reliability concerns the degree to which the test scores are free from errors of measurement (APA, 1985). To determine reliability, one must identify the consistency of the results across subtests, tests, observers, assessment centers, etc. Measures of reliability include consistency within the observer of the test (intraobserver), between observers (interobserver), within the dog (test-retest), and within components of measures designed to assess the same behavior (internal consistency). Evidence of the consistency, and hence the predictability, of the dog’s behavior is what differentiates a temperament assessment from a behavioral one, although this fact is not always explicitly stated (Hsu and Serpell, 2003). Demonstration of test-retest reliability is therefore key for a temperament test (Marston and Bennett, 2003). Additionally, if tests are not reliable, they will not be valid (Diederich and Giffroy, 2006).
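
The forms of reliability listed above can each be summarized with a simple statistic. The Python sketch below is an illustration under stated assumptions: the choice of Cohen's kappa for interobserver agreement, Pearson's correlation for test-retest consistency, and Cronbach's alpha for internal consistency is ours and is not prescribed by the sources cited here, and all scores are hypothetical.

```python
import math
from collections import Counter
from statistics import variance


def cohen_kappa(rater_a, rater_b):
    """Agreement between two observers on categorical scores, corrected for chance."""
    n = len(rater_a)
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    counts_a, counts_b = Counter(rater_a), Counter(rater_b)
    expected = sum(counts_a[c] * counts_b[c] for c in counts_a) / (n * n)
    return (observed - expected) / (1 - expected)


def pearson_r(x, y):
    """Correlation between scores from a first test and a retest."""
    mean_x, mean_y = sum(x) / len(x), sum(y) / len(y)
    dx = [v - mean_x for v in x]
    dy = [v - mean_y for v in y]
    numerator = sum(a * b for a, b in zip(dx, dy))
    return numerator / math.sqrt(sum(a * a for a in dx) * sum(b * b for b in dy))


def cronbach_alpha(items):
    """Internal consistency of several measures intended to assess the same behavior."""
    k = len(items)
    totals = [sum(scores) for scores in zip(*items)]
    item_variance = sum(variance(item) for item in items)
    return k / (k - 1) * (1 - item_variance / variance(totals))


# Hypothetical data: two observers scoring the same eight dogs.
observer_a = ["calm", "calm", "fearful", "aggressive",
              "calm", "fearful", "calm", "aggressive"]
observer_b = ["calm", "calm", "fearful", "calm",
              "calm", "fearful", "fearful", "aggressive"]
kappa = cohen_kappa(observer_a, observer_b)

# Hypothetical data: five dogs scored on the same subtest on two occasions.
retest_r = pearson_r([1, 2, 3, 4, 5], [2, 1, 4, 3, 5])

# Hypothetical data: three measures of one trait, five dogs each.
alpha = cronbach_alpha([[1, 2, 3, 4, 5], [2, 2, 3, 5, 5], [1, 3, 3, 4, 4]])

print(f"interobserver kappa: {kappa:.2f}")
print(f"test-retest r: {retest_r:.2f}")
print(f"Cronbach's alpha: {alpha:.2f}")
```

Intraobserver reliability can be estimated with the same `cohen_kappa` function by passing two sets of scores produced by one observer on separate occasions.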

Validity concerns the appropriateness, meaningfulness, and usefulness of the specific inferences made from the test results (APA, 1985). Temperament tests need to ensure that they are actually assessing the trait(s) of interest (e.g., fearfulness) if they are to be valid. Validity assessments for temperament tests are fraught with difficulty, because it is unlikely that any test will be wholly predictive of a dog’s behavioral reaction in any given circumstance. The aim of a temperament test is therefore to improve our knowledge of the dog and its likely future behavior above that of chance alone. The probability of achieving this goal increases when predictions are limited to a specific context.
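
One way to examine whether a test predicts later behavior better than chance is to cross-tabulate test predictions against observed outcomes. The Python sketch below is illustrative only: the counts are hypothetical, and the choice of sensitivity, specificity, predictive values, and Youden's J as summary statistics is our assumption rather than a method taken from the cited sources.

```python
def predictive_summary(tp, fp, fn, tn):
    """Summarize a 2x2 table of test prediction versus later observed behavior.

    tp: test flagged a problem and the problem occurred
    fp: test flagged a problem but none occurred
    fn: test flagged nothing but a problem occurred
    tn: test flagged nothing and none occurred
    """
    sensitivity = tp / (tp + fn)
    specificity = tn / (tn + fp)
    return {
        "sensitivity": sensitivity,
        "specificity": specificity,
        "positive_predictive_value": tp / (tp + fp),
        "negative_predictive_value": tn / (tn + fn),
        # Youden's J is 0 for a test performing at chance and 1 for a perfect test.
        "youden_j": sensitivity + specificity - 1,
    }


# Hypothetical follow-up of 50 re-homed dogs (counts invented for illustration).
summary = predictive_summary(tp=12, fp=8, fn=3, tn=27)
for name, value in summary.items():
    print(f"{name}: {value:.2f}")
```

Predictive values depend on how common the behavior is in the tested population, so figures obtained in one setting, such as a shelter, may not transfer to another.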

Finally, the quality of temperament tests must also address issues of practicality and appropriateness for widespread or commercial use, whether this use is in rescue shelters or in breeding and training establishments. Tests that are impractical, overly long, and difficult to assess are unlikely to be performed accurately or reliably, if at all. Accordingly, a scientifically developed test will often require refinement for practical use.

For test developers, two additional considerations need to be made in order to ensure that a test is reliable, valid, and feasible: consideration of the purpose of the test and standardization of the test procedure. If the goals of the temperament test are not clearly identified (i.e., the aspects of temperament that the testers wish to identify are not explicitly stated), then it is unlikely that the test will be valid. The next step in the development process is the selection of appropriate tests and corresponding scores for the dog’s behavior. If this stage is not standardized and formalized, it is unlikely that the test will be reliable. It is important that these two additional prior requirements be fulfilled before the test developers can proceed to assessment of reliability and validity.

Jones and Gosling (2005) and Diederich and Giffroy (2006) have both recently reviewed temperament assessments in dogs. Jones and Gosling (2005) broadly considered the issues of reliability and validity for all forms of temperament assessment, including those derived from individual-based and general questionnaires, but left open the question of the quality of specific temperament tests used in practice. Nonetheless, they found evidence that the issue of reliability, in particular, had been poorly addressed, and evidence for validity was low for tests conducted on young dogs. Diederich and Giffroy (2006) specifically highlighted the lack of standardization of temperament tests for a range of dog roles.

The initial aim of our paper was to review in detail the extent to which temperament tests specifically for adult pet dogs had demonstrated reliability, validity, and feasibility. We searched PubMed and ScienceDirect using the terms “dog” or “canine,” “temperament” or “behavio(u)r,” and “test.” This search of the peer-reviewed literature revealed only six papers relating to primary research. Van der Borg et al. (1991) and Hennessy et al. (2001) described tests to predict a range of problem behaviors in rescue dogs. De Palma et al. (2005) described a test to assess general temperament and re-homing suitability of rescue dogs. Netto and Planta (1997), Van den Berg et al. (2003), and Kroll et al. (2004) described tests specifically to assess aggression in pet dogs. A number of other tests have been reported in conference proceedings (particularly those of the International Veterinary Behavior Meetings and the Companion Animal Behaviour Therapy Study Group) but have not been reported formally in the literature. These include tests for specific problem behaviors (McPherson and Bradshaw 1998, Notari et al 2005) and general temperament in rescue dogs (Heidenberger 1993, Ledger and Baxter 1997, Marder et al 2003, Mondelli et al 2003). The lack of publication is disappointing, because it is well known that many shelter organizations have also devised their own temperament tests (Sternberg, 2002). Given the lack of relevant, data-based scientific publications and the problems identified by other reviewers of this procedure, it is appropriate to review and reiterate the process of valid test development in order to provide a benchmark for future test developers. This paper reviews the range of evaluations required before claims can be made about either reliability or validity, with the intent to guide future research and test developers. This process is broken down into identification of the purpose and content of the test, standardization, assessment of reliability and validity, and refinement for practical use, or feasibility (see Appendix A for definitions and Appendix B for key points for each of these).

Section snippets

Purpose and content of the temperament test

The first step in creating a valid and useful temperament test is careful consideration of its purpose (Appendix B). Test developers need to first consider why they want a temperament test. What behaviors or traits should the test reveal (e.g., fearlessness), and what behaviors and traits should it avoid revealing (e.g., aggressiveness, stress-related responses)? Determination of the purpose of a test is key to determining the method to be used to reveal the properties under investigation

Standardization

For a test to have any chance of being reliable and valid, standardization of the test procedure is a minimum requirement. Standardization relates to the extent to which a protocol for carrying out the test is provided and consideration for minimization of variability between tests has been made. In standardization, all potential sources of variability need to be identified and controlled for so that the only variable is the dog’s response (Diederich and Giffroy, 2006). Considerations for

Intra-observer reliability

Intra-observer reliability measures the consistency of the reports of a single observer (Martin and Bateson, 1993). In theory, the observer’s assessments should report similarly when the same dog is tested using the same test on another occasion. However, in order to control for behavioral changes on the part of the dog, rather than by the observer, it is recommended that intra-observer reliability be assessed by the use of video recordings (Martin and Bateson, 1993). In this way, the observer

Content validity

Content validity evaluates whether the test measures what it should and whether the components of the measure cover all aspects of the behavior in question. Face validity is one aspect of content validity and refers to the subjective assessment of whether the item appears to be measuring the variable it claims to “on the face of it” (Eiser and Morse, 2001). For example, van den Berg et al. (2003) performed a principal components analysis on all the behaviors shown by the dogs in their

Feasibility for practical use

The ultimate aim of many temperament tests for pet dogs is that interested groups can perform the test themselves and make use of the results (Ledger and Baxter, 1997). Accordingly, the test needs to be standardized and short, easy to perform, and amenable to easily recording the dog’s response. Many of the tests reviewed here may be prohibitively long for practical use in a working environment like a shelter (Hsu and Serpell, 2003), taking one hour per dog (Planta et al 1991, Ledger and Baxter

Conclusion

Fewer than ten reports of temperament tests specifically for the selection of suitable adult dogs as pets could be found in the peer-reviewed scientific literature. Even among these, the reports of reliability, validity, and feasibility are not complete, with authors typically reporting on one, but not all, aspects. The absence of reports of the methodology, reliability, and validity of temperament tests for dogs in general has been noted by a number of authors (Hsu and Serpell 2003, Marston

Acknowledgments

This paper formed part of a wider review of approaches to the assessment of temperament, welfare, and quality of life in kenneled dogs commissioned by Dogs Trust, U.K., and we are indebted to this organization for its support of this work. The first author was supported by this charity to undertake these reviews. We would also like to thank members of the Dogs Trust “quality of life working party” for their support and comments: Jon Bowen, John Bradshaw, Keith Butt, Rachel Casey, Philip

References (83)

B. Beerda et al. Chronic stress in dogs subjected to social and spatial restriction In: Behavioural responses. Physiol. Behav. (1999)
U.T. Brenoe et al. Estimates of genetic parameters for hunting performance traits in three breeds of gun hunting dogs in Norway. Appl. Anim. Behav. Sci. (2002)
J.-F. Courreau et al. Genetic parameters and environmental effects which characterise the defence ability of the Belgian shepherd dog. Appl. Anim. Behav. Sci. (2005)
C. Diederich et al. Behavioural testing in dogs: A review of methodology in search for standardization. Appl. Anim. Behav. Sci. (2006)
J. Feaver et al. A method for rating the individual distinctiveness of domestic cats. Anim. Behav. (1986)
M.A. Fredrickson. Temperament testing procedures for animals involved in nursing home, school and hospital visiting programs through Delta Society Pet Partners. Appl. Anim. Behav. Sci. (1993)
M.E. Goddard et al. Genetics of traits which determine the suitability of dogs as guide dogs for the blind. Appl. Anim. Ethol. (1983)
M.E. Goddard et al. The relationship of fearfulness, sex, age and experience on exploration and activity in dogs. Appl. Anim. Behav. Sci. (1984)
M.E. Goddard et al. Individual variation in agonistic behaviour in dogs. Anim. Behav. (1985)
M.B. Hennessy et al. Plasma cortisol levels of dogs at a county animal shelter. Physiol. Behav. (1997)
M.B. Hennessy et al. Behaviour and cortisol levels of dogs in a public shelter, and an exploration of the ability of these measures to predict problem behaviour after adoption. Appl. Anim. Behav. Sci. (2001)
M.B. Hennessy et al. Influence of male and female petters on plasma cortisol and behaviour: can human interaction reduce the stress of dogs in a public animal shelter? Appl. Anim. Behav. Sci. (1998)
A.C. Jones et al. Temperament and personality in dogs (Canis familiaris): a review and evaluation of past research. Appl. Anim. Behav. Sci. (2005)
T. King et al. Fear of novel and startling stimuli in domestic dogs. Appl. Anim. Behav. Sci. (2003)
R.K. Lore et al. Avoidance reactions of domestic dogs to unfamiliar male and female humans in a kennel setting. Appl. Anim. Behav. Sci. (1986)
P. Lucidi et al. Ethotest: A new model to identify (shelter) dogs’ skills as service animals or adoptable pets. Appl. Anim. Behav. Sci. (2005)
J.D. Lund et al. Behaviour patterns and time course of activity in dogs with separation problems. Appl. Anim. Behav. Sci. (1999)
L.C. Marston et al. Re-forging the bond – towards successful canine adoption. Appl. Anim. Behav. Sci. (2003)
J.L. Millot. Olfactory and visual cues in the interaction systems between dogs and children. Behav. Proc. (1994)
J.A. Murphy. Describing categories of temperament in potential guide dogs for the blind. Appl. Anim. Behav. Sci. (1998)
W. Netto et al. Behavioural testing for aggression in the domestic dog. Appl. Anim. Behav. Sci. (1997)
K.L. Overall. Proceedings of the Dogs Trust Meeting on Advances in Veterinary Behavioural Medicine London, 4th–7th November 2004: Veterinary behavioural medicine: a roadmap for the 21st century. Vet. J. (2005)
N.J. Rooney et al. A comparison of dog-dog and dog-human play behaviour. Appl. Anim. Behav. Sci. (2000)
S. Ruefenacht et al. A behaviour test on German Shepherd dogs: heritability of seven different traits. Appl. Anim. Behav. Sci. (2002)
C.B. Schaffer et al. The Tuskegee behaviour test for selecting therapy dogs. Appl. Anim. Behav. Sci. (1994)
J.A. Serpell et al. Development and validation of a novel method for evaluating behaviour and temperament in guide dogs. Appl. Anim. Behav. Sci. (2001)
J.M. Slabbert et al. Early prediction of adult police dog efficiency – a longitudinal study. Appl. Anim. Behav. Sci. (1999)
K. Svartberg. Shyness-boldness predicts performance in working dogs. Appl. Anim. Behav. Sci. (2002)
K. Svartberg. A comparison of behaviour in test and in everyday life: evidence of three consistent boldness-related personality traits in dogs. Appl. Anim. Behav. Sci. (2005)
K. Svartberg et al. Personality traits in the domestic dog (Canis familiaris). Appl. Anim. Behav. Sci. (2002)
K. Svartberg et al. Consistency of personality traits in dogs. Anim. Behav. (2005)
J.A.M. van der Borg et al. Behavioural testing of dogs in animal shelters to predict problem behaviour. Appl. Anim. Behav. Sci. (1991)
J. Vas et al. A friend or an enemy? Dogs’ reaction to an unfamiliar person showing behavioural cues of threat and friendliness at different times. Appl. Anim. Behav. Sci. (2005)
E. Weiss et al. Service dog selection tests: effectiveness for dogs from animal shelters. Appl. Anim. Behav. Sci. (1997)
D.L. Wells et al. Male and female dogs respond differently to men and women. Appl. Anim. Behav. Sci. (1999)
E. Wilsson et al. The use of a behaviour test for the selection of dogs for service and breeding I: Method of testing and evaluating test results in the adult dog, demands on different kinds of service dogs, sex and breed differences. Appl. Anim. Behav. Sci. (1997)
E. Wilsson et al. Behaviour test for eight-week old puppies – heritabilities of tested behaviour traits and its correspondence to later behaviour. Appl. Anim. Behav. Sci. (1998)
P.S. Arkow. A new look at pet “over-population”. Anthrozoös (1994)
A. Bowling. Measuring Health: a review of quality of life measurements scales, 2nd ed (1997)
L.A. Clark et al. Constructing validity: Basic issues in objective scale development. Psychol. Assess. (1995)
