Elsevier

Forensic Chemistry

Volume 3, March 2017, Pages 41-51
Forensic Chemistry

Full length articles
Investigative probabilistic inferences of smokeless powder manufacturers utilizing a Bayesian network

https://doi.org/10.1016/j.forc.2016.12.001Get rights and content

Highlights

  • Bayesian network inference of smokeless powder manufacturers.

  • Bayesian network based on physical and chemical properties.

  • Manufacturer inferred based on largest posterior probability.

  • Network instantiation with full and partial information investigated.

  • Overall cross validated manufacturer correct inference specificity was 0.74.

Abstract

Chemical and physical characteristics for 169 smokeless reloading powders were utilized in the development of a Bayesian network for inference of the powder manufacturer. The chemical characteristics of the smokeless powders were encoded using the most intense ions in their total ion spectra from gas chromatography-electron ionization-mass spectrometry (GC-EI-MS), which were previously determined from agglomerative hierarchical cluster analysis. Physical characteristics included as network nodes were the average kernel diameter and length, shape, color, luster, absence/presence of a bias cut and absence/presence of a perforation, which are commonly considered in casework. A Bayesian network was compiled using R code, written in-house. Performance of the network was validated by 100 iterations of stratified cross validation, withholding 10% of the data for testing and using the remaining 90% of the data to develop probability tables for the network. Posterior probabilities of the powder manufacturers were calculated for each test sample, and manufacturer inferences were made based on the highest posterior probability. The sensitivity and specificity of the fully instantiated network was examined for each manufacturer. Other performance metrics, including the positive and negative predictive values (PPV and NPV), which take into account the prevalence of each manufacturer, were also examined. The PPV ranged from 0.59 to 0.81 for individual manufacturers when all nodes of the network were instantiated. The NPV for fully instantiated networks ranged from 0.82 to 0.99 for individual manufacturers.

Introduction

Smokeless powders are low explosive propellants which undergo decomposition by deflagration [1], [2], and are categorized as single base, double base or triple base, as defined by their primary energetic materials [3]. Smokeless powders also contain additional organic compounds that function as plasticizers, deterrents, stabilizers, flash suppressants, opacifiers, and dyes [3], [4], [5]. The powders have a range of physical properties, including kernel shape, presence or absence of a perforation, color, kernel size, and in some cases colored markers are added to the powder. The manufacturer’s aim is to optimize the ballistic performance of the product [5], [6], [7], [8], [9].

Single and double base powders are available for civilian purchase in the United States from sporting goods and internet retailers, and are typically used to manually reload ammunition. Commercial availability of smokeless powders contributes to their use in improvised explosive devices (IEDs) such as pipe bombs [10]. In the forensic sphere, the goal of smokeless powder analysis is to identify particles as smokeless powders, and, if possible, to determine the product identity and/or manufacturer of the smokeless powder [2]. Identification of some brands (products) has been reported if the colored markers are present as a physical attribute in the smokeless powder [2], [8]; however, identification of brands using markers could be misleading since the marker color may vary between lots [8]. Furthermore, brand identification becomes increasingly difficult in the absence of colored markers, if the physical features and chemical formulation of a product has been altered between production lots, or by a combination of these factors.

Laboratory protocols for the analysis of intact (pre-blast) smokeless powders commonly include stereomicroscopy [2], Fourier transform infrared spectroscopy (FT-IR) [11], [12], and gas chromatography – electron ionization – mass spectrometry (GC-EI-MS) [13]. The results obtained from these analyses combined with notation of the powder’s physical attributes (shape, dimensions, color, etc.) are often used in a database search with the goal of identifying the powder’s brand, or to generate a short list of potential products [2], [8]. Product identification from a database search is limited if the database content is neither exhaustive nor dynamic. Smokeless powders industrial practices include altering product formulation and morphology to improve ballistic performance, as well as purchasing domestic and imported products or reworking surplus military powders for repackaging and retail [8], [14]. These practices reinforce the need for continual purchase and analysis of smokeless powder products to ensure that database records accurately reflect product availability in the market. Perhaps the most pertinent forensic question arises when a database search has been conducted, and assertions are to be made regarding the product’s identity. The question addresses whether a recovered sample and a control sample are the same or different products. Concluding that the recovered and control samples are different brands would be reached when either, or both, the chemical formulations or the physical attributes of the powders are different; however, when the two samples share the same physical and chemical characteristics, the problem is more challenging. Categorical rather than probabilistic statements are typically made as to the product’s identity, although the general need for statistically based assertions in forensic science has been addressed [15]. It is generally held that categorical statements should be avoided in favor of statements of evidentiary value in all areas of forensic science; however, at the time of this writing, categorical statements remain accepted in many areas of forensic science in the United States. Our ability to replace categorical statements is dependent on continued research and collaboration between the academic and practicing forensic communities. Specifically in the area of smokeless powder analysis, a recent study which was conducted using data for a set of intact smokeless powder database samples demonstrated the limited evidentiary value that could be expressed in probabilistic assertions of same product or different product, based on chemical and physical characteristics [10]. In this work we examine the use of a Bayesian network to make a probabilistic inference regarding the manufacturer of a smokeless powder. Inference of the manufacturer, rather than product/brand, was chosen to limit the size of the probability tables associated with each node in the network, and to allow for more reliable estimates of the probability tables from a relatively small number of recently purchased powders where the manufacturer is definitely known. The use of Bayesian networks also allows for calculation of the posterior probability of each manufacturer, even when only part of the chemical and physical parameters for a powder are available. The potential absence of some chemical or physical characteristics for smokeless powders in casework samples was an important factor in the choice of a Bayesian network model for this work. In the models presented here, as with other possible models, the quality of the calculated posterior probabilities is dependent upon having a good estimate of the relevant population when creating the model. The Bayesian network models presented here are tested by cross validation with a random stratified draw from the database samples. This ensures that the test set is representative of the population used to make the models and consequently, the results can be considered as representing a best case scenario. Estimation of the distributions of relevant populations for forensic applications continues to be a challenge to advances in many areas of forensic science. This work does not address the problem of estimating manufacturer prevalence in a casework relevant population.

Bayesian networks are probabilistic graphical structures comprised of nodes and edges. Within the network structure, the nodes, which are depicted by circles, represent random variables or events. The edges are shown as arrows to denote conditional relationships between the nodes, resulting in a directed acyclic graph (DAG). A node located at the arrow’s tail is referred to as a parent node, a node at the arrow’s head is referred to as a child node, and a node without an arrow pointing into it is known as a root node. Fig. 1 illustrates a simple Bayesian network structure comprising four nodes: A, B, C, and D. Nodes A and C are parent nodes; A is the parent of B and C, and C is the parent of D. Node A is also a root node, since there is no arrow directed into the node.

Within the network, each node represents either a discrete or continuous random variable and contains an exhaustive list of mutually exclusive states [16]. States represent possible outcomes for the random variable, and each state has a probability value ascribed to it. A table associated with each node encodes the probability distribution across all states, or combination of states, within the node. A root node’s table encodes unconditional probabilities for the states within that node, and the table associated with a child node encodes conditional probabilities for all states within the child node, where each state is conditioned on the states of the parent node(s) [17]. Referring to Fig. 1, the table associated with node A encodes unconditional probabilities, while the tables associated with nodes B, C, and D encode conditional probabilities. The probabilities for each state in nodes B and C are conditioned on each state in node A, and the probabilities for each state in node D are conditioned on each state in node C. An example of a simple Bayesian network and the associated probability tables is shown in Appendix A.

The probabilities across all states of a child node, conditioned on each state in its parent node, sum to one. A requirement which must be satisfied within the structure of a Bayesian network is that of conditional independence, where two child nodes must be conditionally independent given their parent. In Fig. 1, nodes B and C must be conditionally independent given node A. Consequently, a Bayesian network describes the probabilistic and independence relationships between a set of random variables by conditioning child nodes upon their parent nodes. Additionally, the joint probability distribution for a set of random variables, X1,,Xn, is decomposed to the product of their probabilities conditioned on their parents [16]. The expression describing this relationship, known as the chain rule or Markov property, is given in Eq. (1).Pr(X1,,Xn)=i=1nPr(Xi|parents(Xi))

Bayesian networks are instantiated to update probabilities of all variable states based on the observation of new evidence. A node is instantiated when we know its true value (i.e., we have evidence). It is possible to instantiate only some of the nodes in a network and still make inferences. In other words, if you only know some of the physical or chemical characteristics of a smokeless powder, it is possible to estimate posterior probabilities for each of the manufacturers; however, as the amount of known information decreases, the accuracy of the inference is expected to decrease. Appendix A provides an example of calculations using a small Bayesian network.

Section snippets

Smokeless powder extraction and GC-MS analysis

The National Center for Forensic Science (NCFS), Orlando, Florida, hosts and maintains a Smokeless Powders Database comprised of analytical data records for commercially available single and double base smokeless powder products [18]. The database, developed in collaboration with the Explosives Committee of the Scientific Working Group for Fire and Explosions (SWGFEX), contains records contributed by NCFS; the Federal Bureau of Investigations (FBI); and the Netherlands Forensic Institute (NFI).

Conditional independence testing of the smokeless powder random variables

The smokeless powder physical features which were tested for conditional independence include kernel shape, color, luster, average diameter, average length, the presence/absence of a perforation, and the presence/absence of a bias cut. Additional physical characteristics which may be observed in smokeless powders, such as marker color; and distinguishing features such as teardrop and oblong shaped kernel, etc., were not included in this study since they are observed infrequently, or their

Conclusion

A Bayesian network for inferring smokeless powder manufacturers based on the highest posterior probability has been examined. The physical and chemical properties of smokeless powders which are typically utilized in a database search were used to instantiate all the nodes in the network and to calculate posterior probabilities for each smokeless powder manufacturer. In addition, the network was instantiated by withholding information pertaining to one physical property, a combination of

Acknowledgments

This research was supported by Award No. 2013-R2-CX-K008, from the National Institute of Justice, Office of Justice Programs, U.S. Department of Justice. The opinions, findings, and conclusions or recommendations expressed in this publication/program/exhibition are those of the author(s) and do not necessarily reflect those of the Department of Justice.

References (30)

  • R.B. Hopler

    The history, development, and characteristics of explosives and propellants

  • W.A. MacCrehan et al.

    Investigating guns, bombs, and rockets: a new nist reference material for smokeless powder measurements

    Am. Lab.

    (2006)
  • C.L. Wallace et al.

    Smokeless powder characterization: an investigative tool in pipe bombings

  • W.F. Rowe

    Firearms identification

  • C.R. James

    The ABCs of Reloading: The Definitive Guide for Novice to Expert

    (2010)
  • Cited by (11)

    • Correlation and analysis of smokeless powder, smokeless powder residues, and lab generated pyrolysis products via GC–MS

      2021, Forensic Chemistry
      Citation Excerpt :

      The TIS encompassed all ions present in the data, while the TIC encompassed all chromatographic peaks that were present at a signal to noise ratio of 3 or greater. The use of TIS data is well established in ignitable liquid analysis and has also seen use in smokeless powders [9,10,29–31]. The TIS allowed for more direct comparison of the samples analyzed by different chromatographic conditions, due to its lack of method-dependent chromatographic data [29].

    • Novel method based on ion mobility spectrometry sum spectrum for the characterization of ignitable liquids in fire debris

      2019, Talanta
      Citation Excerpt :

      Hence, a reliable identification of trace of neat ILs or ignitable liquid residues (ILRs) at the fire scene is really important not only to determine the cause of the fire but also to try and prevent this crime [3–6]. However, proper identification of ILRs is often hindered by many factors, such as the destructive nature of fire, interfering pyrolysis products as well as background compounds such as the fire suppression agents used by firefighters to extinguish a fire among others [7,8]. Different analytical techniques have been proposed during the last few years for a reliable determination and classification of the ILRs found in a fire scenario.

    View all citing articles on Scopus
    View full text