A NEW REFERENCE CHEMICAL COMPOSITION FOR TMC-1

, , , , , , and

Published 2016 August 11 © 2016. The American Astronomical Society. All rights reserved.
, , Citation P. Gratier et al 2016 ApJS 225 25 DOI 10.3847/0067-0049/225/2/25

0067-0049/225/2/25

ABSTRACT

Recent detections of complex organic molecules in dark clouds have rekindled interest in the astrochemical modeling of these environments. Because of its relative closeness and rich molecular complexity, TMC-1 has been extensively observed to study the chemical processes taking place in dark clouds. We use local thermodynamical equilibrium radiative transfer modeling coupled with a Bayesian statistical method which takes into account outliers to analyze the data from the Nobeyama spectral survey of TMC-1 between 8 and 50 GHz. We compute the abundance relative to molecular hydrogen of 57 molecules, including 19 isotopologues in TMC-1 along with their associated uncertainty. The new results are in general agreement with previous abundance determination from Ohishi & Kaifu and the values reported in the review from Agúndez & Wakelam. However, in some cases, large opacity and low signal to noise effects allow only upper or lower limits to be derived, respectively.

Export citation and abstract BibTeX RIS

1. INTRODUCTION

Complex organic molecules (COMs; e.g., ${\mathrm{CH}}_{3}\mathrm{OH}$, ${\mathrm{CH}}_{3}\mathrm{CHO}$, ${\mathrm{HCOOCH}}_{3}$, ${\mathrm{CH}}_{3}{\mathrm{OCH}}_{3}$; see Herbst & van Dishoeck 2009) were detected in various sources, e.g., hot cores or hot corinos or warm star-forming regions such as SgB2 (Cummins et al. 1986), OMC-1 (Blake et al. 1987), or IRAS 16293-2422 (Bottinelli et al. 2004). Matthews et al. (1985) and Friberg et al. (1988) also detected some of these COMs (${\mathrm{CH}}_{3}\mathrm{OH}$ and ${\mathrm{CH}}_{3}\mathrm{CHO}$) in the TMC-1 and L134N dark clouds. Recently, Cernicharo et al. (2012) detected the methoxy radical (${\mathrm{CH}}_{3}{\rm{O}}$) in the B1 cold dense core. Bacmann et al. (2012) also reported the detections of a variety of COMs (${\mathrm{CH}}_{3}\mathrm{OH}$, ${\mathrm{CH}}_{3}{\mathrm{OCH}}_{3}$, ${\mathrm{CH}}_{3}\mathrm{OCHO}$, ${\mathrm{CH}}_{2}\mathrm{CO}$) in the cold prestellar core L1689b along with few more recent detections of additional COMs (${{\rm{C}}}_{3}{\rm{O}}$, t-$\mathrm{HCOOH}$) in the L1544 prestellar core reported by Vastel et al. (2014). These detections have triggered a renewed interest in the study of the astrochemistry of these environments (Garrod & Herbst 2006; Herbst & Cuppen 2006; Vasyunin & Herbst 2013; Ruaud et al. 2015). Among the different COMs, ${\mathrm{CH}}_{3}\mathrm{OH}$ is the only one for which the production is well understood. Its formation goes through the the successive hydrogenation of carbon monoxide accreted from the gas phase on ice mantles of interstellar dust grains (Watanabe & Kouchi 2002). For several other COMs, studies by Vasyunin & Herbst (2013) and Ruaud et al. (2015) have shown that non-thermal desorption processes can bring COMs to the gas phase from the grain surfaces in regions where the grain temperatures remain low (${T}_{\mathrm{dust}}\lt 15\,{\rm{K}}$) and the UV radiation is also very low (${A}_{v}\gt 10$) for a standard interstellar radiation field.

Astrochemical models are based on microphysics processes that can be studied in laboratories or computed by quantum chemistry theories. Nevertheless, these models must be benchmarked on template sources that are extensively studied. TMC-1 is very close (140 pc; Elias 1978; Kenyon et al. 1994) with a very rich molecular compostion and has thus been extensively observed to understand dark cloud chemistry. In order to improve astrochemical models, their outputs must be compared to the widest set of observations as possible. Spectral surveys enable the unbiased detection of a large number (several tens) of species, often with many detected lines per species, while at the same time minimizing relative calibration errors. With the advent of new wideband receivers and spectrometers, spectral surveys are de facto the default observation mode at millimeter facilities. To interpret this wealth of data, new statistical tools have to be devised.

In this article, we propose to use a Bayesian approach to compute abundance values along with uncertainty estimates associated with all the detected molecules in the Nobeyama spectral survey of TMC-1 from Kaifu et al. (2004).

The paper is organized as follows. We present the observational and spectroscopic data used in Section 2, the methods to determine abundances in Section 3, and the derived values of abundances in Section 4. Finally, these results are discussed and compared with previous abundance values from the literature in Section 5. The Appendix presents, in detail, the Bayesian statistical method used and the individual results for all molecules.

2. DATA

2.1. Observational Data

Kaifu et al. (2004) used the Nobeyama 45 m dish between 1984 and 1996 to observe a full spectral survey of the Cyanopolyyne peak position of TMC-1 (TMC-1(CP). For the rest of the article, we will use TMC-1 as a shorthand for TMC-1(CP)) between 8.8 and 50 GHz with a spectral resolution of 37 kHz (this corresponds to a velocity resolution ranging from 1.26 to 0.22 km s−1) with a typical sensitivity of 10 $\,\mathrm{mK}$ per channel.

The observational data from Kaifu et al. (2004) consists of integrated intensities associated with a single (or several in the case of unresolved hyperfine structure) line of a given molecule. These integrated intensities are computed by fitting Gaussian line profiles to the spectral data. No uncertainty is given for these integrated intensities but the authors give a measured line width ${\rm{\Delta }}V$ and a local noise level per channel σ. From these values, we compute the noise on the integrated intensity, assuming a Gaussian profile, using the following formula:

Equation (1)

,where the numerical term comes from the integral of a Gaussian function of $\mathrm{FWHM}={\rm{\Delta }}V$

The data from Kaifu et al. (2004) take into account the coupling with the source (they assume TMC-1 is a Gaussian source of FWHM $160^{\prime\prime} $). Thus, the column densities and abundances we compute correspond to this spatial scale.

2.2. Spectroscopic Data

Under the local thermal equilibrium hypothesis, integrated intensities are dependent on a set of microphysic parameters: the Einstein coefficients of spontaneous de-excitation between levels i and j: Aij, the degeneracy of the upper level i, gi, the energy of the upper level i, Ei, and the partition function Q(T) measuring the number of occupied states at temperature T. These values are available from online databases, mainly the CDMS (Müller et al. 2005) and the JPL Spec databases (Pickett et al. 1998). The online databases often only give values of the partition functions down to a temperature of 9.75 K, but the measured excitation temperatures in dark clouds are often lower than that. Using the energy levels Ei and the corresponding level degeneracies gi, we compute when necessary the values of the partition function at 2.375 and 5 K using the following formula

Equation (2)

We systematically check that the value obtained by this sum for 9.75 K is similar to the tabulated value of the databases.

Between these tabulated values, the logarithm of the partition function is interpolated linearly in temperature. The last two columns of Table 2 give the source of the spectroscopic parameters and the lowest temperature for which the partition function values are tabulated in the databases.

3. METHODS

3.1. Local Thermal Equilibrium

We determine the column densities and abundances in the local thermodynamical equilibrium (LTE) hypothesis. For a given molecule, we compute the integrated intensities for each line, using the hypothesis of Gaussian opacity profiles as a function of frequency. The modeled opacity is a function of the species column density, excitation temperature, and line width. The integrated intensity is derived from the opacity assuming a temperature of the cosmic microwave background of 2.73 K.

Some lines detected in Kaifu et al. (2004) are attributed to several blended transitions of a given molecule. In this case, we sum the computed intensity for the identified transitions. Following Ohishi & Kaifu (1998), the column density of a species X, noted $[{\text{}}X]$, are converted into abundance assuming a constant molecular hydrogen column density of $N({{\rm{H}}}_{2})={10}^{22}\,{\mathrm{cm}}^{-2}$:

Equation (3)

3.2. Bayesian Approach: Outlier Detection

We devise a Bayesian approach which enables us to retrieve the LTE model parameter distributions. The advantages of a Bayesian approach over a more traditional optimization approach are threefold.

  • 1.  
    The full distribution of parameters can be recovered. This means that upper abundance limits in the case of marginal detections and lower abundance limits in the case of highly optically thick lines are readily obtained.
  • 2.  
    Prior knowledge can be included for some parameters. For example, with a single detected line, the abundance can only be constrained by assuming a prior value of the excitation temperature. A Bayesian approach enables us to include a prior distribution of excitation temperatures. The derived abundance then takes into account the prior uncertainty on the knowledge of the excitation temperature.
  • 3.  
    Additional parameters can be added to the model, for example, to identify data points that are outlier to the physical model. These additional parameters are marginalized out in the end to obtain a posterior distribution of the parameters of interest; in our case, the molecular abundances.

For a few of the molecules, some of the integrated intensities were clearly outliers and could not be modeled consistently with the rest of the data set. This can arise when a line has either: (1) erroneous spectroscopic parameters, (2) been misattributed to the modeled species, or (3) when a line from another species is blended with one from the modeled species. The method we have developed automatically takes into account the probability of each data point being an outlier. This enables us to recover the parameters for the modeled species such as the abundance and excitation temperature in a robust way. Although the method is applied to radiative transfer modeling of molecular lines, it is quite general and can be readily applied to a wide range of problems where outlier detection is needed.

The model parameters are divided into two groups: the first concerning the LTE modeling, and the second concerning the modeling of outliers. The LTE model parameters are the column density N, the excitation temperature Tex, the line width ${\rm{\Delta }}V$, and a scatter term σ that enables us to describe both the possible underestimation of the observational uncertainty and the fact that the LTE model is simpler than would be needed to perfectly describe the observations. The outlier model parameters are the two parameters (mean integrated intensity Yo and dispersion around this mean ${\sigma }_{o}$) describing the Gaussian distribution from which the outliers are supposed to be originating. Additionally, each data point is attributed a binary parameter qi which can take a value of either 0 if the point is an outlier or 1 if the point is not. There are thus as many qi as there are data points. A final parameter is Po, the probability that a point is an outlier considering the binomial distribution of the qi parameters. It possible to analytically marginalize the qi parameters while at the same time recovering the posterior probability of a point being an outlier. The full model thus has 10 free parameters and does not become more complicated to solve as the number of data points increase.

The mathematical details of this approach, in particular the expression of the likelihood used, are presented in Appendix A.

When only a single line is detected for a given species, the identification of outliers makes no sense. In this case, the number of parameters is limited to four: N, Tex , ${\rm{\Delta }}V$, and $\mathrm{log}\sigma $. Since with only one line the additional scatter parameter is completely unconstrained, an informative prior consisting of a Gaussian function of mean −1.2 and standard deviation 0.4 was used. This is the median and sample standard deviation of the values of $\mathrm{log}\sigma $ determined from the 26 species with more than 2 detected lines.

The Bayesian approach requires us to choose prior distributions for all model parameters. Some of them can be rather uninformative. For example, the prior distribution of the logarithm of the column density is chosen to be uniform between 9 and 17 (with the column density in units of ${\mathrm{cm}}^{-2}$). Similarly, the prior probability of the outliers Po is chosen as uniform between 0 and 1. Conversely, we use informative priors on two specific parameters. Following previous studies (Ohishi & Kaifu 1998) that found excitation temperatures around 5 K in TMC-1, we use a Gaussian prior on ${T}_{\mathrm{ex}}$ of mean 5 K and standard deviation 2 K along with a lower temperature limit of 2.73 K corresponding to the background temperature. Similarly, we choose a Gaussian distribution of mean 0.5 km s−1 and standard deviation 0.2 km s−1 for the prior distribution of line widths. Table 1 summarizes these choices.

Table 1.  Prior Parameter Distributions

Parameter Distribution Expression Lower Bound Upper Bound
$\mathrm{log}N$ Uniform $p(\mathrm{log}N)$ = ${[{(\mathrm{log}N)}_{\max }-{(\mathrm{log}N)}_{\min }]}^{-1}$ 9 17
${T}_{\mathrm{ex}}$ Gaussian $p({T}_{\mathrm{ex}})$ =  ${ \mathcal N }(5,2)$ 2.73 20
${\rm{\Delta }}V$ Gaussian $p({\rm{\Delta }}V)$ = ${ \mathcal N }(0.5,0.2)$ 0 3
$\mathrm{log}\sigma $ Uniform $p(\mathrm{log}\sigma )$  =  ${[{(\mathrm{log}\sigma )}_{\max }-{(\mathrm{log}\sigma )}_{\min }]}^{-1}$ −3 1
$\mathrm{log}\sigma $ Gaussiana $p(\mathrm{log}\sigma )$  =  ${ \mathcal N }(-1.2,0.4)$ −3 1
Yo Uniform $p({Y}_{o})$ = ${[{({Y}_{o})}_{\max }-{({Y}_{o})}_{\min }]}^{-1}$ 0 10
$\mathrm{log}{\sigma }_{o}$  Uniform $p(\mathrm{log}{\sigma }_{o})$ = ${[{(\mathrm{log}{\sigma }_{o})}_{\max }-{(\mathrm{log}{\sigma }_{o})}_{\min }]}^{-1}$ −3 2
$\{{q}_{i}\}$  Binomial $p(\{{q}_{i}\})$ = ${\prod }_{i=1}^{N}{[1-{P}_{o}]}^{{q}_{i}}{P}_{o}^{[1-{q}_{i}]}$     
Po Uniform $p({P}_{o})$ = 1 0 1

Note.

aThe Gaussian prior is used when only one line of a given molecule is detected.

Download table as:  ASCIITypeset image

4. RESULTS

Table 2 summarizes the derived abundances for observed molecules toward TMC-1 by means of the Bayesian approach. In this table, column (1) is the species name, column (2) is the median value of the abundance along with the associated 68% ($1\sigma $) confidence interval, column (3) is similar to column (2) for the excitation temperature, column (4) is the peak signal-to-noise ratio, the ratio between the intensity of the brightest line of each molecule to the noise computed for the fit of this molecule, column (5) is the range of opacities spanned by the lines of a given species, column (6) is the source of the spectroscopic data, and column (7) is the minimum temperature for which the partition functions are tabulated in the spectroscopic databases; below these temperatures, the partition functions are computed according to Equation (2). Individual fits to the observed data are shown in Appendix B

Table 2.  Best-fit Parameters by the Bayesian Approach for Main Isotope-bearing Species

Species $\mathrm{log}N$ a ${T}_{\mathrm{ex}}$ PSNR τ range Number of Min Partition Appendix
  (cm−2) (K)     Transitions Function Temp Figure
${\mathrm{NH}}_{3}$ ${14.70}_{-2.33}^{+0.61}$ ${6.16}_{-1.92}^{+1.76}$ 26.8 $0.00\mbox{--}0.73$ 6 9.375† 3.1
${\mathrm{CH}}_{3}\mathrm{OH}$ ${13.16}_{-1.79}^{+0.25}$ ${5.22}_{-1.43}^{+1.95}$ 2.0 $0.00\mbox{--}0.31$ 1 9.375† 3.2
c-${{\rm{C}}}_{3}{\rm{H}}$ ${13.48}_{-0.05}^{+0.07}$ ${5.73}_{-1.29}^{+1.58}$ 12.7 $0.02\mbox{--}0.12$ 6 9.375† 3.3
l-${{\rm{C}}}_{3}{\rm{H}}$ ${12.75}_{-0.03}^{+0.07}$ ${5.90}_{-1.40}^{+1.56}$ 14.5 $0.03\mbox{--}0.14$ 6 9.375 3.4
l-${{\rm{C}}}_{3}{{\rm{H}}}_{2}$ ${11.77}_{-1.90}^{+0.50}$ ${6.02}_{-1.97}^{+1.77}$ 1.6 $0.01\mbox{--}0.04$ 4 9.375 3.5
c-${{\rm{C}}}_{3}{{\rm{H}}}_{2}$ ${13.27}_{-2.90}^{+1.13}$ ${5.45}_{-1.53}^{+1.86}$ 6.6 $0.07\mbox{--}0.59$ 4 2.725 3.6
${\mathrm{CH}}_{3}\mathrm{CCH}$ ${14.06}_{-1.26}^{+1.16}$ ${5.47}_{-1.89}^{+2.03}$ 2.6 $0.08\mbox{--}0.39$ 3 9.375 3.9
${{\rm{C}}}_{2}{\rm{O}}$ $\lt 12.57$ ${5.50}_{-1.57}^{+1.71}$ 0.4 $\lt 0.02$ 1 9.375† 3.10
${{\rm{H}}}_{2}\mathrm{CCN}$ ${13.58}_{-0.34}^{+0.24}$ ${3.46}_{-0.35}^{+1.66}$ 5.3 $0.01\mbox{--}1.74$ 38 9.375 3.11
${\mathrm{CH}}_{3}\mathrm{CN}$ ${12.61}_{-0.18}^{+0.19}$ ${5.04}_{-1.27}^{+1.85}$ 3.2 $0.04\mbox{--}0.20$ 5 2.725 3.12
${{\rm{H}}}_{2}\mathrm{CCO}$ ${12.68}_{-1.71}^{+0.35}$ ${9.50}_{-4.68}^{+3.46}$ 1.6 $0.01\mbox{--}0.01$ 3 2.725 3.13
$\mathrm{HNCO}$ ${13.03}_{-0.05}^{+0.10}$ ${5.26}_{-1.09}^{+1.58}$ 10.5 $0.03\mbox{--}0.24$ 5 9.375† 3.14
$\mathrm{CS}$ $\gt 13.46$ ${4.45}_{-0.52}^{+1.60}$ 24.2 $\gt 3.96$ 1 2.725 3.15
${\mathrm{CH}}_{3}\mathrm{CHO}$ ${12.43}_{-2.03}^{+0.31}$ ${5.24}_{-1.49}^{+1.80}$ 1.7 $0.04\mbox{--}0.04$ 2 9.375† 3.16
${\mathrm{HCS}}^{+}$ ${12.76}_{-0.38}^{+0.53}$ ${5.33}_{-1.74}^{+1.62}$ 5.5 $0.10\mbox{--}3.37$ 1 9.375 3.18
${{\rm{H}}}_{2}\mathrm{CS}$ ${13.62}_{-0.13}^{+0.53}$ ${5.16}_{-1.55}^{+1.55}$ 7.7 $0.26\mbox{--}6.74$ 1 2.725 3.21
$\mathrm{SO}$ $\gt 13.67$ ${4.44}_{-1.21}^{+2.06}$ 11.8 $\gt 0.50$ 1 2.725 3.23
${{\rm{C}}}_{4}{\rm{H}}$ [13.43,14.94]b ${4.79}_{-0.39}^{+0.62}$ 26.4 $0.03\mbox{--}2.78$ 27 9.375 3.24
o-${{\rm{C}}}_{4}{{\rm{H}}}_{2}$ ${13.28}_{-0.30}^{+0.23}$ ${5.04}_{-0.77}^{+1.52}$ 6.5 $0.08\mbox{--}0.25$ 8 9.375 3.25
p-${{\rm{C}}}_{4}{{\rm{H}}}_{2}$ ${12.42}_{-0.49}^{+0.45}$ ${4.63}_{-1.18}^{+1.78}$ 3.6 $0.10\mbox{--}0.22$ 4 9.375 3.27
${{\rm{C}}}_{3}{\rm{N}}$ ${13.55}_{-0.30}^{+0.21}$ ${3.31}_{-0.22}^{+0.64}$ 3.6 $0.13\mbox{--}0.94$ 12 9.375 3.28
${\mathrm{HC}}_{3}{\rm{N}}$ ${14.37}_{-0.06}^{+0.13}$ ${5.22}_{-0.58}^{+0.80}$ 33.0 $0.04\mbox{--}19.50$ 20 2.725 3.29
${\mathrm{HNC}}_{3}$ ${11.68}_{-0.18}^{+0.19}$ ${5.69}_{-1.67}^{+1.97}$ 3.5 $0.05\mbox{--}0.12$ 4 9.375† 3.30
$\mathrm{HCCNC}$ ${12.93}_{-0.11}^{+0.31}$ ${5.41}_{-1.45}^{+1.80}$ 5.0 $0.04\mbox{--}0.27$ 5 9.375† 3.31
${{\rm{C}}}_{3}{\rm{O}}$ ${11.92}_{-1.29}^{+0.25}$ ${5.03}_{-1.23}^{+1.77}$ 1.7 $0.04\mbox{--}0.04$ 2 9.375 3.37
${\mathrm{HC}}_{3}{\mathrm{NH}}^{+}$ ${11.87}_{-1.81}^{+0.45}$ ${5.55}_{-1.49}^{+1.85}$ 1.4 $0.01\mbox{--}0.01$ 2 9.375 3.38
${\mathrm{CH}}_{2}\mathrm{CHCN}$ ${12.81}_{-0.08}^{+0.22}$ ${5.00}_{-1.36}^{+2.07}$ 3.9 $0.04\mbox{--}0.21$ 12 9.375 3.39
$\mathrm{HCCCHO}$ ${11.26}_{-1.54}^{+0.95}$ ${5.29}_{-1.42}^{+1.39}$ 0.7 $0.00\mbox{--}0.08$ 1 9.375 3.40
${{\rm{C}}}_{2}{\rm{S}}$ ${14.01}_{-0.16}^{+0.14}$ ${4.19}_{-0.34}^{+0.60}$ 9.9 $0.28\mbox{--}5.08$ 8 9.375 3.41
$\mathrm{OCS}$ $\lt 13.26$ ${5.27}_{-1.39}^{+1.88}$ 0.7 $\lt 0.06$ 1 9.375 3.42
${{\rm{C}}}_{5}{\rm{H}}$ ${12.27}_{-0.10}^{+0.12}$ ${4.80}_{-0.72}^{+0.94}$ 4.7 $0.03\mbox{--}0.05$ 16 9.375 3.43
${\mathrm{CH}}_{3}{{\rm{C}}}_{4}{\rm{H}}$ ${13.17}_{-0.17}^{+0.23}$ ${6.85}_{-2.01}^{+1.68}$ 2.7 $0.01\mbox{--}0.04$ 8 2.725 3.44
${\mathrm{CH}}_{3}{{\rm{C}}}_{3}{\rm{N}}$ ${11.99}_{-0.17}^{+0.18}$ ${6.26}_{-1.71}^{+1.63}$ 2.8 $0.01\mbox{--}0.05$ 8 2.725 3.45
${{\rm{C}}}_{3}{\rm{S}}$ ${13.14}_{-0.11}^{+0.16}$ ${5.19}_{-0.70}^{+0.90}$ 7.4 $0.22\mbox{--}0.83$ 7 2.725 3.46
${{\rm{C}}}_{6}{\rm{H}}$ ${12.74}_{-0.05}^{+0.04}$ ${5.95}_{-0.45}^{+0.35}$ 6.1 $0.04\mbox{--}0.09$ 42 9.375 3.47
${\mathrm{HC}}_{5}{\rm{N}}$ ${13.77}_{-0.09}^{+0.10}$ ${6.39}_{-0.51}^{+0.54}$ 9.5 $0.20\mbox{--}2.33$ 16 9.375 3.48
${\mathrm{HC}}_{7}{\rm{N}}$ ${13.66}_{-0.10}^{+0.14}$ ${6.14}_{-0.41}^{+0.32}$ 14.3 $0.04\mbox{--}3.52$ 34 9.375 3.56
${\mathrm{HC}}_{9}{\rm{N}}$ ${13.02}_{-0.06}^{+0.06}$ ${8.51}_{-1.15}^{+1.07}$ 4.6 $0.00\mbox{--}0.03$ 16 9.375 3.57

Notes. Spectroscopic data are from the CDMS database except for species marked by a † which are from form the JPL database.

aThe abundance [X] can be derived using $N(X)/N({{\rm{H}}}_{2})$ with $N({{\rm{H}}}_{2})={10}^{22}\,{\mathrm{cm}}^{-2}$. bValues in brackets are the 1σ confidence interval. See the text for details concerning these specific molecules.

Download table as:  ASCIITypeset image

4.1. Main Isotopes

Figure 1 gives a graphical representation of the computed abundances for the main isotope-bearing molecules.

Figure 1.

Figure 1. Graphical representation of the computed abundances of molecules in TMC-1. The thick black vertical lines are the medians of the abundance distributions, while the dark and light gray rectangles represent the 68% and 95% confidence intervals, respectively. Large open circles are the values presented in Ohishi & Kaifu (1998), while smaller open circles are the values compiled in Agúndez & Wakelam (2013)

Standard image High-resolution image

Three main classes of outcomes can be identified. For some molecules, all observed lines have high opacities, and in these cases only lower limits can be placed on their abundances. This is the case for $\mathrm{CS}$ and $\mathrm{SO}$. For other molecules, the signal-to-noise ratio is not sufficient enough to obtain more than an upper limit on the abundances. This is the case for ${{\rm{C}}}_{2}{\rm{O}}$ and $\mathrm{OCS}$. For all other molecules, it is possible to determine an abundance along with an associated uncertainty.

4.2. Isotopologues

Twenty isotopologues are also detected in the Nobeyama survey of TMC-1. We have computed their abundances relative to ${{\rm{H}}}_{2}$ and the ratio of the minor isotopologues to the main isotopologues for each molecule. The results are summarized in Figure 2 and Table 3.

Figure 2.

Figure 2. Graphical representation of the computed abundances of isotopologues in TMC-1. The left plot shows the computed abundances with the same definition of the shading as for Figure 1. The right plot shows the ratios of the isotopologue to the molecule with the main isotope. Species are color coded, with main isotopologues in black, isotopologues of deuterium in red, isotopologues of ${}^{13}{\rm{C}}$ in blue, isotopologues of ${}^{34}{\rm{S}}$ in green, and isotopologues of ${}^{33}{\rm{S}}$ in purple. The solid vertical lines spanning the full height of the plot correspond to constant values of the isotopic ratio of ${\rm{D}}/{\rm{H}}=2\,\times \,{10}^{-5}$, ${}^{13}{\rm{C}}{/}^{12}{\rm{C}}=1/68$, ${}^{34}{\rm{S}}{/}^{32}{\rm{S}}=4.4\,\times \,{10}^{-2}$, ${}^{33}{\rm{S}}{/}^{32}{\rm{S}}=7.9\,\times \,{10}^{-3}$.

Standard image High-resolution image

Table 3.  Best-fit Parameters by the Bayesian Approach for Rarer Isotope-bearing Species

Species $\mathrm{log}N$ a ${T}_{\mathrm{ex}}$ PSNR τ Range Number of Min Partition Appendix
  (cm−2) (K)     Transitions Function Temp Figure
${}^{13}{\mathrm{CCH}}_{2}$ $\lt 13.01$ ${5.31}_{-1.49}^{+1.69}$ 0.9 $\lt 0.04$ 2 9.375 3.7
c-${{\rm{C}}}_{3}\mathrm{HD}$ ${12.71}_{-2.03}^{+2.19}$ ${5.23}_{-1.91}^{+2.13}$ 2.0 $0.11\mbox{--}0.28$ 4 9.375 3.8
${}^{13}\mathrm{CS}$ ${12.26}_{-1.78}^{+0.27}$ ${5.30}_{-1.66}^{+1.99}$ 2.3 $0.00\mbox{--}0.51$ 1 9.375 3.17
${{\rm{C}}}^{33}{\rm{S}}$ $\lt 12.42$ ${5.22}_{-1.40}^{+1.52}$ 1.2 $\lt 0.02$ 1 9.375 3.19
${{\rm{C}}}^{34}{\rm{S}}$ ${12.70}_{-0.14}^{+1.55}$ ${4.94}_{-1.61}^{+1.91}$ 6.9 $0.40\mbox{--}546.68$ 1 9.375 3.20
$\mathrm{HDCS}$ $\lt 13.80$ ${5.21}_{-1.57}^{+2.00}$ 1.5 $\lt 0.41$ 1 9.375 3.22
${}^{34}\mathrm{SO}$ $\lt 13.04$ ${5.35}_{-1.55}^{+1.93}$ 1.1 $\lt 0.08$ 1 9.375 3.26
${\mathrm{DC}}_{3}{\rm{N}}$ ${12.47}_{-0.24}^{+0.44}$ ${5.53}_{-1.66}^{+1.90}$ 4.3 $0.01\mbox{--}0.28$ 5 9.375 3.35
${{\rm{H}}}^{13}{\mathrm{CC}}_{2}{\rm{N}}$ ${12.25}_{-0.03}^{+0.03}$ ${6.54}_{-1.07}^{+1.32}$ 14.6 $0.01\mbox{--}0.15$ 6 9.375 3.36
${\mathrm{HC}}^{13}\mathrm{CCN}$ ${12.26}_{-0.02}^{+0.02}$ ${5.38}_{-0.64}^{+0.57}$ 20.6 $0.02\mbox{--}0.20$ 6 9.375 3.32
${\mathrm{HC}}_{2}^{13}\mathrm{CN}$ ${12.47}_{-0.03}^{+0.03}$ ${5.90}_{-0.93}^{+1.29}$ 11.5 $0.03\mbox{--}0.21$ 6 9.375 3.34
${\mathrm{DC}}_{5}{\rm{N}}$ ${12.05}_{-0.13}^{+0.16}$ ${5.80}_{-0.85}^{+0.96}$ 6.5 $0.02\mbox{--}0.04$ 6 9.375 3.50
${{\rm{H}}}^{13}{\mathrm{CC}}_{4}{\rm{N}}$ ${12.03}_{-0.23}^{+0.34}$ ${6.31}_{-1.55}^{+1.66}$ 3.8 $0.02\mbox{--}0.03$ 3 9.375 3.51
${\mathrm{HC}}^{13}{\mathrm{CC}}_{3}{\rm{N}}$ ${12.06}_{-0.20}^{+0.25}$ ${4.03}_{-0.58}^{+0.99}$ 6.9 $0.03\mbox{--}0.08$ 4 9.375 3.53
${\mathrm{HC}}_{2}^{13}{\mathrm{CC}}_{2}{\rm{N}}$ ${11.92}_{-0.11}^{+0.15}$ ${5.49}_{-1.05}^{+1.09}$ 4.8 $0.02\mbox{--}0.04$ 6 9.375 3.52
${\mathrm{HC}}_{3}^{13}\mathrm{CCN}$ ${12.25}_{-0.12}^{+0.14}$ ${4.38}_{-0.45}^{+0.49}$ 6.7 $0.03\mbox{--}0.12$ 6 9.375 3.54
${\mathrm{HC}}_{4}^{13}\mathrm{CN}$ ${12.01}_{-0.36}^{+0.23}$ ${5.61}_{-1.33}^{+1.56}$ 2.7 $0.03\mbox{--}0.04$ 5 9.375 3.49

Note.

aThe abundance [X] can be derived using $N(X)/N({{\rm{H}}}_{2})$ with $N({{\rm{H}}}_{2})={10}^{22}\,{\mathrm{cm}}^{-2}$ Spectroscopic data are from the CDMS database.

Download table as:  ASCIITypeset image

5. DISCUSSION

5.1. Outlier Lines

Using the Bayesian approach, we have demonstrated that it is possible to identify outliers in the data set: observed line intensities that are not explainable by the global model best fitting the rest of the lines for a given species. Table 4 summarizes the properties of the outlier lines.

Table 4.  List of Lines that Have an Outlier Probability Larger than 0.75.

Species Transition Observed Outlier Modeled Observed
    Frequency Probability Intensity Intensity
    ($\,\mathrm{MHz}$)   (K km s−1) (K km s−1)
${\mathrm{HC}}_{3}{\rm{N}}$ $J^{\prime} -J^{\prime\prime} ,F^{\prime} -F^{\prime\prime} $ = 1–0, 2–1 9098.329 0.90 1.3 ± 0.1 2.638 ± 0.020
${\mathrm{HC}}_{3}{\rm{N}}$ $J^{\prime} -J^{\prime\prime} ,F^{\prime} -F^{\prime\prime} $ = 5–4, 6–5 and 5–4 and 4–3 45490.332 0.93 7.2 ± 0.4 5.239 ± 0.020
${\mathrm{HC}}_{7}{\rm{N}}$ $J^{\prime} -J^{\prime\prime} $ = 11–10 12407.995 0.999 0.56 ± 0.25 0.228 ± 0.009
${\mathrm{HC}}_{7}{\rm{N}}$ $J^{\prime} -J^{\prime\prime} $ = 12–11 13535.999 0.999 0.58 ± 0.24 0.212 ± 0.011
${{\rm{C}}}_{4}{\rm{H}}$ $N^{\prime} -N^{\prime\prime} J^{\prime} -J^{\prime\prime} F^{\prime} -F^{\prime\prime} $ = 4–3, 9/2–7/2, 4–3 38049.629 0.94 0.58 ± 0.24 0.212 ± 0.011

Download table as:  ASCIITypeset image

Two molecules stand out in the outlier detection scheme, ${{\rm{H}}}_{2}\mathrm{CCN}$ and ${{\rm{C}}}_{3}{\rm{N}}$. In both cases, about half of the observed intensities are not compatible with the best-fit model (see Figure Set 3). With so many outlier lines, the problem cannot arise solely from observational effects (possible blended or misattributed lines). The origin of the issue is possibly the values of the spectroscopic catalogs such as erroneous normalized intensities or level degeneracies (the spectroscopy data for ${{\rm{H}}}_{2}\mathrm{CCN}$ is only available from CDMS, while the same data set is present and identical in both the CDMS and JPL catalogs for ${{\rm{C}}}_{3}{\rm{N}}$).

Figure 3.

Figure 3.

Top triangle plot shows the marginalized posterior distributions of each model parameters along with the median and 16th and 84th percentiles for ${\mathrm{NH}}_{3}$. The set of two-dimensional histograms in the lower triangle of the matrix of the plot are useful to identify possible correlations between parameters. The other molecules are shown in the figure set.  (The complete figure set (57 images) is available.)

Standard image High-resolution image

In these two cases, the outlier detection scheme is not used. The simpler model with just the three LTE parameters (N, Tex and ${\rm{\Delta }}V$) and a scatter term is used as described in Section 3.2 for the case when only a single line of a molecule is detected. The sampling method adjusts by increasing the scatter error added in quadrature to the observational uncertainty to encompass all observed points at the expense of a larger uncertainty on the other parameters, in particular the column density and thus the abundance.

5.2. Comparison with Previous Abundance Determination

The derived column densities and abundances for these molecules were published in Ohishi et al. (1992) and Ohishi & Kaifu (1998), which have been used since as references for abundances in TMC-1 and by extension for dark clouds in general. In Ohishi & Kaifu (1998), the derived abundances have no associated uncertainty estimates and the authors, while discussing the advantage of having detected lines for some species spanning a wide range of opacities, do not discuss the possible effect of high opacity or low signal-to-noise detection on the molecular abundances, especially for species where the number of detected lines is low (some species have only one detected line).

Ohishi & Kaifu (1998) have used the data later published in Kaifu et al. (2004) to compute the abundances of observed species. Since the local thermal equilibrium approach is used in both our study and theirs, the main difference should arise from the spectrometric data used and from the fact that our Bayesian approach enables us to compute uncertainties on the abundances and even upper or lower limits when the signal-to-noise ratio is too low or the opacity is too large for all detected lines of a given species. In Figure 1, the values from Ohishi & Kaifu (1998) are shown as large white circles. Agúndez & Wakelam (2013) made a review of the literature to identify molecular abundances in TMC-1. The results they compiled appear as a small circle in Figure 1. Most of the previously determined abundances fall within the 95% (2σ) confidence interval of our Bayesian method. We now discuss notable differences, i.e., species where the old values fall outside our 95% confidence interval. Most of the abundance determination in Ohishi & Kaifu (1998) were carried out without taking into account the hyperfine splitting of the observations. This can lead to biased results when opacities vary across the hyperfine components.

5.2.1. Polyynes

${{\rm{C}}}_{6}{\rm{H}}$ has a similar abundance as the one derived by Ohishi & Kaifu (1998). ${{\rm{C}}}_{5}{\rm{H}}$ has a new abundance that is lower by about an order of magnitude from previous values. The value we compute for linear ${{\rm{C}}}_{3}{\rm{H}}$ falls between the values of Ohishi & Kaifu (1998) and Agúndez & Wakelam (2013). The value reported for this species in Agúndez & Wakelam (2013) comes from IRAM 30 m millimeter observations (Fossé et al. 2001). The difference of a factor of two in beam size between the observations could explain the abundance difference. Cyclic ${{\rm{C}}}_{3}{\rm{H}}$ is found to be about three times more abundant.

The case of ${{\rm{C}}}_{4}{\rm{H}}$ is particular because of the electronic structure of this radical. Previous studies (Woon 1995; Mazzotti et al. 2011) have established that ${{\rm{C}}}_{4}{\rm{H}}$ presents two low-lying electronic states of ${}^{2}{{\rm{\Sigma }}}^{+}$ and ${}^{2}{\rm{\Pi }}$ symmetries. The experimental spectra of ${{\rm{C}}}_{4}{\rm{H}}$ are consistent with the ground state ${}^{2}{{\rm{\Sigma }}}^{+}$  (Dismuke et al. 1975; Gottlieb et al. 1983; Shen et al. 1990). Recently, Senent & Hochlaf (2010) confirmed that ${{\rm{C}}}_{4}{\rm{H}}$ radical presents two electronic states, ${}^{2}{{\rm{\Sigma }}}^{+}$  and ${}^{2}{\rm{\Pi }}$, which are near degenerate. Their computation confirms that the ground electronic state of ${{\rm{C}}}_{4}{\rm{H}}$ radical is ${}^{2}{{\rm{\Sigma }}}^{+}$ , which agrees with the findings of electron paramagnetic resonance by Dismuke et al. (1975). Their computation also confirms the short gap between the two low-lying electronic states. They have found that they are around 9 cm−1 and 2800 cm−1 at the (R)CCSD(T) and CASSCF levels of quantum chemical theory. The dipole moment is specific to each electronic state but since the two electronic states are very close to each other and the molecule is not static, the motion will cause electronic states to mix. However, since the concept of an average dipole moment for a mixed state is not well defined, this issue needs more studies from both theoretical and experimental points of view.

The CDMS database gives a dipole value of 0.870 $\,\mathrm{Debye}$,which corresponds to the electronic ground state of the molecule. The computed dipole value for the ${}^{2}{\rm{\Pi }}$ electronic state is 4.3 $\,\mathrm{Debye}$ (Senent & Hochlaf 2010). Nevertheless, in Table 2 and Figure 1 we show the full range of abundances permitted by any value of the dipole moment between the values computed for the ${}^{2}{{\rm{\Sigma }}}^{+}$  and ${}^{2}{\rm{\Pi }}$  states.

In the optically thin regime, the column density scales as the square of the dipole. Thus, once the correct dipole moment of ${{\rm{C}}}_{4}{\rm{H}}$ has been determined, an updated value of the abundance can be approximated using the following expression:

Equation (4)

where $N{({{\rm{C}}}_{4}{\rm{H}})}_{D}$ is the column density considering a dipole moment of value D and $N{({{\rm{C}}}_{4}{\rm{H}})}_{4.3}=3\,\times \,{10}^{13}\,{\mathrm{cm}}^{-2}$ the column density considering a dipole moment of value 4.3 Debye.

Of the detected molecules which posses distinct ortho and para states, only ${{\rm{C}}}_{4}{{\rm{H}}}_{2}$ has a number of detected lines (12) that is large enough that the abundances of the ortho and para forms can be determined independently. The median ortho-to-para abundance ratio is found to be around 6, but this ratio is highly uncertain with a one sigma confidence interval ranging from 1.7 to 70. The two ortho and para abundances bracket the abundance previously determined by Ohishi & Kaifu (1998).

5.2.2. Cyanopolyynes

Cyanopolyynes are abundant in TMC-1 with the largest cyanopolyyne present in the interstellar medium, ${\mathrm{HC}}_{11}{\rm{N}}$, first detected there by Bell et al. (1997). Our new results are comparable with previous abundance determinations in Ohishi & Kaifu (1998) except for ${\mathrm{HC}}_{7}{\rm{N}}$ and ${\mathrm{HC}}_{9}{\rm{N}}$ which we find to be about three times more abundant than was determined by Ohishi & Kaifu (1998). ${{\rm{C}}}_{3}{\rm{N}}$ is found to be six times more abundant but our derived abundance remains very uncertain (see Section 5.1).

5.2.3. Other Molecules

$\mathrm{HNCO}$ is more abundant by a factor of 3.

5.3. Isotopic Ratios

5.3.1. Sulfur

For the two minor isotopes of sulfur ${}^{33}{\rm{S}}$ and ${}^{34}{\rm{S}}$, only upper limits to the ratio can be computed. This arises because, for SO and CS, the main isotopologues have only lower limits to their abundances because of the high opacity of their lines. In the case of ${{\rm{C}}}_{3}{\rm{S}}$, it is because only an upper limit on the abundance of the ${}^{34}{\rm{S}}$ isotopologue itself can be computed because of the low signal-to-noise ratio of the observed line. The derived upper limits are in agreement with the cosmic abundance ratios of ${}^{34}{\rm{S}}{/}^{32}{\rm{S}}=4.4\,\times {10}^{-2}$, ${}^{33}{\rm{S}}{/}^{32}{\rm{S}}=7.9\,\times \,{10}^{-3}$ (Berglund & Wieser 2011).

5.3.2. Carbon

The overall median value of ${}^{12}{\rm{C}}$/${}^{13}{\rm{C}}$, considering all the ${}^{13}{\rm{C}}$ bearing molecules, is 66. For the three ${}^{13}{\rm{C}}$  isotopologues of ${\mathrm{HC}}_{3}{\rm{N}}$, the median ratio is 130 with values raging from 80 to 133. For the five ${}^{13}{\rm{C}}$  isotopologues of ${\mathrm{HC}}_{5}{\rm{N}}$, the median ratio is 54 with values ranging from 32 to 75. Recently, Taniguchi et al. (2016) have observed ${}^{13}{\rm{C}}$  isotopes of ${\mathrm{HC}}_{5}{\rm{N}}$ toward TMC-1. Our computed column densities tend to be somewhat larger than the ones they derive although within the $1\sigma $ interval we estimate.

The upper limits on the carbon isotopic ratios for the two molecules ${}^{13}\mathrm{CS}$ and ${}^{13}{\mathrm{CCH}}_{2}$ are compatible with the typical value of 68 assumed in the ISM (Milam et al. 2005)

5.3.3. Hydrogen

In the observed sample, only three molecules have measured H/D ratios: ${\mathrm{DC}}_{3}{\rm{N}}$ (H/D = 81(−68, +82)), ${\mathrm{DC}}_{5}{\rm{N}}$ (H/D = 52(−20, +26)), and c-${{\rm{C}}}_{3}\mathrm{HD}$, with an additional upper limit on $\mathrm{HDCS}$. Excluding c-${{\rm{C}}}_{3}\mathrm{HD}$ which is very uncertain, the average H/D ratio is 64. Due to differences in zero level energy, molecules tend to be enriched in deuterium in cold ISM conditions (Albertsson et al. 2013). The value we derive are typical of those found in the cold dense ISM.

6. SUMMARY

In this work, we have developed a statistical Bayesian method to determine the abundance of molecules in the interstellar medium in the presence of possible outlier points. This method has been used on previously published data from a spectral survey of TMC-1 by the Nobeyama Observatory. We derive a new reference set of abundances for TMC-1 which will be used to benchmark the chemistry of dark clouds.

Based on observations carried out with the Nobeyama Radio Observatory. The Nobeyama Radio Observatory is an open-use facility for millimeter-wave astronomy, and is operated under the National Astronomical Observatory of Japan (NAOJ). P.G., L.M., and V.W. thank ERC starting grant (3DICE, grant agreement 336474) for funding during this work. P.G.'s current postdoctoral position is funded by the INSU/CNRS. V.W. and J.C.L. acknowledge the French PCMI for funding of their research.

Facility: National Astronomical Observatory of Japan (NAOJ) 45m Radio Telescope at Nobeyama Radio Observatory (No:45m) - .

APPENDIX A: BAYESIAN METHOD: DETECTING OUTLIERS

In the Bayesian framework, the posterior $p(\theta | D)$ distribution is obtained using the Bayes' equation by multiplying the prior distribution $p(\theta )$ by the likelihood $p(D| \theta )$.

Equation (5)

Here θ is a vector of the parameters describing the model and D is the set of observed data points.

We have devised a method that enables the automatic mitigation of outlier points. This method was inspired by the discussions in Hogg et al. (2010) and Foreman-Mackey (2014) concerning the detection of outliers. The likelihood is constructed by considering two generative models, one for the data points that follow the model and the second one for the outlier points.

The likelihood for the non-outlier points ${p}_{m}({\{{y}_{i}\}}_{i=1}^{N}| N,{\rm{\Delta }}V,{T}_{\mathrm{ex}},\sigma )$ is constructed assuming that the observed uncertainties for point i are generated from a Gaussian of standard deviation ${\sigma }_{{yi}}$. To explain the additional scatter around the modeled data, an additional noise term σ is added in quadrature to the observed uncertainties. We make the hypothesis that the outlier points come from a Gaussian distribution of mean Yo and dispersion ${\sigma }_{o}$. The corresponding likelihood is written as ${p}_{o}({\{{y}_{i}\}}_{i=1}^{N}| {Y}_{o},{\sigma }_{o},I)$ in the following equations. Each data point is associated with a number qi which can be either 0 if the point is an outlier and 1 if it is not. We add an additional parameter Po representing the overall probability of the data points of being outliers.

Making the hypothesis of independent observations, the full likelihood is obtained by multiplying the individual points' likelihood values. Numerically, we consider the logarithm of the likelihood function.

Equation (6)

The total number of parameters is thus K + 7, where K is the number of observed lines. N, ${\rm{\Delta }}V$, ${T}_{\mathrm{ex}}$, σ, ${\{{q}_{i}\}}_{i=1}^{K}$, Po, Yo, ${\sigma }_{o}$.

By specifying a prior for the ${q}_{i}$ parameters, it is possible to marginalize them out analytically. The marginalization can be written:

Equation (7)

where the sum is over all the possible permutations of the qi flags.

Assuming the following prior for qi:

where Po is the prior probability that a point is draw from the outlier distribution. It is then possible to marginalize out the ${q}_{i}$ and obtain the following likelihood:

Equation (8)

It is also possible to obtain the individual posterior distributions of each p(${q}_{i}| $y) which is the probability that a point is either an outlier or not. For each data point i, this probability is written

Equation (9)

where y has been simplified to signify ${\{{y}_{i}\}}_{i=1}^{N}$.

Since establishing an analytical formulation of the posterior distribution is intractable, the posterior distribution is obtained by numerically sampling the posterior distribution using an MCMC algorithm. We use the affine invariant ensemble sampling method developed by Goodman & Weare (2010) in its Python implementation EMCEE (Foreman-Mackey et al. 2013). This sampling method uses an ensemble of walkers which sample simultaneously the posterior distribution. The number of walker has been kept fixed at 100.

For each molecule, the sampling is run during a burning time of about 10,000 steps at the end of which the convergence of the ensemble of chains is checked by plotting the running mean of each parameter.

A final run of 100 steps yields 100,000 points from which the histograms of the marginalized distribution and the two parameter correlations can be plotted.

APPENDIX B: FIGURES

The following set of figures synthesize the results of the Bayesian method for each species. The top triangle plot shows the marginalized posterior distributions of each model parameters along with the median and 16th and 84th percentiles. The set of two-dimensional histograms in the lower triangle of the matrix of plot are useful to identify possible correlations between parameters.

In the bottom left, a comparison of the observations is plotted, shown with red error bars, and the modeled integrated intensities for each lines of a given species. The indices in abscissa correspond to the line identifications from Kaifu et al. (2004) in order of increasing frequency. The modeled intensity distribution is represented by the gray intervals corresponding to 68% and 95% of the samples. The outlier probability for each point is represented by the ratio of the white to black surface of each data point. When at least one point has a probability of being an outlier larger of 50%, the 1 sigma interval of the Gaussian function that describe the outlier distribution is shown in light blue.

On the bottom right the distribution of the base 10 logarithm of opacities for each line of a given species is plotted. Again, the gray intervals represent 68% and 95% of the samples.

Please wait… references are loading.
10.3847/0067-0049/225/2/25