Skip to main content
Log in

Learning Causal Structure from Multiple Datasets with Similar Variable Sets

  • Invited paper
  • Published:
Behaviormetrika Aims and scope Submit manuscript

Abstract

While randomized controlled experiments are often considered the gold standard for predicting causal relationships between variables, they are expensive if one is interested in understanding the complete set of causal relationships governing a large set of variables and it may not be possible to manipulate certain variables due to ethical or practical constraints. To address these scenarios, procedures have been developed which use conditional independence relationships among variables when they are passively observed to predict which variables may or may not be causally related to other variables. Until recently, most of these procedures assumed that the data consisted of a single i.i.d. dataset of observations, but in practice researchers often have access to multiple similar datasets, e.g. from multiple labs studying the same problem, which measure slightly different variable sets and where recording conventions and procedures may vary. This paper discusses recent state of the art approaches for predicting causal relationships using multiple observational and experimental datasets in these contexts.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  • Bahadur, R. R. (1971). Some Limit Theorems in Statistics. SIA., Philadelphia.

    Book  Google Scholar 

  • Chickering, D. M. (1995). Learning Bayesian networks is NP-complete. In Proceedings of the 5th International Conference on Artificial Intelligence and Statistics.

    Google Scholar 

  • Chickering, D. M. (2002). Optimal structure identification with greedy search. Journal of Machine Learning Research, 3:507–554.

    MathSciNet  MATH  Google Scholar 

  • Claassen, T. and Heske., T. (2010). Causal discovery discovery in multiple models from different experiments. In Advances in Neural Information Processing Systems 22.

    Google Scholar 

  • Claassen, T. and Heske., T. (2012). A Bayesian approach to constraint-based causal inference. In Proceedings of 28th Conference on Uncertainty in Artificial Intelligence.

    Google Scholar 

  • de Campo., C., Zen., Z., and J., Q. (2009). Structure learning of Bayesian networks using constraints. In Proceedings of the 26th International Conference on Machine Learning.

    Google Scholar 

  • Eaton, D. and Murph., K. (2007). Exact Bayesian structure learning from uncertain interventions. In Proceedings of the 10th International Conference on Artificial Intelligence and Statistics.

    Google Scholar 

  • Eberhard., F., Hoye., P. O., and Scheine., R. (2010). Combining experiments to discover linear cyclic models with latent variables. In Proceedings of the 13th International Conference on Artificial Intelligence and Statistics.

    Google Scholar 

  • Eberhardt, F. and Scheine., R. (2007). Interventions and causal inference. Philosophy of Science, 74(5):5–995.

    Article  MathSciNet  Google Scholar 

  • Fisher, R. A. (1950). Statistical Methods for Research Workers. Oliver and Boy, London, 11th edition.

    MATH  Google Scholar 

  • Gomes, C., Kautz, H., Sabharwal, A., and Selma., B. (2007). Satisfiability solvers. In Hendle., J., Kitan., H., and Nebe., B., editor., Handbook of Knowledge Representation. Elsevier B.V., Oxford.

    Google Scholar 

  • Hauser, A. and Bühlmann, P. (2012). Characterization and greedy learning of interventional Markov equivalence classes of directed acyclic graphs. Journal of Machine Learning Research, 13:2409–2464.

    MathSciNet  MATH  Google Scholar 

  • Heckerman, D., Geiger, D., and Chickerin., D. M. (1995). Learning Bayesian networks: The combination of knowledge and statistical data. Machine Learning, 20:197–243.

    MATH  Google Scholar 

  • Hyttinen, A., Eberhardt, F., and Hoye., P. O. (2011). Noisy-OR models with latent confounding. In Proceedings of the 27th Conference on Uncertainty in Artificial Intelligence.

    Google Scholar 

  • Hyttinen, A., Eberhardt, F., and Hoye., P. O. (2012). Learning linear cyclic causal models with latent variables. Journal of Machine Learning Research, 13:3387–3439.

    MathSciNet  MATH  Google Scholar 

  • Hyttinen, A., Eberhardt, F., and Hoye., P. O. (2013a). Experiment selection for causal discovery. Journal of Machine Learning Research. In press.

    Google Scholar 

  • Hyttinen, A., Hoyer, P. O., Eberhardt, F., and Järvisalo, M. (2013b). Discovering cyclic causal models with latent variables: A general SAT-based procedure. In Proceedings of the 29th Conference on Uncertainty in Artificial Intelligence.

    Google Scholar 

  • Jaakkola, T., Sontag, D., Globerson, A., and Meil., M. (2010). Learning Bayesian network structure using LP relaxations. In Proceedings of the 13th International Conference on Artificial Intelligence and Statistics.

    Google Scholar 

  • Koivisto, M. and Soo., K. (2004). Exact Bayesian structure discovery in Bayesian networks. Journal of Machine Learning Research, 5:549–573.

    MathSciNet  MATH  Google Scholar 

  • Lauritzen, S. L., Dawid, A. P., Larsen, B. N., and Leime., H. G. (1990). Independence properties of directed Markov fields. Networks, 20:491–505.

    Article  MathSciNet  Google Scholar 

  • Lazar, N. A., Luna, B., Sweeney, J. A., and Edd., W. F. (2002). Combining brains: A survey of methods for statistical pooling of information. NeuroImage, 16:538–550.

    Article  Google Scholar 

  • Meek, C. (1995a). Causal inference and causal explanation with background knowledge. In Proceedings of the 11th Conference on Uncertainty in Artificial Intelligence.

    Google Scholar 

  • Meek, C. (1995b). Strong completeness and faithfulness in Bayesian networks. In Proceedings of the 11th International Conference on Artificial Intelligence and Statistics.

    Google Scholar 

  • Nyberg, E. and Kor., K. (2006). Informative interventions. In Russ., F. and Williamso., J., editor., Causality and Probability in the Sciences. College Publication, London.

    Google Scholar 

  • Parviainen, P. and Koivist., M. (2009). Exact structure discovery in Bayesian networks with less space. In Proceedings of the 25th Conference on Uncertainty in Artificial Intelligence.

    Google Scholar 

  • Pearl, J. (1988). Probabilistic Reasoning in Intelligent Systems: Networks of Plausible Inference. Morgan Kauffmann Publishers.

    MATH  Google Scholar 

  • Pearl, J. (2000). Causality: Model, Reasoning, and Inference. Cambridge University Press.

    MATH  Google Scholar 

  • Ramsey, J. D., Hanson, S. J., Hanson, C., Halchenko, Y. O., Poldrack, R. A., and Glymou., C. (2010). Six problems for causal inference from fMRI. NeuroImage, 49(2):2–1558.

    Article  Google Scholar 

  • Richardson, T., Bailer, H., and Banerje., M. (1999). Specification searches using MAG models. In Proceedings of the 52nd ISI World Statistics Congress.

    Google Scholar 

  • Richardson, T. and Spirte., P. (2002). Ancestral graph Markov models. Annals of Statistics, 30(4):4–1030.

    MathSciNet  Google Scholar 

  • Silander, S. and Myllymak., M. (2006). A simple approach for finding the globally optimal Bayesian network structure. In Proceedings of the 22th Conference on Uncertainty in Artificial Intelligence.

    Google Scholar 

  • Silva, R. and Ghahraman., Z. (2009). The hidden life of latent variables: Bayesian learning with mixed graph models. Journal of Machine Learning Research, 10:1187–1238.

    MathSciNet  MATH  Google Scholar 

  • Spirtes, P. (1995). Directed cyclic graphical representation of feedback models. In Proceedings of the 11th Conference on Uncertainty in Artificial Intelligence, pages 491–498.

    Google Scholar 

  • Spirtes, P., Glymour, C., and Scheine., R. (2000). Causation, Predictio, and Search. MIT Pres, 2nd edition.

    Google Scholar 

  • Spirtes, P., Meek, C., and Richardso., T. (1995). Causal inference in the presence of latent variables and selection bias. In Proceedings of 11th Conference on Uncertainty in Artificial Intelligence.

    Google Scholar 

  • Sutton, A. J., Abrams, K. R., Jones, D. R., Sheldon, T. A., and Son., F. (2000). Methods for Meta-Analysis in Medical Research. John Wiley & Son, New York.

    Google Scholar 

  • Tillman, R. E. (2009). Structure learning with independent non-identically distributed data. In Proceedings of the 26th International Conference on Machine Learning.

    Google Scholar 

  • Tillman, R. E., Danks, D., and Glymou., C. (2009). Integerating locally learned causal structures with overlapping variables. In Advances in Neural Information Processing Systems 21.

    Google Scholar 

  • Tillman, R. E. and Spirte., P. (2011). Learning equivalence classes of acyclic models with latent and selection variables from multiple datasets with overlapping variables. In Proceedings of the 14th International Conference on Artificial Intelligence and Statistics.

    Google Scholar 

  • Triantafillou, S., Tsamardinos, I., and Tolli., I. G. (2010). Learning causal structure from overlapping variable sets. In Proceedings of the 13th International Conference on Artificial Intelligence and Statistics.

    Google Scholar 

  • Tsamardinos, I., Brown, L. E., and Aliferi., C. F. (2006). The max-min hill-climbing Bayesian network structure learning algorithm. Machine Learning, 65(1):1–78.

    Article  Google Scholar 

  • Tsamardinos, I., Triantafillou, S., and Lagan., V. (2012). Causal reasoning with ancestral graphs. Journal of Machine Learning Research, 13:1097–1157.

    Google Scholar 

  • Verma, T. S. and Pear., J. (1990). On equivalence of causal models. Technical repor, Cognitive Systems Laborator, University of Californi, Los Angeles.

    Google Scholar 

  • Zhang, J. (2007). A characterization of Markov equivalence classes for causal models with latent variables. In Proceedings of Uncertainty in Artificial Intelligence.

    Google Scholar 

  • Zhang, J. (2008). Causal reasoning with ancestral graphs. Journal of Machine Learning Research, 9:1437–1474.

    MathSciNet  MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Robert E. Tillman.

About this article

Cite this article

Tillman, R.E., Eberhardt, F. Learning Causal Structure from Multiple Datasets with Similar Variable Sets. Behaviormetrika 41, 41–64 (2014). https://doi.org/10.2333/bhmk.41.41

Download citation

  • Received:

  • Revised:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.2333/bhmk.41.41

Key Words and Phrases

Navigation