Skip to main content

Analyzing Longitudinal and Panel Data

  • Chapter
  • First Online:
Quantitative Data Analysis

Abstract

Data often comes from observations made at multiple points in time. Obtaining repeated observations on the same units allows the researcher to access a richer information set about observed units than would be possible with single observations and to map the evolution of the phenomenon over multiple periods for both individual units and overall as a trend. (For example, relationships between two variables may strengthen, weaken, or even disappear over time.) Longitudinal data can be gathered via survey instruments or archival databases that offer repeated measures on the same variables at different times.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 49.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 64.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 99.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    The household, income and labour dynamics in Australia (Hilda) survey is maintained by the Melbourne institute and all editions and addenda are available at https://www.melbourneinstitute.com/hilda/

  2. 2.

    Two examples of where the N is small compared to the data points are studies about changes in exchange rates (one currency over multiple quarters) and high-frequency data on stock prices (where observations come down to minutes or seconds).

  3. 3.

    The STATA routine to estimate OLS with White-robust standard errors is regress y x, robust.

  4. 4.

    The command to perform Fama-MacBeth [8] estimation is similar for both the FMB-cs and FMB-ts types of regressions. Mitchell Petersen of Northwestern University posts the codes for a series of popular software with which the FMB estimation can be performed at http://www.kellogg.northwestern.edu/faculty/petersen/htm/papers/se/se_programming.htm

  5. 5.

    We refer readers to Petersen’s webpage (please check footnote 7 in Petersen’s webpage), which provides codes for the Newey-West routine.

  6. 6.

    The routine in STATA to perform one-way clustering is simple: either (1) regress y x, cluster (id) if time-series correlation is a concern, or (2) regress y x, cluster (time) if cross-sectional correlation is the main problem.

  7. 7.

    Petersen’s website offers a great deal of help in the estimation of OLS with two-way clustering. The routine is simple: cluster2 y x, fcluster(id) tcluster(time).

  8. 8.

    To be consistent with a large body of literature in accounting and finance [3,4], we use the expression “pooled-OLS regression” but refer to all types of regression approaches (e.g., logit or probit, which are commonly used) wherein the regression model specified does not take into account the fact that the data have a longitudinal structure.

  9. 9.

    FE models control for all of the time-invariant factors that affect the relationship of interest, whereas OLS takes into account only the covariates that are available. In this example, there are unobservable variables that affect X and Y that FE estimation takes into account but OLS ignores. Since FE estimation is always less biased than OLS, it is preferable.

  10. 10.

    The key objective of the Breusch-Pagan test is understanding whether one cand discard the clustering that is due to the same individuals being included multiple times.

  11. 11.

    For the sake of consistency, will refer to the STATA estimation routines.

  12. 12.

    A note of caution when employing Hausman test: Results of the test are limited to the specific models fitted with FE and RE. They are not intended as a definitive response to whether we should employ FE or RE to answer the research question of interest. Changing the specification of FE or RE models by adding different covariates will lead to different results. It is good practice to re-estimate different models and compare them before making any statement or decisions.

References

  1. Wooldridge JM (2002) Econometric analysis of cross section and panel data. MIT Press, Cambridge, MA

    Google Scholar 

  2. Gow ID, Ormazabal G, Taylor DJ (2010) Correcting for cross-sectional and time-series dependence in accounting research. Account Rev 85(2):483–512

    Article  Google Scholar 

  3. Cameron A, Trivedi P (2009) Microeconometrics: methods and applications. Cambridge University Press, New York, NY

    Google Scholar 

  4. Petersen MA (2009) Estimating standard errors in finance panel data sets: comparing approaches. Rev of Financ Stud 22:435–480

    Article  Google Scholar 

  5. Enders W (2008) Applied econometric times series. Wiley, New York

    Google Scholar 

  6. Baltagi B (2014) Panel data econometrics. Routledge, London

    Google Scholar 

  7. White H (1980) A heteroscedasticity-consistent covariance matrix estimator and a direct test for heteroscedasticity. Econometrica 48:817–838

    Article  Google Scholar 

  8. Fama EF, MacBeth JD (1973) Risk, return, and equilibrium: empirical tests. J Polit Econ 81:607–636

    Article  Google Scholar 

  9. Newey WK, West KD (1987) A simple, positive semi-definite, heteroscedasticity and auto- correlation consistent covariance matrix. Econometrica 55:703–708

    Article  Google Scholar 

  10. Peel M (2014) Addressing unobserved endogeneity bias in accounting studies: control and sensitivity methods by variable type. Account Bus Res 44:545–571

    Article  Google Scholar 

  11. Cameron CA, Gelbach JB, Miller DJ (2009) Robust inference with multiway clustering. J Bus Econ Stat 29:238–249

    Article  Google Scholar 

  12. Thompson S (2011) Simple formulas for standard errors that cluster by both firm and time. J Fin Econ 99:1–10

    Article  Google Scholar 

  13. Hausman JA (1978) Specification tests in econometrics. Econometrica 46:1251–1271

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer International Publishing Switzerland

About this chapter

Cite this chapter

Mertens, W., Pugliese, A., Recker, J. (2017). Analyzing Longitudinal and Panel Data. In: Quantitative Data Analysis. Springer, Cham. https://doi.org/10.1007/978-3-319-42700-3_6

Download citation

Publish with us

Policies and ethics