Multivariate outlier detection in Stata

Vincenzo Verardi, Catherine Dehon

Research output: Contribution to journalArticlepeer-review

Abstract

Before implementing any multivariate statistical analysis based on empirical covariance matrices, it is important to check whether outliers are present because their existence could induce significant biases. In this article, we present the minimum covariance determinant estimator, which is commonly used in robust statistics to estimate location parameters and multivariate scales. These estimators can be used to robustify Mahalanobis distances and to identify outliers. Verardi and Croux (1999, Stata Journal 9: 439-453; 2010, Stata Journal 10: 313) programmed this estimator in Stata and made it available with the med command. The implemented algorithm is relatively fast and, as we show in the simulation example section, outperforms the methods already available in Stata, such as the Hadi method. © 2010 StataCorp LP.
Original languageEnglish
Pages (from-to)259-266
Number of pages8
JournalStata Journal
Volume10
Issue number2
Publication statusPublished - 2010

Keywords

  • Detection
  • Med
  • Minimum covariance determinant
  • Multivariate outliers
  • Robustness
  • St0192

Fingerprint Dive into the research topics of 'Multivariate outlier detection in Stata'. Together they form a unique fingerprint.

Cite this