Abstract
In univariate and in multivariate analyses, it is difficult to identify outliers in the case of skewed or heavy-tailed distributions. In this article, we propose simple univariate and multivariate outlier identification procedures that perform well with these types of distributions while keeping the computational complexity low. We describe the commands gboxplot (univariate case) and sdasym (multivariate case), which implement these procedures in Stata.
Original language | English |
---|---|
Article number | st0533 |
Pages (from-to) | 517-532 |
Number of pages | 16 |
Journal | Stata Journal |
Volume | 18 |
Issue number | 3 |
DOIs | |
Publication status | Published - Sept 2018 |
Keywords
- Box plot
- Gboxplot
- Generalized box plot
- Outlier detection
- Outlyingness
- Projection
- Sdasym
- St0533
- Tukey g-and-h distribution