ISSN: 2074-8132
ISSN: 2074-8132
En Ru
On the correction for the number of observations in the calculation of Mahalanobis distances

On the correction for the number of observations in the calculation of Mahalanobis distances

Recieved: 03/31/2025

Accepted: 04/24/2025

Published: 05/24/2025

Keywords: Mahalanobis distances; sample size; craniometry

Available online: 24.05.2025

To cite this article

Shirobokov Ivan G. On the correction for the number of observations in the calculation of Mahalanobis distances. // Lomonosov Journal of Anthropology 2025. Vol. 23. Issue 2. 104-114 https://doi.org/10.55959/MSU2074-8132-25-2-9.

This work is licensed under a Creative Commons: Attribution 4.0 International (CC BY 4.0). (CC BY 4.0). (https://creativecommons.org/licenses/by/4.0/deed.ru)
Issue 2, 2025

Abstract

Introduction. Mahalanobis distances (D²) are used in craniological studies to provide a generalised assessment of differences between samples, taking into account trait variances and correlations. However, sample distances tend to be biased upwards compared to the true D² value, especially in the case of small sample sizes. An adjustment for the number of observations, proposed by D. Rightmire, is one way of compensating for this bias. However, the conditions for its application and even the methods for calculating the variables may vary according to the characteristics of the sample. The aim of this study is to evaluate the effective impact of the adjustment on Mahalanobis distances and to explore approaches to reduce bias in sample estimates.

Materials and methods. The analysis was based on three aggregated series of male skulls fr om Bashkirs, Chuvash and Latvians. D² was calculated using an averaged covariance matrix. Three methods of calculating Mahalanobis distances were considered: with no adjustment for the number of observations, with Rightmire's adjustment applied to all distances, and with an adjustment that considers only statistically significant distances. In addition, alternative approaches to adjustment were tested in cases wh ere there were large differences in the number of observations for individual traits: using the harmonic mean of the sample sizes and calculating D² separately for linear and angular traits, followed by summation.

Results. Rightmire's correction, when applied to all distances, generally provides accurate and unbiased estimates of D². When adjustments are applied, the D² value may approach zero or even become negative, even in cases where there are significant differences between populations. Since the medians of the sample D² values are closest to the true D² values, all negative D² values can be safely transformed to zero without compromising the calculations. A D² value of zero does not imply that there are no morphological differences between samples. To approximate the true distance between samples, confidence intervals can be calculated, e.g. using bootstrapping procedures.

Conclusion. Rightmire's correction allows for unbiased Mahalanobis distance estimates when using an averaged covariance matrix and small sample sizes. However, the choice of a specific correction method should take into account the sample size and the variability in the number of observations for different traits. When working with poorly preserved cranial series, it is advisable to use the harmonic mean of the sample sizes or separate distance calculations for angular and linear traits. © 2025. This work is licensed under a CC BY 4.0 license.

References

Goncharov I.A., Goncharova N.N. Programma MultiCan dlya analiza mnogomernyh massivov dannyh s ispolzovaniem statistik vyborok i parametrov generalnoj sovokupnosti (MultiCan) [MultiCan program for analyzing multidimensional data arrays using sample statistics and population parameters (MultiCan)]. Svidetelstvo o registracii prav na PO №2016610803, Moscow, 2016. (In Russ).

Kozintsev A.G. Scythians of the North Pontic Region: Between-group cranial variation, affinities, and origins. Archaeology, Ethnology and Anthropology of Eurasia, 2007, 32 (4), pp. 143–157. (In Russ.). DOI:10.1134/s1563011007040135.

Kozintsev A.G. Craniometric evidence of the early Caucasoid migrations to Siberia and Eastern Central Asia, with reference to the Indo-European problem. Archaeology, Ethnology and Anthropology of Eurasia, 2009, 40 (4), pp. 125–136. (In Russ.). DOI: 10.1016/j.aeae.2010.02.014.

Shirobokov I.G., Moiseev V.G., Kozintsev A.G., Khartanovich V.I. Chistov Y.K., Gromov A.V. Individualnye kraniometricheskie dannye blizkih k sovremennosti grupp naseleniya Vostochnoj i Severo-Vostochnoj Evropy [Individual craniometric data of modern population groups from Eastern and North-Eastern Europe]. Elektronnoe izdanie. Saint Petersburg: MAE RAS Publ., 2017. (In Russ).

Rightmire G.P. On the computation of Mahalanobis’ generalized distance (D2). American Journal of Physical Anthropology, 1969, 30, pp. 157–160.

Sjøvold T. Some notes on the distribution and certain modifications of Mahalanobis generalized distance. Journal of Human Evolution, 1975, 4, pp. 549–558.