Data Typology Using Technologies for Separating Mixtures of Probability Distributions
https://doi.org/10.34023/2313-6383-2022-29-6-11-24
Abstract
The authors studied heterogeneous samples (for a number of statistical populations) taken from finite mixtures of probability distribu- tions, which reflect a number of socio-economic characteristics of the Russian society, using the information resources of the Federal State Statistics Service; in this case the number of mixing distributions (components), as well as their corresponding weights and parameters, may be unknown. In terms of content, the problem of separation (decomposition) of mixtures can be reduced to estimating unknown parameters of mixing distributions and their weights.
The article considers and discusses methods for solving this problem, their advantages and disadvantages, conditions and areas of application. In the study the decomposition of mixtures of probability distributions on the population of subjects of the Russian
Federation was carried out according to three interrelated statistical indicators – the unemployment rate, the poverty rate, the level of violence. These indicators can be considered as statistical measure of achieving sustainable development goals at the regional level and, at the same time, measures of socio-economic «health» of territory. Typologies of subjects of the Russian Federation were carried out according to the listed indicators. A cross-comparison of the results of the obtained typological groupings was performed, which also made it possible to identify the subjects of the Russian Federation that differ in both negative and positive trends in the context of the indicators under consideration.
The authors underline that the results of the study can be used by the authorities to develop specific measures for the socio-economic development of Russia and its regions.
About the Authors
V. V. GlinskiyRussian Federation
Vladimir V. Glinskiy – Dr. Sci. (Econ.), Professor, Head, Research Laboratory «Sustainable Development of Social and Economic Systems»
6, Nizhegorodskaya Str., Novosibirsk, 630102
Yu. N. Ismaiylova
Russian Federation
Yuliya N. Ismaiylova – Cand. Sci. (Econ.), Associate Professor, Department of Business Analytics and Statistics
6, Nizhegorodskaya Str., Novosibirsk, 630102
S. E. Khrushchev
Russian Federation
Sergey E. Khrushchev – Cand. Sci. (Phys.-Math.), Associate Professor, Senior Researcher, Laboratory of Applied Inverse Problems
4, Acad. Koptyuga Ave., Novosibirsk, 630090
L. K. Serga
Russian Federation
Lyudmila K. Serga – Cand. Sci. (Econ.), Associate Professor, Head, Department of Business Analytics and Statistics
6, Nizhegorodskaya Str., Novosibirsk, 630102
References
1. Granberg A.G., Ershov Yu.S. Short-Term Consequences of the Impact of the Crisis on the Spatial Development of the Russian Economy. In: Suspitsyn S.A. (ed.) Optimization of Territorial Systems. Novosibirsk: IEOPP SB RAS; 2010. Pp. 206–214. (In Russ.)
2. Granberg A.G. et al. World Crisis and Russian Spatial Socio-Economic Development Strategy. Region: Economics and Sociology. 2009;(4):69–101. (In Russ.)
3. Baranov S.V., Skufina T.P. Comparative Dynamics of Economic Growth and Interregional Differentiation of the Northern Region of Russia. Voprosy Statistiki. 2015;(11):69–77. (In Russ.)
4. Glushchenko K.P. On Estimation of Inter-Regional Inequality. Prostranstvennaya Ekonomika = Spatial Economics. 2015;(4):39–58. (In Russ.) Available from: https:// doi.org/10.14530/se.2015.4.039-058.
5. Glinsky V.V., Serga L.K., Bulkina A.M. Differentiation of Municipal Entities as a Factor of Economic Development of Territories. Voprosy Statistiki. 2016;(8):46–52. (In Russ.)
6. Gubanova E.S., Kleshch V.S. Overcoming Socio-Economic Inequality as a Condition for Sustainable and Balanced Spatial Development of the Region. Economic and Social Changes: Facts, Trends, Forecast. 2019;12(5):44–57. (In Russ.) Available from: https://doi.org/10.15838/esc.2019.5.65.3.
7. Zaytseva Y. Interregional Differentiation in BRIC Countries: Possibilities for Assessment. World Economy and International Relations. 2010;(5):44–51. (In Russ.) Available from: https://doi.org/10.20542/0131-2227-2010-5-44-51.
8. Kolomak E.A. Interregional Disparities in Russia: Economic and Social Aspects. Prostranstvennaya Ekonomika = Spatial Economics. 2010;(1):26–35. (In Russ.) Available from: https://doi.org/10.14530/se.2010.1.026-035.
9. Konopatskaya E.A. Differentiation of Regions of the Russian Federation According to the Level of Development of Human Capital. Economics, Statistics and Informatics. Vestnik UMO. 2011;(2):178–184. (In Russ.)
10. Lavrovsky B.L. Regional Asymmetry in Russian Federation: Estimation and Approaches to its Reduction. HSE Economic Journal. 2003;7(4):524–537. (In Russ.)
11. Moroshkina M.V. Trans-Regional Differentiation of Subjects of the Russian Federation. Economic Analysis: Theory and Practice. 2014;13(45):20–26. (In Russ.)
12. Pogodina E.A., Kataev E.N. An Assessment of Differentiation Degree of Economic and Social Development of the Volga Federal District's Regions. National Interests: Priorities and Security. 2014;10(21):30–37. (In Russ.)
13. Uskova T.V. Spatial Development of Territories: State, Trends, Ways of Reducing Risks. Problems of Territory's Development. 2015;1(75):7–15. (In Russ.)
14. Uskova T.V. Territories’ Sustainable Development and Modern Management Methods. Problems of Territory's Development. 2020;2(106):7–18. (In Russ.) Available from: https://doi.org/10.15838/ptd.2020.2.106.1.
15. Usmanov D.I. Inequality in the Socio-Economic Development of Russian Regions. European Social Science Journal. 2014;5-1(44):476–480. (In Russ.)
16. Chemezova E.Yu. Typologies of Subjects of the Russian Federation According to the Level of Socio-Economic Development. Vestnik NSUEU. 2010;(1):171–176. (In Russ.)
17. UN. Transforming Our World: The 2030 Agenda for Sustainable Development. Resolution adopted by the General Assembly on 25 September 2015. UN, 2015. 45 p. Available from: https://documents-dds-ny.un.org/doc/UNDOC/ GEN/N15/291/89/PDF/N1529189.pdf?
18. Aivazyan S.A. et al. Applied Statistics: Classification and Dimension Reduction. Moscow: Finansy i Statistika Publ.; 1989. 607 p. (In Russ.)
19. Ismaiylova Yu.N., Khrushchev S.E. Typological Grouping Based on Decomposition of Probability Distributions Mixtures. Vestnik NSUEM. 2020;(1):255–267. (In Russ.) Available from: https://doi.org/10.34020/20736495-2020-1-255-267.
20. Pearson K. Contributions to the Mathematical Theory of Evolution. Philosophical Transactions of the Royal Society of London. 1894;(185):71–110.
21. Dempster A.P., Laird N.M., Rubin D.B. Maximum Likelihood from Incomplete Data Via the EM Algorithm (with Discussion). Journal of the Royal Statistical Society B. 1977;39(1):1–38. Available from: https://doi.org/10.1111/j.2517-6161.1977.tb01600.x
22. Celeux G., Dielbot J. The SEM Algorithm: A Probabilistic Teacher Algorithm Derived from the EM Algorithm for the Mixture Problem. Computation Statistics Quarterly. 1985;2(1):73–82.
23. Korolev V.Yu. EM-Algorithm, Its Modifications and Their Application to the Problem of Separation of Mixtures of Probability Distributions. Theoretical Review. Moscow: IPI RAN; 2007. 94 p. (In Russ.)
24. Gorshenin A.K., Korolev V.Yu., Tursunbaev A.M. Median Modification of Emand Sem-Algorithms for Separation of Mixtures of Probability Distributions and Their Application to the Decomposition of Volatility of Financial Time Series. Informatics and Applications. 2008;2(4):12–47. (In Russ.)
25. Akaike H. Information Theory and an Extension of the Maximum Likelihood Principle. In: Petrov B.N., Csaki F. (eds) Proc. of the 2nd Int. Symposium on Information Theory. Budapest: Akademiai Kiado; 1973. Pp. 267–281.
26. Schwartz G. Estimating the Dimension of a Model. The Annals of Statistics. 1978;6(2):461–464. Available from: https://doi.org/10.1214/aos/1176344136.
27. Glinskiy V.V., Ismaiylova Yu.N. Probabilistic Mixtures in Measurements of Interterritorial Differentiation. Voprosy Statistiki. 2020;27(3):53–64. (In Russ.) Available from: https://doi.org/10.34023/2313-6383-2020-27-353-64.
28. Ismaiylova Yu.N. Data Typology Based on the Decomposition of Mixtures of Probability Distributions. Cand. Economy Sci. Diss. 08.00.12. Novosibirsk: National State University of Economics; 2022. 172 p. (In Russ.)
Review
For citations:
Glinskiy V.V., Ismaiylova Yu.N., Khrushchev S.E., Serga L.K. Data Typology Using Technologies for Separating Mixtures of Probability Distributions. Voprosy statistiki. 2022;29(6):11-24. (In Russ.) https://doi.org/10.34023/2313-6383-2022-29-6-11-24