:: Volume 15, Issue 1 (9-2021) ::
JSS 2021, 15(1): 119-146 Back to browse issues page
Using Machine Learning Classification Algorithms in Official Statistics
Zahra Rezaei Ghahroodi * , Hasan Ranji , Alireza Rezaei
Abstract:   (2711 Views)
In most surveys, the occupation and job-industry related questions are asked through open-ended questions, and the coding of this information into thousands of categories is done manually. This is very time consuming and costly. Given the requirement of modernizing the statistical system of countries, it is necessary to use statistical learning methods in official statistics for primary and secondary data analysis. Statistical learning classification methods are also useful in the process of producing official statistics. The purpose of this article is to code some statistical processes using statistical learning methods and familiarize executive managers about the possibility of using statistical learning methods in the production of official statistics. Two applications of classification statistical learning methods, including automatic coding of economic activities and open-ended coding of statistical centers questionnaires using four iterative methods, are investigated. The studied methods include duplication, support vector machine (SVM) with multi-level aggregation methods, a combination of the duplication method and SVM, and the nearest neighbor method. 
Keywords: Automated Coding, Text Mining, Statistical Learning, Official Statistics.
Full-Text [PDF 325 kb]   (1733 Downloads)    
Type of Study: Applied | Subject: Official Statistics
Received: 2020/03/24 | Accepted: 2021/09/1 | Published: 2021/03/15



XML   Persian Abstract   Print



Rights and permissions
Creative Commons License This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.
Volume 15, Issue 1 (9-2021) Back to browse issues page