[Home ] [Archive]   [ فارسی ]  
:: Main :: About :: Current Issue :: Archive :: Search :: Submit :: Contact ::
:: ::
Back to the articles list Back to browse issues page
Using Machine Learning Classification Algorithms in Official Statistics
Zahra Rezaei Ghahroodi , Hasan Ranji, Alireza Rezaei
Abstract:   (195 Views)
In most surveys, the occupation and job-industry related questions are asked through open-ended questions, and the coding of this information into thousands of categories is done manually. This is very time consuming and costly. Given the requirement of modernizing the statistical system of countries, it is necessary to use statistical learning methods in official statistics for primary and secondary data analysis. Statistical learning classification methods are also useful in the process of producing official statistics. The purpose of this article is to code some statistical processes using statistical learning methods and familiarize executive managers about the possibility of using statistical learning methods in the production of official statistics. Two applications of classification statistical learning methods, including automatic coding of economic activities and open-ended coding of statistical centers questionnaires using four iterative methods, are investigated. The studied methods include duplication, support vector machine (SVM) with multi-level aggregation methods, a combination of the duplication method and SVM, and the nearest neighbor method. 
Keywords: Automated Coding, Text Mining, Statistical Learning, Official Statistics.
Full-Text [PDF 8765 kb]   (8 Downloads)    
Type of Study: Applied | Subject: Official Statistics
Received: 2020/03/24 | Accepted: 2021/09/1
Send email to the article author

Add your comments about this article
Your username or Email:


XML   Persian Abstract   Print

Back to the articles list Back to browse issues page
مجله علوم آماری – نشریه علمی پژوهشی انجمن آمار ایران Journal of Statistical Sciences
Persian site map - English site map - Created in 0.04 seconds with 30 queries by YEKTAWEB 4256