Cancer progression among patients can be assessed by constructing a set of gene markers with statistical data analysis methods. A central difficulty in the statistical study of such data is that the number of genes is large relative to the number of samples. Dimensionality reduction is therefore essential for discarding uninformative genes and finding the optimal number of genes needed to predict the desired classes accurately. Choosing an appropriate method also helps extract valuable information and improves the efficiency of the machine learning model. This article uses an ensemble learning approach, the random support vector machine cluster, to find the optimal feature set. Using real data, it is shown that randomly projecting the original high-dimensional feature space onto multiple lower-dimensional feature subspaces and combining support vector machine classifiers not only identifies the genes essential in causing prostate cancer but also increases classification precision.
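The general idea of combining support vector machines trained on random low-dimensional feature subspaces can be illustrated with a minimal sketch. It is not the authors' implementation. Several points are assumptions made for illustration only: the "random projection" is read as choosing a random subset of features, each base classifier is a linear SVM trained with a Pegasos-style subgradient method, the best base classifiers are kept according to held-out accuracy, and genes are ranked by how often they appear in the kept subspaces. All function names, parameter values, and the synthetic data are hypothetical, and a real analysis would normally rely on an established SVM library.

```python
import random

def train_linear_svm(X, y, lam=0.01, epochs=20, rng=None):
    """Pegasos-style stochastic subgradient descent for a linear SVM (no bias term)."""
    rng = rng or random.Random(0)
    w = [0.0] * len(X[0])
    t = 0
    for _ in range(epochs):
        order = list(range(len(X)))
        rng.shuffle(order)
        for i in order:
            t += 1
            eta = 1.0 / (lam * t)  # decaying step size
            margin = y[i] * sum(wj * xj for wj, xj in zip(w, X[i]))
            # Shrink the weights (regularization step) ...
            w = [(1.0 - eta * lam) * wj for wj in w]
            # ... and move toward the sample if it violated the margin.
            if margin < 1.0:
                w = [wj + eta * y[i] * xj for wj, xj in zip(w, X[i])]
    return w

def predict_one(w, x):
    return 1 if sum(wj * xj for wj, xj in zip(w, x)) >= 0 else -1

def fit_random_svm_cluster(X, y, n_models=30, subspace_size=5, keep=11, seed=0):
    """Train SVMs on random feature subsets and keep the `keep` most accurate ones."""
    rng = random.Random(seed)
    n_features = len(X[0])
    # Hold out part of the samples to score each base classifier.
    idx = list(range(len(X)))
    rng.shuffle(idx)
    cut = int(0.7 * len(idx))
    train_idx, val_idx = idx[:cut], idx[cut:]
    models = []
    for _ in range(n_models):
        feats = rng.sample(range(n_features), subspace_size)  # random subspace
        X_train = [[X[i][j] for j in feats] for i in train_idx]
        y_train = [y[i] for i in train_idx]
        w = train_linear_svm(X_train, y_train, rng=rng)
        hits = sum(predict_one(w, [X[i][j] for j in feats]) == y[i] for i in val_idx)
        models.append((hits / len(val_idx), feats, w))
    models.sort(key=lambda m: m[0], reverse=True)
    return models[:keep]

def predict(cluster, x):
    """Majority vote over the kept base classifiers."""
    votes = sum(predict_one(w, [x[j] for j in feats]) for _, feats, w in cluster)
    return 1 if votes >= 0 else -1

def feature_frequency(cluster, n_features):
    """Count how often each feature appears in the kept subspaces."""
    counts = [0] * n_features
    for _, feats, _ in cluster:
        for j in feats:
            counts[j] += 1
    return counts

def make_toy_data(n_samples=60, n_features=20, seed=1):
    """Synthetic stand-in for expression data: only features 0 and 1 carry signal."""
    rng = random.Random(seed)
    X, y = [], []
    for i in range(n_samples):
        label = 1 if i % 2 == 0 else -1
        row = [rng.gauss(0.0, 1.0) for _ in range(n_features)]
        row[0] += 2.0 * label
        row[1] += 2.0 * label
        X.append(row)
        y.append(label)
    return X, y

X, y = make_toy_data()
cluster = fit_random_svm_cluster(X, y)
preds = [predict(cluster, x) for x in X]
accuracy = sum(p == t for p, t in zip(preds, y)) / len(y)
counts = feature_frequency(cluster, len(X[0]))
print("accuracy of the voted ensemble on the toy data:", accuracy)
print("feature counts among kept classifiers:", counts)
```

In this sketch, features that appear frequently among the most accurate base classifiers play the role of candidate marker genes. The accuracy printed here is measured on the same toy samples used for fitting and selection, so it is optimistic; an honest estimate would need a separate test set or cross-validation.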
Mosavi N, Golalizadeh M. A New Approach in Using Random Support Vector Machine Cluster in Analyzing Prostate Cancer Gene Expression Data. JSS 2024; 17 (2) URL: http://jss.irstat.ir/article-1-830-en.html