Ensemble Gene Selection Method Based on Multiple Tree Models

Mingzhu Lou, Journal of Information Processing Systems Vol. 19, No. 5, pp. 652-662, Oct. 2023  

Keywords: Ensemble Tree Model, Gradient Boosting Decision Tree, Gene Selection, ID3, Random Forest


Identifying highly discriminating genes is a critical step in tumor recognition tasks based on microarray gene expression profile data and machine learning. Gene selection based on tree models has been the subject of several studies. However, these methods rely on a single-tree model, which is often not robust to ultra-high-dimensional microarray datasets, resulting in the loss of useful information and unsatisfactory classification accuracy. Motivated by the limitations of single-tree-based gene selection, this study investigated ensemble gene selection methods based on multiple-tree models to improve the classification performance of tumor identification. Specifically, we selected the three most representative tree models: ID3, random forest, and gradient boosting decision tree. Each tree model selects the top-n genes from the microarray dataset based on its intrinsic mechanism. Subsequently, three ensemble gene selection methods were investigated, namely multiple-tree model intersection, multiple-tree module union, and multiple-tree module cross-union. Experimental results on five benchmark public microarray gene expression datasets showed that the multiple-tree module union is significantly superior in classification accuracy to gene selection based on a single tree model and to other competitive gene selection methods.
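
The ensemble step described above can be illustrated with a minimal sketch. It assumes each tree model has already produced an importance score per gene; the score dictionaries, gene names, and helper functions below are hypothetical and are not taken from the paper. The abstract does not define cross-union, so the sketch takes it, purely as an assumption, to mean the genes chosen by at least two of the three models.

```python
from itertools import combinations


def top_n(scores, n):
    """Return the n gene names with the highest importance scores."""
    # Ties are broken alphabetically so the result is deterministic.
    ranked = sorted(scores, key=lambda gene: (-scores[gene], gene))
    return set(ranked[:n])


def intersection(selections):
    """Genes selected by every model."""
    return set.intersection(*selections)


def union(selections):
    """Genes selected by at least one model."""
    return set.union(*selections)


def cross_union(selections):
    """ASSUMPTION: union of pairwise intersections, i.e. genes picked
    by at least two models. The paper's definition may differ."""
    result = set()
    for first, second in combinations(selections, 2):
        result |= first & second
    return result


# Hypothetical per-gene importance scores from the three tree models.
id3_scores = {"g1": 0.90, "g2": 0.70, "g3": 0.10, "g4": 0.05, "g5": 0.60}
rf_scores = {"g1": 0.80, "g2": 0.20, "g3": 0.75, "g4": 0.10, "g5": 0.30}
gbdt_scores = {"g1": 0.85, "g2": 0.68, "g3": 0.20, "g4": 0.70, "g5": 0.65}

# Each model contributes its own top-n genes (n = 3 here).
selections = [top_n(s, 3) for s in (id3_scores, rf_scores, gbdt_scores)]

genes_intersection = intersection(selections)
genes_union = union(selections)
genes_cross_union = cross_union(selections)

print(sorted(genes_intersection))
print(sorted(genes_union))
print(sorted(genes_cross_union))
```

In practice the scores would come from fitted models (for example, information gain for ID3 or impurity-based importances for random forest and gradient boosting), and the selected genes would then be passed to a classifier for evaluation.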

Cite this article
[APA Style]
Lou, M. (2023). Ensemble Gene Selection Method Based on Multiple Tree Models. Journal of Information Processing Systems, 19(5), 652-662. DOI: 10.3745/JIPS.04.0290.

[IEEE Style]
M. Lou, "Ensemble Gene Selection Method Based on Multiple Tree Models," Journal of Information Processing Systems, vol. 19, no. 5, pp. 652-662, 2023. DOI: 10.3745/JIPS.04.0290.

[ACM Style]
Mingzhu Lou. 2023. Ensemble Gene Selection Method Based on Multiple Tree Models. Journal of Information Processing Systems, 19, 5, (2023), 652-662. DOI: 10.3745/JIPS.04.0290.