AI Chat Paper
Note: Please note that the following content is generated by AMiner AI. SciOpen does not take any responsibility related to this content.
{{lang === 'zh_CN' ? '文章概述' : 'Summary'}}
{{lang === 'en_US' ? '中' : 'Eng'}}
Chat more with AI
PDF (1.5 MB)
Collect
Submit Manuscript AI Chat Paper
Show Outline
Outline
Show full outline
Hide outline
Outline
Show full outline
Hide outline
Open Access

Feature Selection in Socio-Economic Analysis: A Multi-Method Approach for Accurate Predictive Outcomes

Ahmad Al-Qerem1Ali Mohd Ali2Issam Jebreen1Ahmad Nabot1Mohammed Rajab3Mohammad Alauthman4Amjad Aldweesh5( )Faisal Aburub6Someah Alangari5Musab Alzgol7
Computer Science Department, Faculty of Information Technology, Zarqa University, Zarqa 13110, Jordan
Communications and Computer Engineering Department, Faculty of Engineering, Al-Ahliyya Amman University, Amman 19328, Jordan
University Headquarter, University of Anbar, Ramadi 31001, Iraq
Department of Information Security, University of Petra, Amman 11196, Jordan
College of Computing and Information Technology, Shaqra University, Shaqra 11911, Saudi Arabia
Department of Business Intelligence and Data Analytics, University of Petra, Amman 11196, Jordan
Computer Information Systems Department, Faculty of Information Technology, Isra University, Amman 11622, Jordan
Show Author Information

Abstract

Feature selection is a cornerstone in advancing the accuracy and efficiency of predictive models, particularly in nuanced domains like socio-economic analysis. This study explores nine distinct feature selection methods, utilizing a heart disease dataset as a representative model for complex socio-economic systems. Our findings identified four universally recognized features as critical across all selection methods. However, the divergence in significance attributed to other features by different methods underscores the inherent variability in selection techniques. When the top four features were incorporated into twelve classification models, a noticeable surge in predictive accuracy was observed, emphasizing their foundational role in enhancing model outcomes. The variations among methods stress the need for a methodical and discerning approach to feature selection, especially in data-rich socio-economic landscapes. As we venture further into an era defined by data-driven decision-making, rigour and precision in feature selection become indispensable. Future research should extend this approach to broader datasets, ensuring the robustness and adaptability of our findings.

References

[1]

R. J. S. Raj, S. J. Shobana, I. V. Pustokhina, D. A. Pustokhin, D. Gupta, and K. Shankar, Optimal feature selection-based medical image classification using deep learning model in Internet of Medical Things, IEEE Access, vol. 8, pp. 58006–58017, 2020.

[2]
J. Liu and G. Wang, A hybrid feature selection method for data sets of thousands of variables, in Proc. 2nd Int. Conf. Advanced Computer Control, Shenyang, China, 2010, pp. 288–291.
[3]

X. Xue, G. Li, D. Zhou, Y. Zhang, L. Zhang, Y. Zhao, Z. Feng, L. Cui, Z. Zhou, X. Sun, et al., Research roadmap of service ecosystems: A crowd intelligence perspective, International Journal of Crowd Science, vol. 6, no. 4, pp. 195–222, 2022.

[4]

A. Got, A. Moussaoui, and D. Zouache, Hybrid filter-wrapper feature selection using whale optimization algorithm: A multi-objective approach, Expert Syst. Appl., vol. 183, p. 115312, 2021.

[5]

M. Ghosh, R. Guha, R. Sarkar, and A. Abraham, A wrapper-filter feature selection technique based on ant colony optimization, Neural Comput. Appl., vol. 32, no. 12, pp. 7839–7857, 2020.

[6]

N. D. Cilia, C. De Stefano, F. Fontanella, and A. S. di Freca, A ranking-based feature selection approach for handwritten character recognition, Pattern Recognit. Lett., vol. 121, pp. 77–86, 2019.

[7]

A. Thakkar and R. Lohiya, Attack classification using feature selection techniques: A comparative study, J. Ambient Intell. Humaniz. Comput., vol. 12, no. 1, pp. 1249–1266, 2021.

[8]
D. L. Padmaja and B. Vishnuvardhan, Comparative study of feature subset selection methods for dimensionality reduction on scientific data, in Proc. IEEE 6th Int. Conf. Advanced Computing (IACC), Bhimavaram, India, 2016, pp. 31–34.
[9]

H. Rao, X. Shi, A. K. Rodrigue, J. Feng, Y. Xia, M. Elhoseny, X. Yuan, and L. Gu, Feature selection based on artificial bee colony and gradient boosting decision tree, Appl. Soft Comput., vol. 74, pp. 634–642, 2019.

[10]
J. S. Vaibhaw and P. K. Pattnaik, Brain–computer interfaces and their applications, in An Industrial IoT Approach for Pharmaceutical Industry Growth, V. E. Balas, V. K. Solanki, and R. Kumar, eds. Amsterdam, the Netherlands: Elsevier, 2020, pp. 31–54.
[11]

B. Richhariya, M. Tanveer, and A. H. Rashid, Diagnosis of Alzheimer’s disease using universum support vector machine based recursive feature elimination (USVM-RFE), Biomed. Signal Process. Contr., vol. 59, p. 101903, 2020.

[12]
D. Marneni and S. Vemula, Analysis of COVID-19 using machine learning techniques, in Statistical Modeling in Machine Learning, T. Goswami and G. R. Sinha, eds. Amsterdam, the Netherlands: Elsevier, 2023, pp. 37–53.
[13]

A. M. Ali, M. R. Hassan, F. Aburub, M. Alauthman, A. Aldweesh, A. Al-Qerem, I. Jebreen, and A. Nabot, Explainable machine learning approach for hepatitis C diagnosis using SFS feature selection, Machines, vol. 11, no. 3, p. 391, 2023.

[14]

M. Schonlau and R. Y. Zou, The random forest algorithm for statistical learning, Stata J. Promot. Commun. Stat. Stata, vol. 20, no. 1, pp. 3–29, 2020.

[15]

J. Naskath, G. Sivakamasundari, and A. A. S. Begum, A study on different deep learning algorithms used in deep neural nets: MLP SOM and DBN, Wirel. Pers. Commun., vol. 128, no. 4, pp. 2913–2936, 2023.

[16]
D. J. Perangin-Angin and F. A. Bachtiar, Classification of stress in office work activities using extreme learning machine algorithm and one-way ANOVA F-test feature selection, in Proc. 4th Int. Seminar on Research of Information Technology and Intelligent Systems (ISRITI), Yogyakarta, Indonesia, 2021, pp. 503–508.
[17]

Z. Liu, M. Hao, and F. Tian, Ratemaking model of usage based insurance based on driving behaviors classification, International Journal of Crowd Science, vol. 6, no. 2, pp. 98–109, 2022.

[18]

M. J. Rani and D. Devaraj, Two-stage hybrid gene selection using mutual information and genetic algorithm for cancer data classification, J. Med. Syst., vol. 43, no. 8, pp. 1–11, 2019.

[19]

X. Ji, J. Wang, and Z. Yan, A stock price prediction method based on deep learning technology, International Journal of Crowd Science, vol. 5, no. 1, pp. 55–72, 2021.

[20]

H. Takcı and F. Nusrat, Highly accurate Spam detection with the help of feature selection and data transformation, Int. Arab J. Inf. Technol., vol. 20, no. 1, pp. 29–37, 2023.

[21]

E. M. Senan, M. H. Al-Adhaileh, F. W. Alsaade, T. H. H. Aldhyani, A. A. Alqarni, N. Alsharif, M. I. Uddin, A. H. Alahmadi, M. E. Jadhav, and M. Y. Alzahrani, Diagnosis of chronic kidney disease using effective classification algorithms and recursive feature elimination techniques, J. Healthc. Eng., vol. 2021, p. 1004767, 2021.

[22]

R. Dhanya, I. R. Paul, S. S. Akula, M. Sivakumar, and J. J. Nair, F-test feature selection in Stacking ensemble model for breast cancer prediction, Procedia Comput. Sci., vol. 171, pp. 1561–1570, 2020.

[23]

R. Kumar, R. Arora, V. Bansal, V. J. Sahayasheela, H. Buckchash, J. Imran, N. Narayanan, G. N. Pandian, and B. Raman, Classification of COVID-19 from chest X-ray images using deep features and correlation coefficient, Multimed. Tools Appl., vol. 81, no. 19, pp. 27631–27655, 2022.

International Journal of Crowd Science
Pages 64-78
Cite this article:
Al-Qerem A, Ali AM, Jebreen I, et al. Feature Selection in Socio-Economic Analysis: A Multi-Method Approach for Accurate Predictive Outcomes. International Journal of Crowd Science, 2025, 9(1): 64-78. https://doi.org/10.26599/IJCS.2023.9100035

210

Views

22

Downloads

1

Crossref

0

Scopus

Altmetrics

Published: 29 January 2025
© The author(s) 2025.

The articles published in this open access journal are distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/).

Return