| Sign up

PDF (1.5 MB)

Cite

EndNote(RIS) BibTeX

Collect

Collect

Submit Manuscript

Open Access

Feature Selection in Socio-Economic Analysis: A Multi-Method Approach for Accurate Predictive Outcomes

Ahmad Al-Qerem^¹, Ali Mohd Ali^², Issam Jebreen^¹, Ahmad Nabot^¹, Mohammed Rajab^³, Mohammad Alauthman^⁴, Amjad Aldweesh^⁵(), Faisal Aburub^⁶, Someah Alangari^⁵, Musab Alzgol^⁷

1Computer Science Department, Faculty of Information Technology, Zarqa University, Zarqa 13110, Jordan

2Communications and Computer Engineering Department, Faculty of Engineering, Al-Ahliyya Amman University, Amman 19328, Jordan

3University Headquarter, University of Anbar, Ramadi 31001, Iraq

4Department of Information Security, University of Petra, Amman 11196, Jordan

5College of Computing and Information Technology, Shaqra University, Shaqra 11911, Saudi Arabia

6Department of Business Intelligence and Data Analytics, University of Petra, Amman 11196, Jordan

7Computer Information Systems Department, Faculty of Information Technology, Isra University, Amman 11622, Jordan

Show Author Information

Abstract

Feature selection is a cornerstone in advancing the accuracy and efficiency of predictive models, particularly in nuanced domains like socio-economic analysis. This study explores nine distinct feature selection methods, utilizing a heart disease dataset as a representative model for complex socio-economic systems. Our findings identified four universally recognized features as critical across all selection methods. However, the divergence in significance attributed to other features by different methods underscores the inherent variability in selection techniques. When the top four features were incorporated into twelve classification models, a noticeable surge in predictive accuracy was observed, emphasizing their foundational role in enhancing model outcomes. The variations among methods stress the need for a methodical and discerning approach to feature selection, especially in data-rich socio-economic landscapes. As we venture further into an era defined by data-driven decision-making, rigour and precision in feature selection become indispensable. Future research should extend this approach to broader datasets, ensuring the robustness and adaptability of our findings.

Keywords

feature selection socio-economic analysis predictive models data-driven decision-making selection methods predictive accuracy

References

[1]

R. J. S. Raj, S. J. Shobana, I. V. Pustokhina, D. A. Pustokhin, D. Gupta, and K. Shankar, Optimal feature selection-based medical image classification using deep learning model in Internet of Medical Things, IEEE Access, vol. 8, pp. 58006–58017, 2020.

Crossref Google Scholar

[2]

J. Liu and G. Wang, A hybrid feature selection method for data sets of thousands of variables, in Proc. 2nd Int. Conf. Advanced Computer Control, Shenyang, China, 2010, pp. 288–291.

[3]

X. Xue, G. Li, D. Zhou, Y. Zhang, L. Zhang, Y. Zhao, Z. Feng, L. Cui, Z. Zhou, X. Sun, et al., Research roadmap of service ecosystems: A crowd intelligence perspective, International Journal of Crowd Science, vol. 6, no. 4, pp. 195–222, 2022.

Crossref Google Scholar

[4]

A. Got, A. Moussaoui, and D. Zouache, Hybrid filter-wrapper feature selection using whale optimization algorithm: A multi-objective approach, Expert Syst. Appl., vol. 183, p. 115312, 2021.

Crossref Google Scholar

[5]

M. Ghosh, R. Guha, R. Sarkar, and A. Abraham, A wrapper-filter feature selection technique based on ant colony optimization, Neural Comput. Appl., vol. 32, no. 12, pp. 7839–7857, 2020.

Crossref Google Scholar

[6]

N. D. Cilia, C. De Stefano, F. Fontanella, and A. S. di Freca, A ranking-based feature selection approach for handwritten character recognition, Pattern Recognit. Lett., vol. 121, pp. 77–86, 2019.

Crossref Google Scholar

[7]

A. Thakkar and R. Lohiya, Attack classification using feature selection techniques: A comparative study, J. Ambient Intell. Humaniz. Comput., vol. 12, no. 1, pp. 1249–1266, 2021.

Crossref Google Scholar

[8]

D. L. Padmaja and B. Vishnuvardhan, Comparative study of feature subset selection methods for dimensionality reduction on scientific data, in Proc. IEEE 6th Int. Conf. Advanced Computing (IACC), Bhimavaram, India, 2016, pp. 31–34.

[9]

H. Rao, X. Shi, A. K. Rodrigue, J. Feng, Y. Xia, M. Elhoseny, X. Yuan, and L. Gu, Feature selection based on artificial bee colony and gradient boosting decision tree, Appl. Soft Comput., vol. 74, pp. 634–642, 2019.

Crossref Google Scholar

[10]

J. S. Vaibhaw and P. K. Pattnaik, Brain–computer interfaces and their applications, in An Industrial IoT Approach for Pharmaceutical Industry Growth, V. E. Balas, V. K. Solanki, and R. Kumar, eds. Amsterdam, the Netherlands: Elsevier, 2020, pp. 31–54.

[11]

B. Richhariya, M. Tanveer, and A. H. Rashid, Diagnosis of Alzheimer’s disease using universum support vector machine based recursive feature elimination (USVM-RFE), Biomed. Signal Process. Contr., vol. 59, p. 101903, 2020.

Crossref Google Scholar

[12]

D. Marneni and S. Vemula, Analysis of COVID-19 using machine learning techniques, in Statistical Modeling in Machine Learning, T. Goswami and G. R. Sinha, eds. Amsterdam, the Netherlands: Elsevier, 2023, pp. 37–53.

[13]

A. M. Ali, M. R. Hassan, F. Aburub, M. Alauthman, A. Aldweesh, A. Al-Qerem, I. Jebreen, and A. Nabot, Explainable machine learning approach for hepatitis C diagnosis using SFS feature selection, Machines, vol. 11, no. 3, p. 391, 2023.

Crossref Google Scholar

[14]

M. Schonlau and R. Y. Zou, The random forest algorithm for statistical learning, Stata J. Promot. Commun. Stat. Stata, vol. 20, no. 1, pp. 3–29, 2020.

Crossref Google Scholar

[15]

J. Naskath, G. Sivakamasundari, and A. A. S. Begum, A study on different deep learning algorithms used in deep neural nets: MLP SOM and DBN, Wirel. Pers. Commun., vol. 128, no. 4, pp. 2913–2936, 2023.

Crossref Google Scholar

[16]

D. J. Perangin-Angin and F. A. Bachtiar, Classification of stress in office work activities using extreme learning machine algorithm and one-way ANOVA F-test feature selection, in Proc. 4th Int. Seminar on Research of Information Technology and Intelligent Systems (ISRITI), Yogyakarta, Indonesia, 2021, pp. 503–508.

[17]

Z. Liu, M. Hao, and F. Tian, Ratemaking model of usage based insurance based on driving behaviors classification, International Journal of Crowd Science, vol. 6, no. 2, pp. 98–109, 2022.

Crossref Google Scholar

[18]

M. J. Rani and D. Devaraj, Two-stage hybrid gene selection using mutual information and genetic algorithm for cancer data classification, J. Med. Syst., vol. 43, no. 8, pp. 1–11, 2019.

Crossref Google Scholar

[19]

X. Ji, J. Wang, and Z. Yan, A stock price prediction method based on deep learning technology, International Journal of Crowd Science, vol. 5, no. 1, pp. 55–72, 2021.

Crossref Google Scholar

[20]

H. Takcı and F. Nusrat, Highly accurate Spam detection with the help of feature selection and data transformation, Int. Arab J. Inf. Technol., vol. 20, no. 1, pp. 29–37, 2023.

Crossref Google Scholar

[21]

E. M. Senan, M. H. Al-Adhaileh, F. W. Alsaade, T. H. H. Aldhyani, A. A. Alqarni, N. Alsharif, M. I. Uddin, A. H. Alahmadi, M. E. Jadhav, and M. Y. Alzahrani, Diagnosis of chronic kidney disease using effective classification algorithms and recursive feature elimination techniques, J. Healthc. Eng., vol. 2021, p. 1004767, 2021.

Crossref Google Scholar

[22]

R. Dhanya, I. R. Paul, S. S. Akula, M. Sivakumar, and J. J. Nair, F-test feature selection in Stacking ensemble model for breast cancer prediction, Procedia Comput. Sci., vol. 171, pp. 1561–1570, 2020.

Crossref Google Scholar

[23]

R. Kumar, R. Arora, V. Bansal, V. J. Sahayasheela, H. Buckchash, J. Imran, N. Narayanan, G. N. Pandian, and B. Raman, Classification of COVID-19 from chest X-ray images using deep features and correlation coefficient, Multimed. Tools Appl., vol. 81, no. 19, pp. 27631–27655, 2022.

Crossref Google Scholar

International Journal of Crowd Science

Volume 9 Issue 1,
March 2025

Pages 64-78

DOI: 10.26599/IJCS.2023.9100035

Cite this article:

Al-Qerem A, Ali AM, Jebreen I, et al. Feature Selection in Socio-Economic Analysis: A Multi-Method Approach for Accurate Predictive Outcomes. International Journal of Crowd Science, 2025, 9(1): 64-78. https://doi.org/10.26599/IJCS.2023.9100035

About Us

Learn about Open Access

Tsinghua University Press

Publish with Us

Peer Review Policy

Copyright and Licensing

Article Processing Charge

Contact Us

Journal Collaboration: Yao Meng (Ms.)✉️ +86-10-83470574

Technical Support: Kuo Zhao (Mr.)✉️ +86-10-83470507

Media Contact: Hao Jin (Mr.)✉️ +86-10-83470559

Address: Floor 6, Tower B, Xueyan Building, Shuangqing Road, Haidian District, Beijing 100084, China.

SciOpen——中国科技期刊卓越行动计划支持项目

Copyright © 2025 Tsinghua University Press Ltd.

京ICP备 10035462号-42 京公网安备11010802044758号