AI Chat Paper
Note: Please note that the following content is generated by AMiner AI. SciOpen does not take any responsibility related to this content.
{{lang === 'zh_CN' ? '文章概述' : 'Summary'}}
{{lang === 'en_US' ? '中' : 'Eng'}}
Chat more with AI
PDF (1,006.2 KB)
Collect
Submit Manuscript AI Chat Paper
Show Outline
Outline
Show full outline
Hide outline
Outline
Show full outline
Hide outline
Open Access

Navigating the ethical and privacy concerns of big data and machine learning in decision making

Department of Arts, Communications & Social Sciences, University Canada West, Vancouver V6Z O5E, Canada
Show Author Information

Abstract

In recent years, the fields of big data and machine learning have gained significant attention for their potential to revolutionize decision-making processes. The vast amounts of data generated by various sources can provide valuable insights to inform decisions across a range of domains, from business and finance to healthcare and social policy. Machine learning algorithms enable computers to learn from data and improve their performance over time, thereby enhancing their ability to make predictions and identify patterns. This article provides a comprehensive overview of how big data and machine learning can improve decision-making processes between 2017–2022. It covers key concepts and techniques involved in these tools, including data collection, data preprocessing, feature selection, model training, and evaluation. The article also discusses the potential benefits and limitations of these tools and explores the ethical and privacy concerns associated with their use. In particular, it highlights the need for transparency and fairness in decision-making algorithms and the importance of protecting individuals’ privacy rights. The review concludes by highlighting future research opportunities and challenges in this rapidly evolving field, including the need for more robust and interpretable models, as well as the integration of human decision making with machine learning algorithms. Ultimately, this review aims to provide insights for researchers and practitioners seeking to leverage big data and machine learning to improve decision-making processes in various domains.

References

[1]

H. Taherdoost and M. Madanchian, Multi-criteria decision making (MCDM) methods and concepts, Encyclopedia, vol. 3, no. 1, pp. 77–87, 2023.

[2]

B. H. Sparks and J. T. McCann, Factors influencing business intelligence system use in decision making and organisational performance, Int. J. Sustain. Strateg. Manag., vol. 5, no. 1, pp. 31–54, 2016.

[3]

N. Elgendy, A. Elragal, and T. Päivärinta, DECAS: a modern data-driven decision theory for big data and analytics, J. Decis. Syst., vol. 31, no. 4, pp. 337–373, 2022.

[4]

N. Tantalaki, S. Souravlas, and M. Roumeliotis, Data-driven decision making in precision agriculture: The Rise of big data in agricultural systems, J. Agric. Food Inf., vol. 20, no. 4, pp. 344–380, 2019.

[5]

S. Ren, Y. Zhang, Y. Liu, T. Sakao, D. Huisingh, and C. M. V. B. Almeida, A comprehensive review of big data analytics throughout product lifecycle to support sustainable smart manufacturing: A framework, challenges and future research directions, J. Clean. Prod., vol. 210, pp. 1343–1365, 2019.

[6]

H. Chen, R. H. L. Chiang, and V. C. Storey, Business intelligence and analytics: From big data to big impact, MIS Q., vol. 36, no. 4, p. 1165, 2012.

[7]

J. Guedalia, M. Lipschuetz, M. Novoselsky-Persky, S. M. Cohen, A. Rottenstreich, G. Levin, S. Yagel, R. Unger, and Y. Sompolinsky, Real-time data analysis using a machine learning model significantly improves prediction of successful vaginal deliveries, Am. J. Obstet. Gynecol., vol. 223, no. 3, pp. 437.e1–437.e15, 2020.

[8]

J. C. Peterson, D. D. Bourgin, M. Agrawal, D. Reichman, and T. L. Griffiths, Using large-scale experiments and machine learning to discover theories of human decision-making, Science, vol. 372, no. 6547, pp. 1209–1214, 2021.

[9]

M. Mohammadpoor and F. Torabi, Big Data analytics in oil and gas industry: An emerging trend, Petroleum, vol. 6, no. 4, pp. 321–328, 2020.

[10]

T. Lysaght, H. Y. Lim, V. Xafis, and K. Y. Ngiam, AI-assisted decision-making in healthcare: The application of an ethics framework for big data in health and research, Asian Bioeth. Rev., vol. 11, pp. 299–314, 2019.

[11]
W. M. S. Yafooz, Z. B. Abu Bakar, S. K. Ahammad Fahad, and A. M Mithun, Business intelligence through big data analytics, data mining and machine learning, in Proc. 8th Int. Conf. Data Management, Analytics and Innovation, Vellore, India, 2019, pp. 217–230.
[12]

A. Mohamed, M. K. Najafabadi, Y. B. Wah, E. A. K. Zaman, and R. Maskat, The state of the art and taxonomy of big data analytics: View from new big data framework, Artif. Intell. Rev., vol. 53, no. 2, pp. 989–1037, 2020.

[13]

A. Esteva, A. Robicquet, B. Ramsundar, V. Kuleshov, M. DePristo, K. Chou, C. Cui, G. Corrado, S. Thrun, and J. Dean, A guide to deep learning in healthcare, Nat. Med., vol. 25, no. 1, pp. 24–29, 2019.

[14]

V. Chang, An ethical framework for big data and smart cities, Technol. Forecast. Soc. Change, vol. 165, p. 120559, 2021.

[15]

C. Rudin, Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead, Nat. Mach. Intell., vol. 1, no. 5, pp. 206–215, 2019.

[16]
N. Amruthnath and T. Gupta, A research study on unsupervised machine learning algorithms for early fault detection in predictive maintenance, in Proc. 2018 5th Int. Conf. Industrial Engineering and Applications (ICIEA), Singapore, 2018, pp. 355–361.
[17]

L. H. Son and N. D. Tien, Tune up fuzzy C-means for big data: Some novel hybrid clustering algorithms based on initial selection and incremental clustering, Int. J. Fuzzy Syst., vol. 19, no. 5, pp. 1585–1602, 2017.

[18]

A. McAfee and E. Brynjolfsson, Big data: The management revolution, Harv. Bus. Rev., vol. 90, no. 10, pp. 60–68, 2012.

[19]

M. Favaretto, E. De Clercq, C. O. Schneble, and B. S. Elger, What is your definition of Big Data? Researchers’ understanding of the phenomenon of the decade, PLoS One, vol. 15, no. 2, p. e0228987, 2020.

[20]

I. Lee, Big data: Dimensions, evolution, impacts, and challenges, Bus. Horiz., vol. 60, no. 3, pp. 293–303, 2017.

[21]

P. Russom, Big data analytics, TDWI Best Pract. Rep., vol. 19, no. 4, pp. 1–34, 2011.

[22]

W. Y. C. Wang and Y. Wang, Analytics in the era of big data: The digital transformations and value creation in industrial marketing, Ind. Mark. Manag., vol. 86, pp. 12–15, 2020.

[23]

Z. Wang, S. Zaman, S. F. Rasool, Q. U. Zaman, and A. Amin, Exploring the relationships between a toxic workplace environment, workplace stress, and project success with the moderating effect of organizational support: Empirical evidence from Pakistan, Risk Manag. Healthc. Policy, vol. 13, pp. 1055–1067, 2020.

[24]

Y. Cui, S. Kara, and K. C. Chan, Manufacturing big data ecosystem: A systematic literature review, Robot. Comput. Integr. Manuf., vol. 62, p. 101861, 2020.

[25]

J. Cheng, H. Zhang, F. Tao, and C. F. Juang, DT-II: Digital twin enhanced Industrial Internet reference framework towards smart manufacturing, Robot. Comput. Integr. Manuf., vol. 62, p. 101881, 2020.

[26]

M. K. Saggi and S. Jain, A survey towards an integration of big data analytics to big insights for value-creation, Inf. Process. Manag., vol. 54, no. 5, pp. 758–790, 2018.

[27]

J. Joseph and V. Gaba, Organizational structure, information processing, and decision-making: A retrospective and road map for research, Acad. Manag. Ann., vol. 14, no. 1, pp. 267–302, 2020.

[28]

H. Kościelniak and A. Puto, BIG DATA in decision making processes of enterprises, Procedia Comput. Sci., vol. 65, pp. 1052–1058, 2015.

[29]

H. Taherdoost, An overview of trends in information systems: Emerging technologies that transform the information technology industry,Cloud Comput. Data Sci., pp. 1–16, 2022.

[30]

F. Provost and T. Fawcett, Data science and its relationship to big data and data-driven decision making, Big Data, vol. 1, no. 1, pp. 51–59, 2013.

[31]

T. H. Davenport, Analytics 3.0, Harv. Bus. Rev., vol. 91, no. 12, pp. 64–72, 2013.

[32]

Talaoui, Y. and M. Kohtamäki, 35 years of research on business intelligence process: A synthesis of a fragmented literature, Manag. Res. Rev., vol. 44, no. 5, pp. 677–717, 2021.

[33]
J. Vidal-García, M. Vidal, and R. H. Barros, Computational business intelligence, big data, and their role in business decisions in the age of the Internet of Things, in Web Services : Concepts, methodologies, tools, and applications, Information Resources Management Association Ed. Hershey, PA, USA: IGI Global, 2019. pp. 1048–1067.
[34]

D. Sen, M. Ozturk, and O. Vayvay, An overview of big data for growth in SMEs, Procedia Soc. Behav. Sci., vol. 235, pp. 159–167, 2016.

[35]

S. Kumar and K. K. Mohbey, A review on big data based parallel and distributed approaches of pattern mining, J. King Saud Univ. Comput. Inf. Sci., vol. 34, no. 5, pp. 1639–1662, 2022.

[36]

M. Hammer, K. Somers, H. Karre, and C. Ramsauer, Profit per hour as a target process control parameter for manufacturing systems enabled by big data analytics and industry 4.0 infrastructure, Procedia CIRP, vol. 63, pp. 715–720, 2017.

[37]

F. Chiheb, F. Boumahdi, and H. Bouarfa, A new model for integrating big data into phases of decision-making process, Procedia Comput. Sci., vol. 151, no. C, pp. 636–642, 2019.

[38]

R. Iqbal, F. Doctor, B. More, S. Mahmud, and U. Yousuf, Big Data analytics and Computational Intelligence for Cyber-Physical Systems: Recent trends and state of the art applications, Future Gener. Comput. Syst., vol. 105, pp. 766–778, 2020.

[39]

F. Tao, Q. Qi, A. Liu, and A. Kusiak, Data-driven smart manufacturing, J. Manuf. Syst., vol. 48, pp. 157–169, 2018.

[40]

Y. H. Kuo and A. Kusiak, From data to big data in production research: The past and future trends, Int. J. Prod. Res., vol. 57, nos.15&16, pp. 4828–4853, 2019.

[41]

Y. Cheng, K. Chen, H. Sun, Y. Zhang, and F. Tao, Data and knowledge mining with big data towards smart production, J. Ind. Inf. Integr., vol. 9, pp. 1–13, 2018.

[42]

Y. Liu, A. Soroka, L. Han, J. Jian, and M. Tang, Cloud-based big data analytics for customer insight-driven design innovation in SMEs, Int. J. Inf. Manag., vol. 51, p. 102034, 2020.

[43]

S. Nazir, S. Shahzad, Z. Hussain, M. Iqbal, and A. Keerio, Evaluating student grades using analytic network process, Sindh Univ. Res. J. (Sci. Ser. ), vol. 41, no. 1, pp. 97–102, 2016.

[44]

S. Nazir, S. Anwar, S. A. Khan, S. Shahzad, M. Ali, R. Amin, M. Nawaz, P. Lazaridis, and J. Cosmas, Software component selection based on quality criteria using the analytic network process, Abstr. Appl. Anal., vol. 2014, pp. 1–12, 2014.

[45]

S. Safdar, S. Zafar, N. Zafar, and N. F. Khan, Machine learning based decision support systems (DSS) for heart disease diagnosis: a review, Artif. Intell. Rev., vol. 50, no. 4, pp. 597–623, 2018.

[46]

I. Aouadni and A. Rebai, Decision support system based on genetic algorithm and multi-criteria satisfaction analysis (MUSA) method for measuring job satisfaction, Ann. Oper. Res., vol. 256, no. 1, pp. 3–20, 2017.

[47]
I. Petkovics, J. Simon, Á. Petkovics, and Z. Čović, Selection of unmanned aerial vehicle for precision agriculture with multi-criteria decision making algorithm, in Proc. 2017 IEEE 15th Int. Symp. on Intelligent Systems and Informatics (SISY), Subotica, Serbia, 2017, pp. 000151–000156.
[48]

A. Schwenk-Ferrero and A. Andrianov, Nuclear waste management decision-making support with MCDA, Sci. Technol. Nucl. Install., vol. 2017, pp. 1–20, 2017.

[49]
C. Fleig, D. Augenstein, and A. Maedche, Designing a process mining-enabled decision support system for business process standardization in ERP implementation projects, in Proc. 16th Int. Conf. Business Process Management, Sydney, Australia, 2018, pp. 228–244.
[50]
L. S. R. Supriadi and L. Sui Pheng, Knowledge based decision support system (KBDSS), in Business Continuity Management in Construction, L. S. R. Supriadi and L. S. Pheng Eds. Singapore: Springer, 2018, pp. 155–174.
[51]

Z. Sun, L. Sun, and K. Strang, Big data analytics services for enhancing business intelligence, J. Comput. Inf. Syst., vol. 58, no. 2, pp. 162–169, 2018.

[52]

B. K. Sarkar, Big data for secure healthcare system: A conceptual design, Complex Intell. Syst., vol. 3, no. 2, pp. 133–151, 2017.

[53]

J. Zakir, T. Seymour, and K. Berg, Big data analytics, Issue Inf. Syst., vol. 16, no. II, pp. 81–90, 2015.

[54]
G. Kaur, P. Tomar, and P. Singh, Design of cloud-based green IoT architecture for smart cities, in Internet of things and big data analytics toward next-generation intelligence, N. Dey, A. E. Hassanien, C. Bhatt, A. S. Ashour, and S. C. Satapathy Eds. Cham, Switzerland: Springer, 2018, pp. 315–333.
[55]
P. Ongsulee, V. Chotchaung, E. Bamrungsi, and T. Rodcheewit, Big data, predictive analytics and machine learning, in Proc. 2018 16th Int. Conf. ICT and Knowledge Engineering (ICT&KE), Bangkok, Thailand, 2019, pp. 1–6.
[56]

M. Chen, S. Mao, and Y. Liu, Big data: A survey, Mob. Netw. Appl., vol. 19, no. 2, pp. 171–209, 2014.

[57]
P. Domingos, The Master Algorithm: How the Quest for the Ultimate Learning Machine Will Remake Our World. New York, NY, USA: Basic Books, 2015.
[58]

Y. R. Shrestha, S. M. Ben-Menahem, and G. von Krogh, Organizational decision-making structures in the age of artificial intelligence, Calif. Manag. Rev., vol. 61, no. 4, pp. 66–83, 2019.

[59]
F. Provost and T. Fawcett, Data Science for Business: What You Need to Know about Data Mining and Data-Analytic Thinking. Sebastopol, CA, USA: O'Reilly Media, 2013.
[60]

X. Wu, X. Zhu, G. Q. Wu, and W. Ding, Data mining with big data, IEEE Trans. Knowl. Data Eng., vol. 26, no. 1, pp. 97–107, 2014.

[61]

H. Taherdoost, Towards nuts and bolts of conducting literature review: A typology of literature review, Electronics, vol. 12, no. 4, p. 800, 2023.

[62]

P. Wang, J. Liu, and B. Liao, Prediction of sports performance and analysis of influencing factors based on machine learning and big data statistics, J. Sens., vol. 2022, pp. 1–9, 2022.

[63]

S. Soffer, E. Klang, Y. Barash, E. Grossman, and E. Zimlichman, Predicting In-hospital mortality at admission to the medical ward: A big-data machine learning model, Am. J. Med., vol. 134, no. 2, pp. 227–234.E4, 2021.

[64]

G. Xie, Y. Qian, and S. Wang, Forecasting Chinese cruise tourism demand with big data: An optimized machine learning approach, Tour. Manag., vol. 82, p. 104208, 2021.

[65]

M. M. Hosseini, M. Zargoush, F. Alemi, and R. E. Kheirbek, Leveraging machine learning and big data for optimizing medication prescriptions in complex diseases: A case study in diabetes management, J. Big Data, vol. 7, no. 1, pp. 1–24, 2020.

[66]

R. Venkatesh, C. Balasubramanian, and M. Kaliappan, Development of Big Data Predictive Analytics Model for Disease Prediction using Machine learning Technique, J. Med. Syst., vol. 43, no. 8, pp. 1–8, 2019.

[67]

D. Nallaperuma, R. Nawaratne, T. Bandaragoda, A. Adikari, S. Nguyen, T. Kempitiya, D. De Silva, D. Alahakoon, and D. Pothuhera, Online incremental machine learning platform for big data-driven smart traffic management, IEEE Trans. Intell. Transport. Syst., vol. 20, no. 12, pp. 4679–4690, 2019.

[68]

X. Zhong, X. Zhang, and P. Zhang, Pipeline risk big data intelligent decision-making system based on machine learning and situation awareness, Neural Comput. Appl., vol. 34, no. 18, pp. 15221–15239, 2022.

[69]

H. Jamil, T. Umer, C. Ceken, and F. Al-Turjman, Decision based model for real-time IoT analysis using big data and machine learning, Wirel. Pers. Commun., vol. 121, no. 4, pp. 2947–2959, 2021.

[70]

P. B. de Laat, Algorithmic decision-making based on machine learning from big data: Can transparency restore accountability, Philos. Technol., vol. 31, no. 4, pp. 525–541, 2018.

[71]

S. Ji, B. Lee, and M. Y. Yi, Building life-span prediction for life cycle assessment and life cycle cost using machine learning: A big data approach, Build. Environ., vol. 205, p. 108267, 2021.

[72]

S. W. Choi, E. B. Lee, and J. H. Kim, The engineering machine-learning automation platform (EMAP): A big-data-driven AI tool for contractors’ sustainable management solutions for plant projects, Sustainability, vol. 13, no. 18, p. 10384, 2021.

[73]

X. Liu, T. E. Fatoyinbo, N. M. Thomas, W. W. Guan, Y. Zhan, P. Mondal, D. Lagomasino, M. Simard, C. C. Trettin, R. Deo, et al., Large-scale high-resolution coastal mangrove forests mapping across West Africa with machine learning ensemble and satellite big data, Front. Earth Sci., vol. 8, pp. 560933, 2021.

[74]

U. Moorthy and U. D. Gandhi, Medical big data analytics using machine learning algorithms, Int. J. Innov. Technol. Explor. Eng., vol. 9, no. 1, pp. 3517–3526, 2019.

[75]

E. P. EPhzibah and R. Sujatha, Big data management with machine learning inscribed by domain knowledge for health care, Int. J. Eng. Technol., vol. 6, no. 4, p. 98, 2017.

[76]

I. Segura-Bedmar, C. Colón-Ruíz, M. Á. Tejedor-Alonso, and M. Moro-Moro, Predicting of anaphylaxis in big data EMR by exploring machine learning approaches, J. Biomed. Inform., vol. 87, pp. 50–59, 2018.

[77]

C. Ning and F. You, Optimization under uncertainty in the era of big data and deep learning: When machine learning meets mathematical programming, Comput. Chem. Eng., vol. 125, pp. 434–448, 2019.

[78]

H. Wang, Y. Yao, and S. Salhi, Tension in big data using machine learning: Analysis and applications, Technol. Forecast. Soc. Change, vol. 158, p. 120175, 2020.

[79]

F. Sassite, M. Addou, and F. Barramou, A machine learning and multi-agent model to automate big data analytics in smart cities, Int. J. Adv. Comput. Sci. Appl., vol. 13, no. 7, pp. 441–451, 2022.

[80]

D. Bertoni, G. Aletti, D. Cavicchioli, A. Micheletti, and R. Pretolani, Estimating the CAP greening effect by machine learning techniques: A big data ex post analysis, Environ. Sci. Policy, vol. 119, pp. 44–53, 2021.

[81]

A. E. Opperhuizen and K. Schouten, Dynamics and tipping point of issue attention in newspapers: Quantitative and qualitative content analysis at sentence level in a longitudinal study using supervised machine learning and big data, Qual. Quant., vol. 55, no. 1, pp. 19–37, 2021.

[82]

S. Sharma and Y. K. Gupta, Predictive analysis and survey of COVID-19 using machine learning and big data, J. Interdiscip. Math., vol. 24, no. 1, pp. 175–195, 2021.

[83]

A. Kovacs-Györi, A. Ristea, C. Havas, M. Mehaffy, H. H. Hochmair, B. Resch, L. Juhasz, A. Lehner, L. Ramasubramanian, and T. Blaschke, Opportunities and challenges of geospatial analysis for promoting urban livability in the era of big data and machine learning, ISPRS Int. J. Geo-Inf., vol. 9, no. 12, p. 752, 2020.

[84]

T. Nibareke and J. Laassiri, Using Big Data-machine learning models for diabetes prediction and flight delays analytics, J. Big Data, vol. 7, no. 1, pp. 1–18, 2020.

[85]

F. López-Martínez, E. R. Núñez-Valdez, V. García-Díaz, and Z. Bursac, A case study for a big data and machine learning platform to improve medical decision support in population health management, Algorithms, vol. 13, no. 4, p. 102, 2020.

[86]

V. Subha Seethalakshmi, R. Karthigaivel, N. Vengadachalam, and S. Selvakumaran, RETRACTED: Application of Machine Learning and Big Data in Doubly Fed Induction Generator based Stability Analysis of Multi Machine System using Substantial Transformative Optimization Algorithm, Microprocess. Microsyst., vol. 73, p. 102971, 2020.

[87]

W. Gu, K. Foster, J. Shang, and L. Wei, A game-predicting expert system using big data and machine learning, Expert Syst. Appl., vol. 130, pp. 293–305, 2019.

[88]

K. Kalpana, G. S. V. Kumar, and K. Madhavi, Feature selection for machine learning in big data, Int. J. Innov. Technol. Explor. Eng., vol. 8, no. 6S4, pp. 332–335, 2019.

[89]

M. Syafrudin, G. Alfian, N. L. Fitriyani, and J. Rhee, Performance analysis of IoT-based sensor, big data processing, and machine learning model for real-time monitoring system in automotive manufacturing, Sensors, vol. 18, no. 9, p. 2946, 2018.

[90]

A. Achiron, Z. Gur, U. Aviv, A. Hilely, M. Mimouni, L. Karmona, L. Rokach, and I. Kaiserman, Predicting refractive surgery outcome: Machine learning approach with big data, J. Refract. Surg., vol. 33, no. 9, pp. 592–597, 2017.

[91]

C. L. Chi, J. Wang, T. R. Clancy, J. G. Robinson, P. J. Tonellato, and T. J. Adam, Big data cohort extraction to facilitate machine learning to improve statin treatment, West. J. Nurs. Res., vol. 39, no. 1, pp. 42–62, 2017.

Intelligent and Converged Networks
Pages 280-295
Cite this article:
Taherdoost H. Navigating the ethical and privacy concerns of big data and machine learning in decision making. Intelligent and Converged Networks, 2023, 4(4): 280-295. https://doi.org/10.23919/ICN.2023.0023

443

Views

32

Downloads

2

Crossref

2

Scopus

Altmetrics

Received: 21 April 2023
Accepted: 14 June 2023
Published: 30 December 2023
© All articles included in the journal are copyrighted to the ITU and TUP.

This work is available under the CC BY-NC-ND 3.0 IGO license:https://creativecommons.org/licenses/by-nc-nd/3.0/igo/

Return