AI Chat Paper
Note: Please note that the following content is generated by AMiner AI. SciOpen does not take any responsibility related to this content.
{{lang === 'zh_CN' ? '文章概述' : 'Summary'}}
{{lang === 'en_US' ? '中' : 'Eng'}}
Chat more with AI
PDF (723.7 KB)
Collect
Submit Manuscript AI Chat Paper
Show Outline
Outline
Show full outline
Hide outline
Outline
Show full outline
Hide outline
Research paper | Open Access

Anomaly data management and big data analytics: an application on disability datasets

Zhiwen Pan1( )Wen Ji1Yiqiang Chen1Lianjun Dai2Jun Zhang2
Institute of Computing Technology Chinese Academy of Sciences, Beijing, China
Information Centre of China Disabled Persons’ Federation, Beijing, China
Show Author Information

Abstract

Purpose

The disability datasets are the datasets that contain the information of disabled populations. By analyzing these datasets, professionals who work with disabled populations can have a better understanding of the inherent characteristics of the disabled populations, so that working plans and policies, which can effectively help the disabled populations, can be made accordingly.

Design/methodology/approach

In this paper, the authors proposed a big data management and analytic approach for disability datasets.

Findings

By using a set of data mining algorithms, the proposed approach can provide the following services. The data management scheme in the approach can improve the quality of disability data by estimating miss attribute values and detecting anomaly and low-quality data instances. The data mining scheme in the approach can explore useful patterns which reflect the correlation, association and interactional between the disability data attributes. Experiments based on real-world dataset are conducted at the end to prove the effectiveness of the approach.

Originality/value

The proposed approach can enable data-driven decision-making for professionals who work with disabled populations.

References

 

Akyildiz, I.F., Su, W., Sankarasubramaniam, Y. and Cayirci, E. (2002), “Wireless sensor networks: a survey”, Comm. ACM, Vol. 38 No. 4, pp. 393-422.

 
Borgelt, C. (2005), “An implementation of the FP-growth algorithm”, In Proceedings of the 1st international workshop on open source data mining: frequent pattern mining implementations. ACM, pp. 1-5.https://doi.org/10.1145/1133905.1133907
 
Fuji, R. and Matsumoto, K. (2018), Emotion Analysis on Social Big Data [J/OL], ZTECommunications. pp. 10-12, available at: http://kns.cnki.net/kcms/detail/34.1294.TN.20180105.1738.004.html
 

Hoffman, S. (2017), “Big data and the americans with disabilities act”, Social Science Electronic Publishing, Vol. 68 No. 4, pp. 777-793, available at: www.acm.org/class/how_to_use.html

 

Janssen, M., van der Voort, H. and Wahyudi, A. (2017), “Factors influencing big data decision-making quality”, Journal of Business Research, Vol. 70, pp. 338-345.

 
LIU, A. (2014), “Big data and its application in tourism industry: a theoretical review and analysis[a]”,Proceedings of the International Conference on Management and Engineering (CME 2014), 7.
 
Longadge, R. and Dongre, S. (2013), “Class imbalance problem in data mining review”, arXiv preprint arXiv: 1305.1707.
 

Mcdermott, S. and Turk, M.A. (2015), “What are the implications of the big data paradigm shift for disability and health?[J]”,Disability and Health Journal, Vol. 8 No. 3, pp. 303-304.

 
Mckinley, S. and Levine, M. (2007), “Cubic spline interpolation”, Methods of Shape-Preserving Spline Approximation, pp. 37-59.
 

Ma, M.X., Ngan, H.Y.T. and Liu, W. (2016), “Density-based outlier detection by local outlier factor on largescale traffic data”, Electronic Imaging, Vol. 2016 No. 14.

 
Nambisan, P., Luo, Z., Kapoor, A., Patrick, T.B. and Cisler, R.A. (2015), Social Media, Big Data, and Public Health Informatics: Ruminating Behavior of Depression Revealed through Twitter[C]//HI International Conference on System Sciences. IEEE, pp. 2906-2913.https://doi.org/10.1109/HICSS.2015.351
 

Olive, D.J. (2013), “Linear regression analysis”, Technometrics, Vol. 45 No. 4, pp. 362-363, available at: http://doi.acm.org/10.1145/503376.503378

 

Salzberg, S.L. (1994), Machine Learning, Vol. 16 No. 3, p. 235, available at: https://doi.org/10.1007/BF00993309

 

Wold, S., Esbensen, K. and Geladi, P. (1987), “Principal component analysis”, Chemometrics and Intelligent Laboratory Systems, Vol. 2 Nos 1/3, pp. 37-52.

International Journal of Crowd Science
Pages 164-176
Cite this article:
Pan Z, Ji W, Chen Y, et al. Anomaly data management and big data analytics: an application on disability datasets. International Journal of Crowd Science, 2018, 2(2): 164-176. https://doi.org/10.1108/IJCS-09-2018-0020

496

Views

15

Downloads

3

Crossref

4

Scopus

Altmetrics

Received: 08 September 2018
Revised: 12 October 2018
Accepted: 13 October 2018
Published: 13 November 2018
© The author(s)

Zhiwen Pan, Wen Ji, Yiqiang Chen, Lianjun Dai and Jun Zhang. Published in International Journal of Crowd Science. Published by Emerald Publishing Limited. This article is published under the Creative Commons Attribution (CC BY 4.0) licence. Anyone may reproduce, distribute, translate and create derivative works of this article (for both commercial and non-commercial purposes), subject to full attribution to the original publication and authors. The full terms of this licence may be seen at http://creativecommons.org/licences/by/4.0/legalcode

Return