PENERAPAN PRINCIPAL COMPONENT ANALYSIS (PCA) DALAM PENENTUAN FAKTOR DOMINAN YANG MEMPENGARUHI PENGIDAP KANKER SERVIKS (Studi Kasus : Cervical Cancer Dataset)
Main Article Content
Abstract
Cancer diagnosis is a frightening issue for patients and can affect the patient's psychological condition. Therefore, by giving attention and psychosocial support to cancer patients it is expected to overcome the psychological pressure of the patient. Screening is an early detection effort to identify a disease or disorder that is clinically unclear by using certain tests, examinations or procedures. This effort can be used quickly to distinguish people who seem healthy but actually suffer from an abnormality. The purpose of this study is to simplify and eliminate some less relevant screening without reducing the intent and purpose of the original data using the Principal Component Analysis (PCA). Based on the results of research conducted from the UCI Cervical Cancer dataset repository, shows that there are 9 dominant screening variables that have a large enough correlation to the formation of early detection of cervical cancer with a proportion of 99% of covariance variance, including the 3 highest factors very dominant, namely the age factor with the first highest eigenvalue that is 76.05 with the proportion of variance 58.50%, cigarette addicts with 14.90% variance and Hormonal Contraceptives factor with 9.3% variance. The total variance obtained from the 9 screening variables is 99%.
Keywords: Cervical Cancer, Screening, Principal Component Analysis, Eigenvalue, Factor Analysis
Downloads
Article Details
[2] Dang, T., Pham, T., Tran, H. & Van, Q. 2016. Using Dimension Reduction with Feature Selection to Enhance Accuracy of Tumor Classification. International Conference on Biomedical Engineering, pp.14-17.
[3] Fernandes, K., J.S. Cardoso, and J. Fernandes., 2017,” Transfer Learning with Partial Observability Applied
to Cervical Cancer Screening” In Iberian Conference on Pattern Recognition and Image Analysis, Springer.
[4] Han, J., Kamber, M. & Pei, J. 2012. Data Mining: Concepts and Techniques. 3rd Edition. Morgan Kaufmann Publishers: San Francisco.
[5] Hussain, H., Quazilbash. N.Z., Bai. S. & Khoja, S. 2015. Reduction of Variables for Predicting Breast Cancer Survivability Using Principal Component Analysis. International Conference on Computer-Based Medical Systems, pp. 131-134.
[6] https://archive.ics.uci.edu/ml, Center for Machine Learning and Intelligent Systems, University of California, Department of Information and Computer Science. 1998
[7] http://www.depkes.go.id/folder/view/01/structure-publikasi-pusdatin-info-datin.html
(Online) Diakses : tanggal 29 Mei 2019
[8] Jolliffe, I.T. 2002. Principal Component Analysis. 2nd Edition. Springer-Verlag: New York.
[9] Johnson, W.A. & Wichern, D.W. 2007. Applied Multivariate Statistical Analysis. 6th Edition. Pearson Prentice Hall: New Jersey.
[10] Kementerian Kesehatan RI. 2015. Buletin Jendela Data dan Informasi Kesehatan. ISSN 2088 – 270X
[11] Kotu, V. & Deshpande, B. 2015. Predictive Analytics and Data Mining. Morgan Kaufmann Publisher: San Francisco.
[12] Mitchell, T. M. 1997. Machine Learning. McGraw-Hill Science: New York.
[13] Maaten, L., Postma, E. & Herik, J. 2009. Dimensionality Reduction: A Comparative Review. (Online) http://www.uvt.nl/ticc (17 Agustus 2017).
[14] Wahidin, M. 2015. Deteksi Dini Kanker Leher Rahim di Indonesia.
(Online) http://www.pusdatin.kemkes.go.id (Diakses : tanggal 29 Mei 2019).
[15] Witten, I.H. & Frank, E. 2005. Data Mining Practical Machine Learning Tools and Techniques. 2nd Edition. Morgan Kaufmann Publishers: San Francisco.