An Analysis of Public Satisfaction with Government Services: A Multi-Method Approach Using PCA, K-Means Clustering, and Linear Regression
DOI:
https://doi.org/10.46984/pcdj8z43Keywords:
Public Satisfaction, PCA, K-means Clustering, Linear Regression, CSI, Service QualityAbstract
Flawless performance evaluation results across all service dimensions may potentially obscure the identification of areas for improvement and diminish objectivity in decision-making. This study aims to identify the specific service attributes influencing public satisfaction and to segment respondents based on their satisfaction levels at the Office of the Ministry of Religious Affairs in Payakumbuh City. The research integrates Principal Component Analysis (PCA), K-means clustering, and linear regression. PCA was employed to reduce data dimensionality and establish principal components; K-means clustering was utilized to group respondents based on perceptual similarities regarding service quality; and linear regression was applied to identify the most significant factors influencing public satisfaction within each segment. The data were sourced from the Public Service Survey Information System (SISULAP) application of the Payakumbuh Ministry of Religious Affairs, spanning June 2024 to October 2025, with a total of 1,950 respondents. The findings reveal that service process and efficiency are the primary factors influencing all respondent segments, with the low-satisfaction segment identified as the top priority for service improvement. The regression models demonstrate robust performance across all segments. These findings provide an empirical foundation for data-driven policymaking to enhance public service quality.
References
Allgaier, J., & Pryss, R. (2024). Cross-Validation Visualized: A Narrative Guide to Advanced Methods. Machine Learning and Knowledge Extraction, 6(2), 1378–1388. https://doi.org/10.3390/make6020065
Anilshi, A. (2024). Penerapan Algoritma K-Means Clustering Nilai. 720–734.
Bach, A., & Thiel, F. (2024). Collaborative online learning in higher education—quality of digital interaction and associations with individual and group-related factors. Frontiers in Education, 9(November), 1–12. https://doi.org/10.3389/feduc.2024.1356271
Bylemans, J., Everts, T., Brys, R., & Duncan, R. P. (2025). From anarchy to clarity, data pre-processing and statistical choices infsluence quantitative environmental DNA (eDNA) analyses. Methods in Ecology and Evolution, 16(7), 1322–1333. https://doi.org/10.1111/2041-210X.70064
Celestin, P. D. M. (2025). Principal Component Analysis For Simplifying Multivariate Financial Data In Portfolio Risk Analysis. SSRN Electronic Journal, 9(02), 171–179. https://doi.org/10.2139/ssrn.5188109
Chan, T. S. T., & Gibberd, A. (2025). Feasible model-based principal component analysis: Joint estimation of rank and error covariance matrix. Computational Statistics and Data Analysis, 201(July 2024), 108042. https://doi.org/10.1016/j.csda.2024.108042
Channamallu, S. S., Kermanshachi, S., Rosenberger, J. M., Pamidimukkala, A., & Hladik, G. (2025). Determinants of user satisfaction in smart parking applications. Transport Economics and Management, 3(April), 214–221. https://doi.org/10.1016/j.team.2025.05.001
Chitra, J., & Heikal, J. (2024). Customer segmentation using the K-Means Clustering algorithm in Foreign Banks in Indonesia. Indonesia Accounting Research Journal, 11(4), 230–241.
Delaosa, C., Pestana, J., Proudler, I. K., & Weiss, S. (2025). Impact of space–time covariance matrix estimation on bin-wise eigenvalue and eigenspace perturbations. Signal Processing, 233(February). https://doi.org/10.1016/j.sigpro.2025.109946
Deldadehasl, M., Karahroodi, H. H., & Haddadian Nekah, P. (2025). Customer Clustering and Marketing Optimization in Hospitality: A Hybrid Data Mining and Decision-Making Approach from an Emerging Economy. Tourism and Hospitality, 6(2), 1–19. https://doi.org/10.3390/tourhosp6020080
Dulger Altıner, D., Yıkmış, S., Şimşek, M. A., Türkol, M., Tokatlı Demirok, N., & Celik, G. (2024). Impact of Thermosonication Treatment on Parsley Juice: Particle Swarm Algorithm (PSO), Multiple Linear Regression (MLR), and Response Surface Methodology (RSM). ACS Omega, 9(27), 29585–29597. https://doi.org/10.1021/acsomega.4c02749
Fathi, H., Cremona, M. A., & Severino, F. (2025). Selection of functional predictors and smooth coefficient estimation for scalar-on-function regression models. http://arxiv.org/abs/2506.17773
Grünwald, P., Lardy, T., Hao, Y., Bar-Lev, S. K., & de Jong, M. (2024). Optimal E-Values for Exponential Families: the Simple Case. Mdl, 1–28. http://arxiv.org/abs/2404.19465
Guo, G., Song, H., & Zhu, L. (2025). The iterated score regression estimation algorithm for PCA-based missing data with high correlation. Scientific Reports, 15(1), 1–27. https://doi.org/10.1038/s41598-025-93333-6
Jullia, M., & Finatariani, E. (2024). Pengaruh Pertumbuhan Perusahaan, Kepemilikan Manajerial dan Kepemilikan Institusional Terhadap Nilai Perusahaan. AKADEMIK: Jurnal Mahasiswa Humanis, 4(3), 913–923. https://doi.org/10.37481/jmh.v4i3.1024
Kalantan, Z. I., Alharbi, L. S., Al-Zahrani, M. H., & Saleh Binhimd, S. M. (2025). Robust Dimensionality Reduction: A Bootstrap-Based Evaluation of PCA with Applications in Nutritional and Environmental Sciences. Contemporary Mathematics (Singapore), 6(1), 923–942. https://doi.org/10.37256/cm.6120256016
Kotronoulas, G., Miguel, S., Dowling, M., Fernández-Ortega, P., Colomer-Lahiguera, S., Bağçivan, G., Pape, E., Drury, A., Semple, C., Dieperink, K. B., & Papadopoulou, C. (2023). An Overview of the Fundamentals of Data Management, Analysis, and Interpretation in Quantitative Research. Seminars in Oncology Nursing, 39(2), 1–9. https://doi.org/10.1016/j.soncn.2023.151398
Lokanan, M. (2024). Harnessing Exploratory Data Analysis for Robust Financial Fraud Detection and Model Enhancement. Preprint on Research Square / ResearchGate (Article Title: “Harnessing Exploratory Data Analysis for Robust Financial Fraud Detection and Model Enhancement”). https://www.researchsquare.com/article/rs-5635767/v1
Malakar, I., & Nepal, B. (2024). Conceptualizing Explorative Data Analysis in Applied Statistics. Patan Gyansagar, 6(1), 46–63. https://doi.org/10.3126/pg.v6i1.67406
Musfiroh, M., Novitasari, D. C. R., Intan, P. K., & Wisnawa, G. G. (2023). Penerapan Metode Principal Component Analysis (PCA) dan Long Short-Term Memory (LSTM) dalam Memprediksi Prediksi Curah Hujan Harian. Building of Informatics, Technology and Science (BITS), 5(1), 1–11. https://doi.org/10.47065/bits.v5i1.3114
Nguyen, H. M., Ho, T. K. T., & Ngo, T. T. (2024). The impact of service innovation on customer satisfaction and customer loyalty: a case in Vietnamese retail banks. Future Business Journal, 10(1), 1–15. https://doi.org/10.1186/s43093-024-00354-0
Paap, K. R., Anders-Jefferson, R. T., Balakrishnan, N., & Majoubi, J. B. (2024). The many foibles of Likert scales challenge claims that self-report measures of self-control are better than performance-based measures. Behavior Research Methods, 56(2), 908–933. https://doi.org/10.3758/s13428-023-02089-2
Sánchez Vinces, B. V., Schubert, E., Zimek, A., & Cordeiro, R. L. F. (2025). A comparative evaluation of clustering-based outlier detection. In Data Mining and Knowledge Discovery (Vol. 39, Issue 2). Springer US. https://doi.org/10.1007/s10618-024-01086-z
Sansan Yasinta Ar Rohmah, Salsa Sahbani, & Poni Sukaesih. (2025). Public Satisfaction Index of Public Services at Pamarayan Community Health Center, Serang District, Banten Province. Proceeding of International Conference on Business, Economics, Social Sciences, and Humanities, 8, 35–46. https://doi.org/10.34010/icobest.v8i.678
Sarkar, M., Puja, A. R., & Chowdhury, F. R. (2024). Optimizing Marketing Strategies with RFM Method and K-Means Clustering-Based AI Customer Segmentation Analysis. Journal of Business and Management Studies ISSN:, 2709–0876, 8–22. https://doi.org/10.32996/jbms
Setiawan, A., Utami, E., Ariatmanto, D., & others. (2024). Cattle Weight Estimation Using Linear Regression and Random Forest Regressor. Jurnal RESTI (Rekayasa Sistem Dan Teknologi Informasi), 8(1), 72–79.
Tan, W. (2024). Big Data and Data Mining : Techniques for Discovering Hidden Insights. 05(03), 12–15.
Wang, G. (2025). Customer segmentation in the digital marketing using a Q-learning based differential evolution algorithm integrated with K-means clustering. PLoS ONE, 20(2 February), 1–21. https://doi.org/10.1371/journal.pone.0318519
Wang, M., Kang, X., Liang, J., Wang, K., & Wu, Y. (2024). Heteroscedasticity identification and variable selection via multiple quantile regression. Journal of Statistical Computation and Simulation, 94(2), 297–314. https://doi.org/10.1080/00949655.2023.2243533
Wang, Q. (2024). College Employment Recommendation Based on Improved K-Means Clustering and SimRank Algorithm in College Employment Management. IEEE Access, 12(July), 154230–154243. https://doi.org/10.1109/ACCESS.2024.3450965
Yildirim, H. (2024). The Multicollinearity Effect on the Performance of Machine Learning Algorithms : Case Examples in Healthcare Modelling. Academic Platform Journal of Engineering and Smart Systems, 12(3), 68–80.
Zhao, Y., Lang, P., & Wang, H. (2025). Dynamic study on the influencing factors and spatial and temporal evolution of cross-travel integration in Guangxi based on multiple regression computational analysis. International Journal for Housing Science and Its Applications, 46(3), 2371–2381. https://doi.org/10.70517/ijhsa463198
Zhu, Y., Jiang, W., & Alonso, G. (2024). Efficient Tabular Data Preprocessing of ML Pipelines. http://arxiv.org/abs/2409.14912
Downloads
Published
Issue
Section
License
Copyright (c) 2026 Abuzar Gafari, Sarjon Defit, Rini Sovia

This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors retain all their rights to the published works, such as (but not limited to) the following rights; Copyright and other proprietary rights relating to the article, such as patent rights, The right to use the substance of the article in own future works, including lectures and books, The right to reproduce the article for own purposes, The right to self-archive the article





