An Analysis of Public Satisfaction with Government Services: A Multi-Method Approach Using PCA, K-Means Clustering, and Linear Regression

Authors

  • Abuzar Gafari Magister Teknik Informatika, Universitas Putra Indonesia YPTK Padang
  • Sarjon Defit Magister Teknik Informatika, Universitas Putra Indonesia YPTK Padang
  • Rini Sovia Magister Teknik Informatika, Universitas Putra Indonesia YPTK Padang

DOI:

https://doi.org/10.46984/pcdj8z43

Keywords:

Public Satisfaction, PCA, K-means Clustering, Linear Regression, CSI, Service Quality

Abstract

Flawless performance evaluation results across all service dimensions may potentially obscure the identification of areas for improvement and diminish objectivity in decision-making. This study aims to identify the specific service attributes influencing public satisfaction and to segment respondents based on their satisfaction levels at the Office of the Ministry of Religious Affairs in Payakumbuh City. The research integrates Principal Component Analysis (PCA), K-means clustering, and linear regression. PCA was employed to reduce data dimensionality and establish principal components; K-means clustering was utilized to group respondents based on perceptual similarities regarding service quality; and linear regression was applied to identify the most significant factors influencing public satisfaction within each segment. The data were sourced from the Public Service Survey Information System (SISULAP) application of the Payakumbuh Ministry of Religious Affairs, spanning June 2024 to October 2025, with a total of 1,950 respondents. The findings reveal that service process and efficiency are the primary factors influencing all respondent segments, with the low-satisfaction segment identified as the top priority for service improvement. The regression models demonstrate robust performance across all segments. These findings provide an empirical foundation for data-driven policymaking to enhance public service quality.

References

Allgaier, J., & Pryss, R. (2024). Cross-Validation Visualized: A Narrative Guide to Advanced Methods. Machine Learning and Knowledge Extraction, 6(2), 1378–1388. https://doi.org/10.3390/make6020065

Anilshi, A. (2024). Penerapan Algoritma K-Means Clustering Nilai. 720–734.

Bach, A., & Thiel, F. (2024). Collaborative online learning in higher education—quality of digital interaction and associations with individual and group-related factors. Frontiers in Education, 9(November), 1–12. https://doi.org/10.3389/feduc.2024.1356271

Bylemans, J., Everts, T., Brys, R., & Duncan, R. P. (2025). From anarchy to clarity, data pre-processing and statistical choices infsluence quantitative environmental DNA (eDNA) analyses. Methods in Ecology and Evolution, 16(7), 1322–1333. https://doi.org/10.1111/2041-210X.70064

Celestin, P. D. M. (2025). Principal Component Analysis For Simplifying Multivariate Financial Data In Portfolio Risk Analysis. SSRN Electronic Journal, 9(02), 171–179. https://doi.org/10.2139/ssrn.5188109

Chan, T. S. T., & Gibberd, A. (2025). Feasible model-based principal component analysis: Joint estimation of rank and error covariance matrix. Computational Statistics and Data Analysis, 201(July 2024), 108042. https://doi.org/10.1016/j.csda.2024.108042

Channamallu, S. S., Kermanshachi, S., Rosenberger, J. M., Pamidimukkala, A., & Hladik, G. (2025). Determinants of user satisfaction in smart parking applications. Transport Economics and Management, 3(April), 214–221. https://doi.org/10.1016/j.team.2025.05.001

Chitra, J., & Heikal, J. (2024). Customer segmentation using the K-Means Clustering algorithm in Foreign Banks in Indonesia. Indonesia Accounting Research Journal, 11(4), 230–241.

Delaosa, C., Pestana, J., Proudler, I. K., & Weiss, S. (2025). Impact of space–time covariance matrix estimation on bin-wise eigenvalue and eigenspace perturbations. Signal Processing, 233(February). https://doi.org/10.1016/j.sigpro.2025.109946

Deldadehasl, M., Karahroodi, H. H., & Haddadian Nekah, P. (2025). Customer Clustering and Marketing Optimization in Hospitality: A Hybrid Data Mining and Decision-Making Approach from an Emerging Economy. Tourism and Hospitality, 6(2), 1–19. https://doi.org/10.3390/tourhosp6020080

Dulger Altıner, D., Yıkmış, S., Şimşek, M. A., Türkol, M., Tokatlı Demirok, N., & Celik, G. (2024). Impact of Thermosonication Treatment on Parsley Juice: Particle Swarm Algorithm (PSO), Multiple Linear Regression (MLR), and Response Surface Methodology (RSM). ACS Omega, 9(27), 29585–29597. https://doi.org/10.1021/acsomega.4c02749

Fathi, H., Cremona, M. A., & Severino, F. (2025). Selection of functional predictors and smooth coefficient estimation for scalar-on-function regression models. http://arxiv.org/abs/2506.17773

Grünwald, P., Lardy, T., Hao, Y., Bar-Lev, S. K., & de Jong, M. (2024). Optimal E-Values for Exponential Families: the Simple Case. Mdl, 1–28. http://arxiv.org/abs/2404.19465

Guo, G., Song, H., & Zhu, L. (2025). The iterated score regression estimation algorithm for PCA-based missing data with high correlation. Scientific Reports, 15(1), 1–27. https://doi.org/10.1038/s41598-025-93333-6

Jullia, M., & Finatariani, E. (2024). Pengaruh Pertumbuhan Perusahaan, Kepemilikan Manajerial dan Kepemilikan Institusional Terhadap Nilai Perusahaan. AKADEMIK: Jurnal Mahasiswa Humanis, 4(3), 913–923. https://doi.org/10.37481/jmh.v4i3.1024

Kalantan, Z. I., Alharbi, L. S., Al-Zahrani, M. H., & Saleh Binhimd, S. M. (2025). Robust Dimensionality Reduction: A Bootstrap-Based Evaluation of PCA with Applications in Nutritional and Environmental Sciences. Contemporary Mathematics (Singapore), 6(1), 923–942. https://doi.org/10.37256/cm.6120256016

Kotronoulas, G., Miguel, S., Dowling, M., Fernández-Ortega, P., Colomer-Lahiguera, S., Bağçivan, G., Pape, E., Drury, A., Semple, C., Dieperink, K. B., & Papadopoulou, C. (2023). An Overview of the Fundamentals of Data Management, Analysis, and Interpretation in Quantitative Research. Seminars in Oncology Nursing, 39(2), 1–9. https://doi.org/10.1016/j.soncn.2023.151398

Lokanan, M. (2024). Harnessing Exploratory Data Analysis for Robust Financial Fraud Detection and Model Enhancement. Preprint on Research Square / ResearchGate (Article Title: “Harnessing Exploratory Data Analysis for Robust Financial Fraud Detection and Model Enhancement”). https://www.researchsquare.com/article/rs-5635767/v1

Malakar, I., & Nepal, B. (2024). Conceptualizing Explorative Data Analysis in Applied Statistics. Patan Gyansagar, 6(1), 46–63. https://doi.org/10.3126/pg.v6i1.67406

Musfiroh, M., Novitasari, D. C. R., Intan, P. K., & Wisnawa, G. G. (2023). Penerapan Metode Principal Component Analysis (PCA) dan Long Short-Term Memory (LSTM) dalam Memprediksi Prediksi Curah Hujan Harian. Building of Informatics, Technology and Science (BITS), 5(1), 1–11. https://doi.org/10.47065/bits.v5i1.3114

Nguyen, H. M., Ho, T. K. T., & Ngo, T. T. (2024). The impact of service innovation on customer satisfaction and customer loyalty: a case in Vietnamese retail banks. Future Business Journal, 10(1), 1–15. https://doi.org/10.1186/s43093-024-00354-0

Paap, K. R., Anders-Jefferson, R. T., Balakrishnan, N., & Majoubi, J. B. (2024). The many foibles of Likert scales challenge claims that self-report measures of self-control are better than performance-based measures. Behavior Research Methods, 56(2), 908–933. https://doi.org/10.3758/s13428-023-02089-2

Sánchez Vinces, B. V., Schubert, E., Zimek, A., & Cordeiro, R. L. F. (2025). A comparative evaluation of clustering-based outlier detection. In Data Mining and Knowledge Discovery (Vol. 39, Issue 2). Springer US. https://doi.org/10.1007/s10618-024-01086-z

Sansan Yasinta Ar Rohmah, Salsa Sahbani, & Poni Sukaesih. (2025). Public Satisfaction Index of Public Services at Pamarayan Community Health Center, Serang District, Banten Province. Proceeding of International Conference on Business, Economics, Social Sciences, and Humanities, 8, 35–46. https://doi.org/10.34010/icobest.v8i.678

Sarkar, M., Puja, A. R., & Chowdhury, F. R. (2024). Optimizing Marketing Strategies with RFM Method and K-Means Clustering-Based AI Customer Segmentation Analysis. Journal of Business and Management Studies ISSN:, 2709–0876, 8–22. https://doi.org/10.32996/jbms

Setiawan, A., Utami, E., Ariatmanto, D., & others. (2024). Cattle Weight Estimation Using Linear Regression and Random Forest Regressor. Jurnal RESTI (Rekayasa Sistem Dan Teknologi Informasi), 8(1), 72–79.

Tan, W. (2024). Big Data and Data Mining : Techniques for Discovering Hidden Insights. 05(03), 12–15.

Wang, G. (2025). Customer segmentation in the digital marketing using a Q-learning based differential evolution algorithm integrated with K-means clustering. PLoS ONE, 20(2 February), 1–21. https://doi.org/10.1371/journal.pone.0318519

Wang, M., Kang, X., Liang, J., Wang, K., & Wu, Y. (2024). Heteroscedasticity identification and variable selection via multiple quantile regression. Journal of Statistical Computation and Simulation, 94(2), 297–314. https://doi.org/10.1080/00949655.2023.2243533

Wang, Q. (2024). College Employment Recommendation Based on Improved K-Means Clustering and SimRank Algorithm in College Employment Management. IEEE Access, 12(July), 154230–154243. https://doi.org/10.1109/ACCESS.2024.3450965

Yildirim, H. (2024). The Multicollinearity Effect on the Performance of Machine Learning Algorithms : Case Examples in Healthcare Modelling. Academic Platform Journal of Engineering and Smart Systems, 12(3), 68–80.

Zhao, Y., Lang, P., & Wang, H. (2025). Dynamic study on the influencing factors and spatial and temporal evolution of cross-travel integration in Guangxi based on multiple regression computational analysis. International Journal for Housing Science and Its Applications, 46(3), 2371–2381. https://doi.org/10.70517/ijhsa463198

Zhu, Y., Jiang, W., & Alonso, G. (2024). Efficient Tabular Data Preprocessing of ML Pipelines. http://arxiv.org/abs/2409.14912

Downloads

Published

2026-06-30

Issue

Section

Articles

How to Cite

“An Analysis of Public Satisfaction with Government Services: A Multi-Method Approach Using PCA, K-Means Clustering, and Linear Regression” (2026) Sebatik, 30(1), pp. 1–8. doi:10.46984/pcdj8z43.