Klasifikasi Kategori Aduan Masyarakat pada Aplikasi Lapor Gubernur Menggunakan TF-IDF dan Decision Tree
DOI:
https://doi.org/10.55606/jtmei.v5i2.6176Keywords:
Decision Tree, LaporGub, Public Complaints, Text Classification, TF-IDFAbstract
The digitalization of public complaint services through the LaporGub platform in Jawa Tengah has made it easier for citizens to submit aspirations and complaints to local governments. However, the complaint classification process still faces challenges because reports are written in diverse and unstructured free-text formats. This study implements the Decision Tree algorithm to automatically classify public complaint categories in Kabupaten Blora. The dataset consists of 244 complaint records processed using the Term Frequency-Inverse Document Frequency weighting method. To reduce data distribution imbalance, several minority categories were merged, resulting in eight main classes. Model evaluation was conducted using a stratified hold-out method with an 80:20 ratio and 5-fold cross-validation. The testing results achieved an accuracy of 59.18% with a weighted F1-score of 59.79%, while cross-validation produced an average accuracy of 57.35%. In addition, feature importance analysis revealed that words such as “school,” “oil,” “fertilizer,” “agriculture,” and “road” were the dominant factors influencing the classification decisions. Based on these findings, Decision Tree demonstrates good interpretability to support the initial recommendation process for complaint categorization, although its performance is still affected by limited data size and imbalanced class distribution.
Downloads
References
Anugrah, R. R. (2023). Penerapan cosine similarity dan pembobotan TF-IDF untuk klasifikasi pengaduan masyarakat berbasis web (Studi kasus: BAGWASSIDIK Ditreskrimum Polda Kalbar). Coding: Jurnal Komputer dan Aplikasi, 11(1), 100–109. https://doi.org/10.26418/coding.v11i1.55598
Bahrein, M., & Apandi, S. (2025). Perbandingan kinerja Naive Bayes, Support Vector Machine, regresi logistik, dan decision tree untuk klasifikasi sentimen ulasan produk berbasis TF-IDF. Techno. com, 24(4). https://doi.org/10.62411/tc.v24i4.13968
Bishop, C. M., & Nasrabadi, N. M. (2006). Pattern recognition and machine learning (Vol. 4, No. 4, p. 738). Springer.
Charbuty, B., & Abdulazeez, A. (2021). Classification based on decision tree algorithm for machine learning. Journal of Applied Science and Technology Trends, 2(01), 20–28. https://doi.org/10.38094/jastt20165
Christianto, Y. (2025). Klasifikasi tingkat literasi digital menggunakan model SVM dan LSTM dengan ekstraksi fitur TF-IDF dan BERT embedding (Doctoral dissertation, Universitas Islam Negeri Maulana Malik Ibrahim).
Ciang, D. (2023). Klasifikasi toksisitas komentar dengan algoritma Naive Bayes dan decision tree. Jurnal Komputer dan Informatika, 18(1), 31–37.
Garda, K. A., Saraswati, G. W., & Lutfina, E. (2026). Penerapan model deep learning BiLSTM untuk klasifikasi multi-kelas pada data aduan masyarakat. Jurnal INSTEK (Informatika Sains dan Teknologi, 11(1), 52–68.
Gunawan, G., Rahmawati, A., Suhada, S., Hidayatulloh, T., & Wintana, D. (2021). Optimasi linear sampling dan information gain pada algoritma decision tree untuk diagnosis penyakit diabetes. MULTINETICS, 7(2), 124–131. https://doi.org/10.32722/multinetics.v7i2.3796
Han, J., Kamber, M., & Pei, J. (2012). Data mining: Concepts and techniques. Morgan Kaufmann.
Indrajit, R. E., Zainudin, A., & Rudianto, D. (2005). Electronic government in action. Andi Yogyakarta.
Jurafsky, D., & Martin, J. H. (2013). Speech and language processing. Pearson.
Kowsari, K., Meimandi, K. J., Heidarysafa, M., Mendu, S., Barnes, L., & Brown, D. (2019). Text classification algorithms: A survey. Information, 10(4), 150. https://doi.org/10.3390/info10040150
Kusuma, I. G. N. A., Pradipta, I. M., Santosa, I. M. A., & Dharmendra, I. K. (2023). Penanganan ketidakseimbangan data pada klasifikasi pengaduan masyarakat. Jurnal Teknologi Informasi dan Komputer, 9(5). https://doi.org/10.36002/jutik.v9i5.2643
Manning, C. D. (2008). Introduction to information retrieval. Syngress. https://doi.org/10.1017/CBO9780511809071
Minaee, S., Kalchbrenner, N., Cambria, E., Nikzad, N., Chenaghlu, M., & Gao, J. (2021). Deep learning-based text classification: A comprehensive review. ACM Computing Surveys, 54(3), 1–40. https://doi.org/10.1145/3439726
Ningsih, A. (2023). Implementasi SP4N-LAPOR dalam pelayanan pengaduan masyarakat di Kabupaten Luwu Utara Provinsi Sulawesi Selatan (Doctoral dissertation, IPDN). https://doi.org/10.33701/jtkp.v4i2.2877
Oktavianto, H., Sulistyo, H. W., Wijaya, G., Irawan, D., & Abdurrahman, G. (2024). Analisis perbandingan decision tree dan random forest pada klasifikasi teks data kesehatan. Bina Insani ICT Journal, 11(1), 56–65. https://doi.org/10.51211/biict.v11i1.2928
Purba, M., Asri, S. D., Ayumi, V., Salamah, U., & Iryani, L. (2024). Klasifikasi dataset teks pengaduan masyarakat terhadap pemerintah di sosial media menggunakan logistic regression. JSAI (Journal Scientific and Applied Informatics), 7(1), 78–83. https://doi.org/10.36085/jsai.v7i1.6447
Sari, A. N., & Hidayat, R. (2022). Optimasi pembobotan kata TF-IDF untuk klasifikasi keluhan pelanggan menggunakan machine learning. Jurnal Sistem Informasi, 11(2), 45–52.
Srivastava, A. N., & Sahami, M. (Eds.). (2009). Text mining: Classification, clustering, and applications. CRC Press. https://doi.org/10.1201/9781420059458
Wibowo, P., Tarigan, M. F. A., & Siregar, H. F. (2025). Implementasi teknologi mobile e-digital dalam meningkatkan efisiensi pengaduan masyarakat. Jurnal Garuda Pengabdian Kepada Masyarakat, 3(1), 1–10. https://doi.org/10.55537/gabdimas.v3i1.934
Yuliana, R., & Hakim, A. (2026). Machine learning untuk reformasi birokrasi: Studi kasus LaporGub di Jawa Tengah. Jurnal Tata Kelola Digital Nusantara, 3(1), 10–18.
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2026 Sandi Pradananto, Muksan Junaidi

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.






