Cluster Text Random Opinion Tweet In Yogyakarta Using Automatic Clustering

Authors

  • Rabiatul Adawiyah Politeknik Pratama

DOI:

https://doi.org/10.55606/juprit.v2i1.1194

Keywords:

Cluster, Tweet, K-Means, Automatic Clustering, Data, Opinion

Abstract

Tweet Besides making computations difficult, the data obtained is also inefficient and complicated to interpret. Therefore, it is necessary to explore how to overcome these problems. This study proposes an approach to find the global optimum and make automatic grouping by analyzing moving averages, namely K-Means Automatic Clustering. So the purpose of this study was to explore and evaluate high-dimensional data from a collection of tweets, namely random opinion text tweets in Yogyakarta. The K-means Automatic Clustering algorithm is used for clusters based on the data attributes that have been obtained. Pre-processing experiments were carried out among others. Cleansing, Case folding, Tokenizing, Filtering, Stemming. Then look for the variance cluster to find the global optimum as an ideal cluster by identifying the moving variance by placing λ as the threshold (Global Optimum). So that the ideal cluster value is 0.332975. That is, the closer the cluster value obtained to number 1, the more the cluster search finds the optimum point. This research can be utilized in exploring and evaluating high-dimensional data, so that it becomes a consideration in providing approximate patterns from unstructured data sets with Visualization.

References

“Media sosial - Wikipedia bahasa Indonesia, ensiklopedia bebas”.

“Mengapa banyak orang yang nyaman curhat di twitter? - Sosial : Diskusi Komunikasi - Dictio Community”.

F. Khairani, A. Kurnia, M. N. Aidi, and S. Pramana, “Predictions of Indonesia Economic Phenomena Based on Online News Using Random Forest,” SinkrOn, vol. 7, no. 2, pp. 532–540, Apr. 2022, doi: 10.33395/sinkron.v7i2.11401.

M. F. Tyas, A. Kurnia, and A. M. Soleh, “TEXT CLUSTERING ONLINE LEARNING OPINION DURING COVID-19 PANDEMIC IN INDONESIA USING TWEETS,” BAREKENG: Jurnal Ilmu Matematika dan Terapan, vol. 16, no. 3, pp. 939–948, Sep. 2022, doi: 10.30598/barekengvol16iss3pp939-948.

K. N. Aini, H. Murfi, K. Nur’aini, I. Najahaty, L. Hidayati, and S. Nurrohmah, “Combination of Singular Value Decomposition and K-means Clustering Methods for Topic Detection on Twitter”, doi: 10.13140/RG.2.1.4081.2886.

“Elementary Survey Sampling, 7th ed.”.

K. E. Setiawan, A. Kurniawan, A. Chowanda, and D. Suhartono, “Clustering models for hospitals in Jakarta using fuzzy c-means and k-means,” Procedia Comput Sci, vol. 216, pp. 356–363, 2023, doi: 10.1016/j.procs.2022.12.146.

C. A. Murthy and N. Chowdhury, “In search of optimal clusters using genetic algorithms,” 1996.

“‎ridho.lecturer.pens.ac.id: papers: Barakbah_IES_2004”.

M. Alfian, A. Ridho Barakbah, and I. Winarno, “INTERNATIONAL JOURNAL ON INFORMATICS VISUALIZATION journal homepage : www.joiv.org/index.php/joiv INTERNATIONAL JOURNAL ON INFORMATICS VISUALIZATION Indonesian Online News Extraction and Clustering Using Evolving Clustering.” [Online]. Available: www.joiv.org/index.php/joiv

Downloads

Published

2023-02-06

How to Cite

Rabiatul Adawiyah. (2023). Cluster Text Random Opinion Tweet In Yogyakarta Using Automatic Clustering. Jurnal Penelitian Rumpun Ilmu Teknik, 2(1), 73–89. https://doi.org/10.55606/juprit.v2i1.1194