Text Mining Application With K-Means Clustering to Identify Sentiments and Popular Topics: A Case Study of The Three Largest Online Marketplaces in Indonesia

Andree E Widjaja, Andy Fransisko, Calandra Alencia Haryani, Hery Hery

Abstract


The number of internets and social media users, which continues to increase at a very fast rate, has resulted in the emergence of new business opportunities in Indonesia. One of those indications is the emergence of marketplace companies in Indonesia. The presence of these online marketplaces provides people with more online marketplace choices according to their preferences. One of the factors that became the basis for this election was reading comments or reviews from consumers on the marketplace posted on social media. This research was conducted using text mining method with k-means clustering algorithm to systematically identify the sentiments and topics that are widely discussed by online marketplace consumers in Indonesia. The data was processed by the k-means algorithm in the form of comments or reviews from three online marketplaces (Tokopedia, Shopee and Bukalapak) on Twitter. The amount of data for each marketplace referred to was 1500 data tweets. The results showed that the three online marketplaces were associated to different topics, even though they are in the same industry. These differences arise due to the fact that most consumers discuss the topics of programs held by their respective online marketplaces. The main topics related to Tokopedia are “belanja” (“shopping”) and “terimakasih” (“thank you”); while for Shopee “pilih” (“choose”) and “jongho”, and for Bukalapak “pra-kerja” (“pre-employment”). In addition, the sentiment analysis carried out shows that the sentiment of the three online marketplaces is predominantly neutral.

Article Metrics

Abstract: 140 Viewers PDF: 137 Viewers

Keywords


Internet usage, social media users, marketplace companies, sentiment analysis, online marketplaces

Full Text:

PDF


Refbacks

  • There are currently no refbacks.



Barcode

Journal of Applied Data Sciences

ISSN : 2723-6471 (Online)
Organized by : Computer Science and Systems Information Technology, King Abdulaziz University, Kingdom of Saudi Arabia.
Website : http://bright-journal.org/JADS
Email : taqwa@amikompurwokerto.ac.id (principal contact)
    support@bright-journal.org (technical issues)

 This work is licensed under a Creative Commons Attribution-ShareAlike 4.0