Main Article Content

Novianti Madhona Faizah
Luky Fabrianto
Widyat Nurcahyo
Herlina Trisnawati

Abstract

The increasing of internet use time after time is makes an impact addition of websites, the name of a website must be unique, eye catching and attractive, and in naming a website it should not use spaces, therefore it is often found that the website name consists of several words which are combined. This study aims to determine the most frequently used words on websites in Indonesia. The stages of this research briefly begin with the collection of 10,960 website names, word separation on each website name consisting of several words using Wordninja (one of packages available in Python programming language). The word separation process is carried out in several stages, starting from words containing at least 3 letters to 9 letters. Furthermore, from the word separation stage, ten words that appear most often are sorted. It was found that the word "Indonesia" most often appears at each stage of word separation, which is 139 times. Conclusion of this study is prove that Wordninja were very effective, as evidenced by an accuracy of 97.2%.

Downloads

Download data is not yet available.

Article Details

How to Cite
Novianti Madhona Faizah, Fabrianto, L., Widyat Nurcahyo and Herlina Trisnawati (2022) “Natural Language Processing Analysis Of Frequently Used Words On Indonesia Website Names”, Jurnal Mantik, 6(3), pp. 3649-3656. Available at: https://iocscience.org/ejournal/index.php/mantik/article/view/2917 (Accessed: 10May2026).
References
Canesche, M., Bragança, L., Neto, O. P. V., Nacif, J. A., & Ferreira, R. (2021). Google Colab CAD4U: Hands-on cloud laboratories for digital design. Proceedings - IEEE International Symposium on Circuits and Systems, 2021-May. https://doi.org/10.1109/ISCAS51556.2021.9401151
Dacon, J., & Tang, J. (2021). What Truly Matters? Using Linguistic Cues for Analyzing the #BlackLivesMatter Movement and its Counter Protests: 2013 to 2020. https://doi.org/10.5281/zenodo.4056563
Fan, M. (2017). Google Docs as a tool for collaborative writing in the middle school classroom. Journal of Information Technology Education: Research, 16, 391–410. http://www.informingscience.org/Publications/3870
Giannakoulopoulos, A., Pergantis, M., Limniati, L., & Kouretsis, A. (2022). Investigating the Country of Origin and the Role of the .eu TLD in External Trade of European Union Member States. Future Internet 2022, Vol. 14, Page 174, 14(6), 174. https://doi.org/10.3390/FI14060174
Gunawan, T. S., Ashraf, A., Riza, B. S., Haryanto, E. V., Rosnelly, R., Kartiwi, M., & Janin, Z. (2020). Development of video-based emotion recognition using deep learning with Google Colab. TELKOMNIKA (Telecommunication Computing Electronics and Control), 18(5), 2463–2471. https://doi.org/10.12928/TELKOMNIKA.V18I5.16717
Kang, Y., Cai, Z., Tan, C. W., Huang, Q., & Liu, H. (2020). Natural language processing (NLP) in management research: A literature review. Https://Doi.Org/10.1080/23270012.2020.1756939, 7(2), 139–172. https://doi.org/10.1080/23270012.2020.1756939
Kim, T. H., & Reeves, D. (2020). A survey of domain name system vulnerabilities and attacks. Journal of Surveillance, Security and Safety, 1(1), 34–60. https://doi.org/10.20517/JSSS.2020.14
Kuroki, M. (2021). Using Python and Google Colab to teach undergraduate microeconomic theory. International Review of Economics Education, 38, 100225. https://doi.org/10.1016/J.IREE.2021.100225
Legal Regulation of Internet Domain Names in North America by Jacqueline D. Lipton?:: SSRN. (n.d.). Retrieved June 24, 2022, from https://papers.ssrn.com/sol3/papers.cfm?abstract_id=3290646
Mehedi, M., Pritom, A., Schweitzer, K. M., Bateman, R. M., Xu, M., & Xu, S. (2020). Data-Driven Characterization and Detection of COVID-19 Themed Malicious Websites; Data-Driven Characterization and Detection of COVID-19 Themed Malicious Websites. https://doi.org/10.1109/ISI49825.2020.9280522
Satoh, A., Nakamura, Y., Fukuda, Y., Nobayashi, D., & Ikenaga, T. (2021). An approach for identifying malicious domain names generated by dictionary-based DGA bots. IEICE Transactions on Information and Systems, E104.D(5), 669–672. https://doi.org/10.1587/TRANSINF.2020NTL0001
SINAGA, D. (2019). Comparative Study of Cohering Suffix in English and Indonesia. Jurnal Ilmiah Simantek, 3(1). https://www.simantek.sciencemakarioz.org/index.php/JIK/article/view/28
Smits, J. (2020). What does a Domain Name say.
Susilowati, Y. (2019). MODUL E-COMMERCE untuk Siswa Kelas XI Teaching Factory. https://books.google.co.id/books?id=I6LGDwAAQBAJ&pg=PA38&dq=Website+statis+dan+website+dinamis&hl=id&sa=X&ved=0ahUKEwi38LmnnJHpAhVs73MBHVopAXMQ6AEIRDAE#v=onepage&q=Website statis dan website dinamis&f=false
Tock, K. (2020). Google CoLaboratory as a Platform for Python Coding with Students. 2(1), 1–13. https://doi.org/10.32374/rtsre.2019.013.