Natural Language Processing Analysis Of Frequently Used Words On Indonesia Website Names
Main Article Content
Abstract
The increasing of internet use time after time is makes an impact addition of websites, the name of a website must be unique, eye catching and attractive, and in naming a website it should not use spaces, therefore it is often found that the website name consists of several words which are combined. This study aims to determine the most frequently used words on websites in Indonesia. The stages of this research briefly begin with the collection of 10,960 website names, word separation on each website name consisting of several words using Wordninja (one of packages available in Python programming language). The word separation process is carried out in several stages, starting from words containing at least 3 letters to 9 letters. Furthermore, from the word separation stage, ten words that appear most often are sorted. It was found that the word "Indonesia" most often appears at each stage of word separation, which is 139 times. Conclusion of this study is prove that Wordninja were very effective, as evidenced by an accuracy of 97.2%.
Downloads
Article Details
Dacon, J., & Tang, J. (2021). What Truly Matters? Using Linguistic Cues for Analyzing the #BlackLivesMatter Movement and its Counter Protests: 2013 to 2020. https://doi.org/10.5281/zenodo.4056563
Fan, M. (2017). Google Docs as a tool for collaborative writing in the middle school classroom. Journal of Information Technology Education: Research, 16, 391–410. http://www.informingscience.org/Publications/3870
Giannakoulopoulos, A., Pergantis, M., Limniati, L., & Kouretsis, A. (2022). Investigating the Country of Origin and the Role of the .eu TLD in External Trade of European Union Member States. Future Internet 2022, Vol. 14, Page 174, 14(6), 174. https://doi.org/10.3390/FI14060174
Gunawan, T. S., Ashraf, A., Riza, B. S., Haryanto, E. V., Rosnelly, R., Kartiwi, M., & Janin, Z. (2020). Development of video-based emotion recognition using deep learning with Google Colab. TELKOMNIKA (Telecommunication Computing Electronics and Control), 18(5), 2463–2471. https://doi.org/10.12928/TELKOMNIKA.V18I5.16717
Kang, Y., Cai, Z., Tan, C. W., Huang, Q., & Liu, H. (2020). Natural language processing (NLP) in management research: A literature review. Https://Doi.Org/10.1080/23270012.2020.1756939, 7(2), 139–172. https://doi.org/10.1080/23270012.2020.1756939
Kim, T. H., & Reeves, D. (2020). A survey of domain name system vulnerabilities and attacks. Journal of Surveillance, Security and Safety, 1(1), 34–60. https://doi.org/10.20517/JSSS.2020.14
Kuroki, M. (2021). Using Python and Google Colab to teach undergraduate microeconomic theory. International Review of Economics Education, 38, 100225. https://doi.org/10.1016/J.IREE.2021.100225
Legal Regulation of Internet Domain Names in North America by Jacqueline D. Lipton?:: SSRN. (n.d.). Retrieved June 24, 2022, from https://papers.ssrn.com/sol3/papers.cfm?abstract_id=3290646
Mehedi, M., Pritom, A., Schweitzer, K. M., Bateman, R. M., Xu, M., & Xu, S. (2020). Data-Driven Characterization and Detection of COVID-19 Themed Malicious Websites; Data-Driven Characterization and Detection of COVID-19 Themed Malicious Websites. https://doi.org/10.1109/ISI49825.2020.9280522
Satoh, A., Nakamura, Y., Fukuda, Y., Nobayashi, D., & Ikenaga, T. (2021). An approach for identifying malicious domain names generated by dictionary-based DGA bots. IEICE Transactions on Information and Systems, E104.D(5), 669–672. https://doi.org/10.1587/TRANSINF.2020NTL0001
SINAGA, D. (2019). Comparative Study of Cohering Suffix in English and Indonesia. Jurnal Ilmiah Simantek, 3(1). https://www.simantek.sciencemakarioz.org/index.php/JIK/article/view/28
Smits, J. (2020). What does a Domain Name say.
Susilowati, Y. (2019). MODUL E-COMMERCE untuk Siswa Kelas XI Teaching Factory. https://books.google.co.id/books?id=I6LGDwAAQBAJ&pg=PA38&dq=Website+statis+dan+website+dinamis&hl=id&sa=X&ved=0ahUKEwi38LmnnJHpAhVs73MBHVopAXMQ6AEIRDAE#v=onepage&q=Website statis dan website dinamis&f=false
Tock, K. (2020). Google CoLaboratory as a Platform for Python Coding with Students. 2(1), 1–13. https://doi.org/10.32374/rtsre.2019.013.

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.