MASHINALI TARJIMA TIZIMLARI UCHUN O‘ZBEK TILI SO‘ZLARINI LEMMALASH VA GAP STRUKTURALARINI TAHLIL QILISH
Keywords:
Machine translation, word, lemma, lemmatization, sentence, sentence fragments, sentence types.Abstract
Translation enables effective communication between people all over the world. Machine translation is the automatic translation of natural language from one language to another, preserving the meaning. This article provides information on Uzbek sentence structure and Uzbek word lemmatization for machine translation systems, and the importance of lemmatization for translation. There are several types of machine translation, the use of lemmatization specifically for those types is mentioned and examples are given.
References
M. Sharipov and O. Sobirov, ‘Development of a Rule-Based Lemmatization Algorithm Through Finite State Machine for Uzbek Language’, in CEUR Workshop Proceedings, V. J. and K. B., Eds., CEUR-WS, 2022, pp. 154 – 159. [Online]. Available: https://www.scopus.com/inward/record.uri?eid=2-s2.0- 85146112590&partnerID=40&md5=e1080c39d101c0e351cfed1a8228d391
U. Salaev, E. Kuriyozov, and C. Gómez-Rodríguez, ‘A Machine Transliteration Tool Between Uzbek Alphabets’, CEUR Workshop Proc, vol. 3315, pp. 42 – 50, 2022, [Online]. Available: https://www.scopus.com/inward/record.uri?eid=2-s2.0- 85146119140&partnerID=40&md5=be670d829670d883b2f8326559ce954a
N. Abdurakhmonova and U. Tuliyev, ‘Morphological analysis by finite state transducer for Uzbek-English machine translation/Foreign Philology: Language’, Literature, Education, vol. 3, p. 68, 2018.
E. B. Boltayevich, E. Adali, K. S. Mirdjonovna, A. O. Xolmo’minovna, X. Z. Yuldashevna, and X. Nizomaddin Uktamboy O’G’li, ‘The Problem of Pos Tagging and Stemming for Agglutinative Languages (Turkish, Uyghur, Uzbek Languages)’, in UBMK 2023 - Proceedings: 8th International Conference on Computer Science and Engineering, 2023, pp. 57 – 62. http://doi.org/10.1109/UBMK59864.2023.10286792 .
A. M. Abdurashetona and I. O. Ismailovich, ‘Methods of Tagging Part of Speech of Uzbek Language’, in Proceedings - 6th International Conference on Computer Science and Engineering, UBMK 2021, 2021, pp. 82 – 85. https://10.1109/UBMK52708.2021.9558900
Y.-L. Yeong, T.-P. Tan, and S. K. Mohammad, ‘Using Dictionary and Lemmatizer to Improve Low Resource English-Malay Statistical Machine Translation System’, Procedia Comput Sci, vol. 81, pp. 243–249, 2016, doi: https://doi.org/10.1016/j.procs.2016.04.056
R. Zhang, H. Yamamoto, and E. Sumita, ‘On the Use of Lemmatization for Statistical Machine Translation’.
R. Zhang and E. Sumita, ‘Boosting statistical machine translation by lemmatization and linear interpolation’, in Proceedings of the 45th Annual Meeting of the Association for Computational Linguistics Companion Volume Proceedings of the Demo and Poster Sessions, 2007, pp. 181–184.