METADATA AND FORMATTING ISSUES IN AN UZBEK-ENGLISH PARALLEL CORPUS AND EXISTING NLP TOOLS FOR THE UZBEK LANGUAGE

Authors

  • Botir Elov
  • Ma’rufjon Amirkulov
  • Malika Suyunova

DOI:

https://doi.org/10.47390/ts-v3i9y2025No3

Keywords:

parallel corpus, metadata, TEI, CoNLL-U, Uzbek language, NLP, morphological analysis, tagging, lemmatization, linguistic resource

Abstract

This article addresses three questions that arise when building an Uzbek-English parallel corpus: how to format metadata, how to apply the TEI and CoNLL-U standards, and which natural language processing (NLP) tools already exist for Uzbek. It examines step by step the structure of the parallel corpus, the correspondence between the aligned data, and the syntactic and morphological tagging processes. It also analyzes the capabilities of the morphological analyzers, lemmatizers, and POS taggers developed for Uzbek, and substantiates their practical value both for building parallel corpora and for sentiment analysis of text. The results serve as a methodological basis for encoding parallel corpora with high accuracy, analyzing them automatically, and integrating them into search systems.
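As a rough illustration of the kind of encoding the abstract refers to, the sketch below serializes one aligned sentence pair as a CoNLL-U block. It is not taken from the article: the function name `to_conllu`, the sentence identifier, and the example sentence are hypothetical. The `# sent_id`, `# text`, and `# text_en` comment lines follow common CoNLL-U metadata conventions, and every annotation column is left as `_` because no tagger output is assumed here.

```python
def to_conllu(sent_id, uz_text, en_text, tokens):
    """Serialize one aligned sentence as a CoNLL-U block (hypothetical helper).

    Metadata goes in '# key = value' comment lines. Each token line has the
    ten tab-separated CoNLL-U columns, with '_' for fields not yet annotated.
    """
    lines = [
        f"# sent_id = {sent_id}",
        f"# text = {uz_text}",
        f"# text_en = {en_text}",  # English side of the aligned pair
    ]
    for i, form in enumerate(tokens, start=1):
        # Columns: ID, FORM, LEMMA, UPOS, XPOS, FEATS, HEAD, DEPREL, DEPS, MISC
        lines.append("\t".join([str(i), form] + ["_"] * 8))
    # A blank line terminates each sentence block in CoNLL-U.
    return "\n".join(lines) + "\n\n"


# Illustrative sentence pair and tokenization (assumed, not from the corpus).
block = to_conllu(
    "uz-en-0001",
    "Men kitob o‘qidim.",
    "I read a book.",
    ["Men", "kitob", "o‘qidim", "."],
)
print(block, end="")
```

In a fuller pipeline, the output of a morphological analyzer, lemmatizer, or POS tagger would replace the `_` placeholders in the LEMMA, UPOS, and FEATS columns.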

Submitted

2025-10-22

Published

2025-10-23

How to cite

Elov, B., Amirkulov, M., & Suyunova, M. (2025). O‘ZBEK-INGLIZ PARALLEL KORPUSIDA METAMA’LUMOTLAR VA FORMATLASH MASALASI HAMDA O‘ZBEK TILI UCHUN MAVJUD NLP VOSITALARI. Techscience.Uz - Topical Issues of Technical Sciences, 3(9), 14–21. https://doi.org/10.47390/ts-v3i9y2025No3