
Article

Kosukhin P.G., Yuryeva V.R. (scientific supervisor Yuryev R.N.) Challenges in NLP text analysis in Mandarin Chinese and Lao: a case study of media articles
UDC of the abstract: 004.8

This paper describes challenges specific to NLP text analysis in Mandarin Chinese and Lao, using the analysis of two media articles, one in Mandarin Chinese and one in Lao, as a case study. The analysis is conducted in Python. Its goal is to extract the most frequently repeated words in both texts. We briefly describe the challenges we encountered while analyzing the articles and the possible solutions we devised to overcome the difficulties that exist in NLP for Chinese and Lao.
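The abstract does not say which tools or segmentation method the authors used, so the following is only a minimal sketch of the kind of frequency extraction it describes. It assumes a greedy forward maximum-matching segmenter over a toy lexicon, since Mandarin Chinese (like Lao) is written without spaces between words and a plain whitespace split does not yield words. The names `max_match`, `top_words`, `LEXICON`, and `SAMPLE` are hypothetical and chosen for illustration; the sample sentence is invented and is not taken from the analyzed articles.

```python
import unicodedata
from collections import Counter

# Hypothetical toy lexicon; a real analysis would need a full dictionary
# or a trained segmenter for each language.
LEXICON = {"媒体", "报道", "新闻", "分析", "评论"}

# Invented sample text (not from the articles discussed in the abstract).
SAMPLE = "媒体报道新闻，媒体分析新闻，媒体评论"


def max_match(text, lexicon, max_len=4):
    """Greedy forward maximum matching.

    At each position, take the longest substring (up to max_len characters)
    found in the lexicon; fall back to a single character if none matches.
    """
    tokens = []
    i = 0
    while i < len(text):
        for size in range(min(max_len, len(text) - i), 0, -1):
            piece = text[i:i + size]
            if size == 1 or piece in lexicon:
                tokens.append(piece)
                i += size
                break
    return tokens


def is_word(token):
    """Keep tokens containing at least one letter or digit (drops punctuation)."""
    return any(unicodedata.category(ch)[0] in "LN" for ch in token)


def top_words(text, lexicon, n=3):
    """Return the n most frequent word tokens as (word, count) pairs."""
    tokens = [t for t in max_match(text, lexicon) if is_word(t)]
    return Counter(tokens).most_common(n)


# Whitespace splitting treats the whole sentence as one "word".
print(len(SAMPLE.split()))            # 1
print(top_words(SAMPLE, LEXICON, 2))  # [('媒体', 3), ('新闻', 2)]
```

The single-character fallback is a simplification: for Lao, where vowels and tone marks are combining characters, a fallback at the syllable or grapheme-cluster level would probably be needed instead.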

Authors:

Kosukhin Pavel Grigoryevich

Yuryeva Varvara Rodionovna

Supervisor:

Yuryev Rodion Nikolaevich

Kosukhin P.G., Yuryeva V.R. (scientific supervisor Yuryev R.N.) Challenges in NLP text analysis in Mandarin Chinese and Lao: a case study of media articles // Collection of abstracts of the Congress of Young Scientists. Electronic edition. St. Petersburg: ITMO University, [2025]. URL: https://kmu.itmo.ru/digests/article/15398