Vacancy
A large number of scientists, programmers, teachers and students work on the Corpus. All of them can be divided into several working groups, each of which performs a specific task. Below we have listed the main such working groups and described the current "vacancies" for each of them. In parentheses, you will find an indication of which professional field is most likely to be engaged in this; however, these are not strict requirements and we are discussing any of your suggestions. So be sure to write!
Working with the site
- Web design of the search page (programmers)
- Development of the mobile version of the Corpus (programmers)
Replenishment of The Corpus
- alignment of parallel texts (Sinologists)
- search for new parallel texts in Goyu (Тайвань, Taiwan), Huayu (华 华, Singapore) (Sinologists)
The corpus of translations from the Wenyan language (─言)
- Search for new parallel texts in Wenyang (Sinologists)
- Search and development of related technologies (Sinologists)
Educational environment based on the Corpus
- Creation of an algorithm for automatic simplification of Russian and Chinese texts for a particular level of language knowledge (programmers, linguists, Sinologists, RCTs)
- Creating additional features in the corpus - audio and video examples for quotations from books contained in the corpus (programmers, Sinologists, RCTs)
- Creating an extension for Chrome that shows the most frequent contexts for a particular word (programmers, Sinologists, RCTs)
- Creation of methodological manuals on Russian as a foreign language and Chinese as a foreign language (linguists, Sinologists, RCTs)
Development of a new layout for the Corpus
- Creating an algorithm that would highlight the most likely translation of a particular word (programmers, linguists, Sinologists)
- Creating linguistic markup for Chinese texts - word separation, PoS tags, pinyin annotation, syntactic and semantic markup (programmers, linguists, Sinologists)
- Checking the existing markup for texts in traditional spelling (Sinologists, programmers)
Research tasks for the future
- Creating an algorithm that will automatically generate texts of different styles
- Research on translation studies, Chinese stylistics and philology
SMM and popularization
- SMM for TikTok
- SMM for WeChat (native Chinese speaker)
If you have any questions about working together, write to the project coordinator Kirill Semenov (kir.semenow@yandex.ru)