Databases of Tatar text and speech are currently being accumulated and analyzed, machine learning technologies are being developed, and a Tatar speech interface is being integrated into modern PCs and mobile devices. To build a universal speech recognition system, a voice database of more than 400 speakers, with a total duration of about 60 hours, has been collected. The necessary programs and models have been created, and the first experimental version of the recognition system, which recognizes 200 thousand Tatar words, has been launched. The results achieved are comparable to those of similar systems worldwide and make it possible to "communicate" with a computer using voice commands (speech translation, mobile assistants, message dictation, news reading).
Last updated: 8 December 2025, 16:44