is2ai / issai_saida_kazakh_asr
The first industrial-scale open-source Kazakh speech corpus. The KSC2 corpus subsumes two previously introduced corpora, KSC and KazakhTTS2, and supplements them with additional data from other sources. KSC2 contains around 1.2k hours of high-quality transcribed data comprising over 600k utterances.
Home Page: https://issai.nu.edu.kz/kz-speech-corpus/
License: Creative Commons Attribution 4.0 International