
is2ai / issai_saida_kazakh_asr


The first industrial-scale open-source Kazakh speech corpus. The KSC2 corpus subsumes two previously introduced corpora, KSC and KazakhTTS2, and supplements them with additional data from other sources. KSC2 contains around 1.2k hours of high-quality transcribed data comprising over 600k utterances.

Home Page: https://issai.nu.edu.kz/kz-speech-corpus/

License: Creative Commons Attribution 4.0 International

Shell 85.30% Python 14.70%
speech-recognition speech-synthesis speech-to-text speechrecognition

issai_saida_kazakh_asr's People

Contributors

mussakhojayeva


issai_saida_kazakh_asr's Issues

README file contains placeholder link

The README file uses the placeholder `thelink` as the target of the paper link:

# ISSAI_SAIDA_Kazakh_ASR
This repository provides the recipe for the paper [A Crowdsourced Open-Source Kazakh Speech Corpus](thelink). 

The link for downloading the model does not work

Hi guys, how are you? I am very glad that you are doing such much-needed work. However, the links pointing to NU are not working. Could you please fix this issue? Thank you.

Preprocessing Data from Dataset

Do you have plans to clean your data, i.e. to trim the long pauses at the beginning and end of the audio?
Have you tried training speech synthesis models on this dataset?
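
For readers who want to trim the pauses themselves, here is a minimal sketch of energy-based trimming. It is not the authors' preprocessing. The function name `trim_silence`, the `threshold` and `frame_len` values, and the assumption that samples are floats normalized to [-1, 1] are all illustrative choices (400 samples would be 25 ms if the audio were sampled at 16 kHz, which is also an assumption).

```python
def trim_silence(samples, threshold=0.01, frame_len=400):
    """Drop leading and trailing frames whose peak amplitude is below `threshold`.

    `samples` is a list of floats assumed to be normalized to [-1, 1].
    Both default values are hypothetical and would need tuning on real data.
    """
    # Split the signal into consecutive, non-overlapping frames.
    frames = [samples[i:i + frame_len] for i in range(0, len(samples), frame_len)]
    # Indices of frames that contain at least one sample above the threshold.
    loud = [i for i, frame in enumerate(frames)
            if max(abs(x) for x in frame) >= threshold]
    if not loud:
        return []  # the whole signal is below the threshold
    start = loud[0] * frame_len
    end = min(len(samples), (loud[-1] + 1) * frame_len)
    # Pauses inside the utterance are kept; only the edges are trimmed.
    return samples[start:end]
```

A fixed peak threshold is the simplest possible criterion; noisy recordings would likely need a threshold relative to the noise floor instead.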

Pretrained model

Hello,

Are you going to release a pretrained model?

Greetings

ModuleNotFoundError: No module named 'pandas'

Hello, I have installed ESPnet according to the instructions, and the egs/yesno tests pass. I then changed the data path as described in the steps, but running ./run.sh fails with the error below. Running import pandas on its own works. What is the reason for this?
stage 0: Setting up directories
Traceback (most recent call last):
File "local/data_prep.py", line 4, in
import pandas as pd
ModuleNotFoundError: No module named 'pandas'
torch=1.5.1 cuda=10.1
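
One plausible cause, not confirmed in this thread, is that `./run.sh` runs a different Python interpreter than the one used for the manual `import pandas` test, for example because the recipe activates its own environment. The sketch below reports which interpreter is running and whether it can see a given module. The helper name `module_report` is hypothetical; saving the snippet as a script and invoking it from inside the recipe would show the interpreter the recipe actually uses.

```python
import importlib.util
import sys


def module_report(name):
    """Return (path of the running interpreter, whether `name` is importable)."""
    found = importlib.util.find_spec(name) is not None
    return sys.executable, found


if __name__ == "__main__":
    exe, found = module_report("pandas")
    print(f"interpreter: {exe}")
    print(f"pandas importable: {found}")
    if not found:
        # Install into this exact interpreter rather than whichever
        # `pip` happens to be first on PATH.
        print(f"try: {exe} -m pip install pandas")
```

If the reported interpreter differs from the one in the interactive shell, installing pandas with that interpreter's own `-m pip` is the usual remedy.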
