Giter VIP home page Giter VIP logo

casr-demo's Introduction

CASR-DEMO(中文自动语音识别演示系统)

ChangeLog

2024-03-31

  1. 可用功能为:说话人识别,语音识别。合成功能不可用。
  2. 重构了代码结构,最新分支为refactor/casr_demo,大家可以使用。
  3. 在Mac上做过验证。Python不低于3.8.
  4. 测试发现,有些关于语音的包安装可能不会直接成功,但是只要Google下就能解决。

启动方式

conda create -n casr python==3.12
pip install -r requirements.txt
python src/manage.py

关于本项目的一些说明

首先,欢迎大家关注项目,进行学习研究。收到一些小伙伴的问题我就集中回答下,这里是demo的源码、有两个版本其一是名为speech_env,这是一个简单的语音识别功能,界面如下面的效果图一;还有一个是V2.0的目录,这个版本功能比较齐全,界面如效果图二。大家感兴趣在自己机器上试试的话我推荐直接使用v2.0版本,还有一点项目只在win10平台上测试过,其他不保证能不能运行。由于月久失更,有些依赖包可能需要修改,不过应该都是小问题,根据实际过程的报错信息修复就行。再次感谢大家的关注!


最新整理了两个版本的发布版

speech_env(效果图一)

效果图1

speechV2.0 基于第三方接口实现语音识别和语音合成、说话人识别功能(效果图二)

效果图2


E-mail: [email protected] wiki: http://wiki.lihanghang.top
Updated on December 25,2019.

casr-demo's People

Contributors

dependabot[bot] avatar lhhroots avatar lihanghang avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar

casr-demo's Issues

有关识别率的问题

您好!对您的项目很感兴趣,想了解一下,您那里有无大概数据有关识别率的情况?

项目Speech2.0的相关问题

大神您好,我想请问一下您这个项目是要部署到服务器上的是吗,我不知道该部署哪些文件到网站上,因为speechV2.0有太多文件了,还有文档,有点看不懂,还望您多多指点。
还有,我不明白白的一点是,您这个部署到服务器后,它所需要的功能库也是需要在网站上进行安装才能正常运行的是吗

v2.0版本

2.0的demo中manage.py主程序中导入的三方库是哪个?
自己没有找到,并且项目中没有这个包
from Speaker_Recognition import register, speakerrecog # 声纹识别库

image

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.