Light

lihanghang / casr-demo Goto Github PK

View Code? Open in Web Editor NEW

151.0 4.0 28.0 99.48 MB

基于Flask Web的中文自动语音识别演示系统,包含语音识别、语音合成、声纹识别之说话人识别。

Python 2.45% CSS 56.92% JavaScript 38.15% HTML 2.48%

flask-application speech-to-text ctc baidu-aip pyaudio speaker-recognition gmm casr-demo

casr-demo's Introduction

CASR-DEMO(中文自动语音识别演示系统）

ChangeLog

2024-03-31

可用功能为：说话人识别，语音识别。合成功能不可用。
重构了代码结构，最新分支为refactor/casr_demo，大家可以使用。
在Mac上做过验证。Python不低于3.8.
测试发现，有些关于语音的包安装可能不会直接成功，但是只要Google下就能解决。

启动方式

conda create -n casr python==3.12
pip install -r requirements.txt
python src/manage.py

关于本项目的一些说明

首先，欢迎大家关注项目，进行学习研究。收到一些小伙伴的问题我就集中回答下，这里是demo的源码、有两个版本其一是名为speech_env,这是一个简单的语音识别功能，界面如下面的效果图一；还有一个是V2.0的目录，这个版本功能比较齐全，界面如效果图二。大家感兴趣在自己机器上试试的话我推荐直接使用v2.0版本，还有一点项目只在win10平台上测试过，其他不保证能不能运行。由于月久失更，有些依赖包可能需要修改，不过应该都是小问题，根据实际过程的报错信息修复就行。再次感谢大家的关注！

最新整理了两个版本的发布版

点此处详见

speech_env（效果图一）

speechV2.0 基于第三方接口实现语音识别和语音合成、说话人识别功能(效果图二)

E-mail: [email protected] wiki: http://wiki.lihanghang.top
Updated on December 25,2019.

casr-demo's People

Contributors

Stargazers

Watchers

casr-demo's Issues

有关识别率的问题

您好！对您的项目很感兴趣，想了解一下，您那里有无大概数据有关识别率的情况？

项目Speech2.0的相关问题

大神您好，我想请问一下您这个项目是要部署到服务器上的是吗，我不知道该部署哪些文件到网站上，因为speechV2.0有太多文件了，还有文档，有点看不懂，还望您多多指点。
还有，我不明白白的一点是，您这个部署到服务器后，它所需要的功能库也是需要在网站上进行安装才能正常运行的是吗

v2.0版本

2.0的demo中manage.py主程序中导入的三方库是哪个?
自己没有找到,并且项目中没有这个包
from Speaker_Recognition import register, speakerrecog # 声纹识别库

Recommend Projects

React

A declarative, efficient, and flexible JavaScript library for building user interfaces.
Vue.js

🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
Typescript

TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
TensorFlow

An Open Source Machine Learning Framework for Everyone
Django

The Web framework for perfectionists with deadlines.
Laravel

A PHP framework for web artisans
D3

Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

javascript

JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
web

Some thing interesting about web. New door for the world.
server

A server is a program made to process requests and deliver data to clients.
Machine learning

Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Visualization

Some thing interesting about visualization, use data art
Game

Some thing interesting about game, make everyone happy.

Recommend Org

Facebook

We are working to build community through open source technology. NB: members must have two-factor auth.
Microsoft

Open source projects and samples from Microsoft.
Google

Google ❤️ Open Source for everyone.
Alibaba

Alibaba Open Source for everyone
D3

Data-Driven Documents codes.
Tencent

China tencent open source team.