Comments (5)
希望能提供再详细些的信息,比如何种情况下比起以前来该分割的没分割?能提供测试文件最好。因为我自己本身用不到英文转录,开发测试都是基于日语音频。
另外split这个功能本身也是针对日语才加的,对英文不起效。
from n46whisper.
如这样一句话:
De Zerby's start at Brighton has been nothing short of spectacular, continuing the Seagulls transformation into one of the most aggressive and entertaining teams in the league, all with a simple principle at its core – possession football all over the pitch.,
但在whisper生成的文件里,则是按照固定长度分开来的De Zerby's start at Brighton has been nothing short of spectacular, continuing the Seagulls / transformation into one of the most aggressive and entertaining teams in the league, all / with a simple principle at its core – possession football all over the pitch. It's a principle
这三段,也就是说whisper在识别英文时把本来按照语法连接成的1句分割成了按照固定长度短句的3行。
from n46whisper.
有可能是和beam size
这个参数有关, 在换成faster-whisper之后我把这个参数定explicitly固定成5了。以前的话是默认的None.
The default beam size is 5 when using the whisper command line, but not when calling the model.transcribe method. Here the beam size defaults to None which means that greedy decoding is used.
from n46whisper.
感谢,已解决
from n46whisper.
感谢,已解决
请问一下怎么解决的呢,能说一下步骤吗?
from n46whisper.
Related Issues (20)
- 低识别率然后字幕中会提示“adjust_required” HOT 3
- 不知能否添加在视频里自动略过其他语言的功能
- 使用Google Gemini AI文本翻译出现未知错误 HOT 2
- 能不能再做一个从谷歌网盘选择文件夹的功能呢?
- 在语音识别库配置完毕,将开始转换这一步出错 HOT 1
- 一直显示 加载模型 Loading model... HOT 3
- 第一步,登陆google账户后就网页卡住,试了好几次都这样,不知道为什么。 HOT 1
- HF_TOKEN这个是 什么意思 不走了 之前都是好的 HOT 20
- 建议加个能导出纯文本txt的选项 HOT 2
- 加载模型错误 HOT 1
- 请求添加whisper的prompt选项
- GPT翻译能否增加一个网址选项,旨在使用其他网站的API
- 关于谷歌网盘文件类型的修改建议
- 建议选择谷歌文件时能够提示已选择的文件信息,和删除已选择的文件的选项 HOT 2
- [建议] 提高机翻双语字幕准确性的辅助工具 HOT 1
- only 2 file on google drive can be proces HOT 2
- 为什么会出现明明有对话 但是却丢了一大段字幕呢? HOT 2
- 我做了一个本地llm翻译、总结的版本,12G显存即可食用,欢迎来玩儿~ HOT 7
- 有概率谷歌云盘挂载无法选择文件 HOT 1
- 推荐一个日文语音识别的工具,ReazonSpeech HOT 2
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from n46whisper.