Comments (7)
Hi @freetimekate,
I created a similar app, Leah:
https://github.com/drorm/leah
https://chrome.google.com/webstore/detail/leah-language-educator-an/bolaopgaickihobdlddefmdffopigkmh
that is focused on language learning, but I'd love to explore adapting or creating a different version that's focused on folks that are vision impaired.
Other than shortcuts, is there any other functionality that you'd be interested in?
from talk-to-chatgpt.
@C-Nedelcu, I'm so glad to see your reaction since I was a little reluctant to post about a "competing" product, which is funny since both are totally open source :-).
ChatGPT does really well, better than Google Translate, with mainstream languages such as English, French and Spanish, as far as I can tell. With less widely used languages, such as Hebrew, it still has rough edges. I'm not surprised, since there are far fewer documents for ChatGPT to train on, and I expect it to improve over time.
I wish I had figured out how to use your product and build on it earlier, but at this point I'm used to my stack: Angular, TypeScript, etc. So I will continue there.
Happy though to answer some of the obvious questions in the issues.
Hi freetimekate,
Do you mean keyboard shortcuts like CTRL+ALT+something?
This is pretty easy to add, yeah. What shortcuts would be helpful?
I could see the following:
- muting the mic
- muting the bot
- skipping the current message
Any further suggestions? This could make for a good update for V1.7
Hi C-Nedelcu,
That's it exactly! Just like using CTRL+R to refresh the page. So cool to hear it could be relatively easy. Thinking about this in detail, for a low-vision or screen-free experience, I would recommend a keyboard shortcut for each of the following:
- Actuate start (pressing ENTER or equivalent). This way you can refresh the page with CTRL+R, then start the tool without the mouse.
- Stop/skip bot (as discussed).
- Toggle mute on/off (setting the default state of the mute button, as discussed).
- Mute override keyboard shortcut (press and hold to temporarily unmute/mute). This is particularly helpful for users who can't see the icons and are in environments where others may be speaking. Simply set the default to mute (or unmute) and press and hold a keyboard shortcut any time you'd like to input with voice (or speak without inputting, if the default is unmuted). It's easy enough to toggle the buttons when you can see the icons, but this feature would be very helpful for low-vision and screen-free applications.
- In an ideal world there would be a voice input mode. With one key command, the page is refreshed, start is actuated, and the microphone is toggled to mute. The keyboard shortcut could be mentioned in your settings page.
The low-vision user would then primarily use two keyboard shortcuts: the skip/stop key command to interrupt, and the press-and-hold mute override key(s) to input with voice.
Now that I've said that, I'm realizing that if you're intending to speak, then you probably don't want to be talked over, so the mute override key should also skip the current message. That would actually be slick. More like natural conversation.
The low-vision user would then only need a single keyboard shortcut to operate the interface. I could also see this used in headless applications for communicating with ChatGPT without a screen at all, with just a single voice input button. Just press and hold to input with voice.
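To make the press-and-hold idea concrete, here is a minimal TypeScript sketch of the override logic. It assumes the mic is muted by default, that pressing the key unmutes it and skips the bot's current message, and that releasing restores the default. All names here (`VoiceActions`, `PushToTalk`, `setMicMuted`, `skipCurrentMessage`) are hypothetical and not taken from the Talk-to-ChatGPT source.

```typescript
// Hypothetical interface: the two actions the override key needs to trigger.
interface VoiceActions {
  setMicMuted(muted: boolean): void;
  skipCurrentMessage(): void;
}

// Press-and-hold override: flips the mic away from its default state while
// the key is held, and restores the default on release.
class PushToTalk {
  private held = false;
  private readonly actions: VoiceActions;
  private readonly defaultMuted: boolean;

  constructor(actions: VoiceActions, defaultMuted: boolean = true) {
    this.actions = actions;
    this.defaultMuted = defaultMuted;
    this.actions.setMicMuted(defaultMuted); // apply the default state up front
  }

  // Call from a keydown handler. Repeated keydown events (auto-repeat while
  // the key stays down) are ignored.
  press(): void {
    if (this.held) return;
    this.held = true;
    this.actions.skipCurrentMessage(); // stop the bot so the user is not talked over
    this.actions.setMicMuted(!this.defaultMuted);
  }

  // Call from a keyup handler.
  release(): void {
    if (!this.held) return;
    this.held = false;
    this.actions.setMicMuted(this.defaultMuted);
  }
}
```

In a browser, `press()` and `release()` would be called from `keydown` and `keyup` listeners for whichever key is chosen. Keeping the class free of DOM calls makes the behaviour easy to check on its own.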
Would love to hear your thoughts. Thanks again for creating such a helpful tool and considering these updates for low-vision and screen-free users! Looking forward to your next update whatever you choose to do!
Hi there, I just wanted to say that this wouldn't only be useful for the visually impaired; I think it would be great to have shortcuts in general, for all users. If the shortcuts were user-customisable, that would be even better! It's an excellent product, thanks so much for making it.
Hi @freetimekate, I created a similar app, Leah: https://github.com/drorm/leah https://chrome.google.com/webstore/detail/leah-language-educator-an/bolaopgaickihobdlddefmdffopigkmh that is focused on language learning, but I'd love to explore adapting or creating a different version that's focused on folks that are vision impaired. Other than shortcuts, is there any other functionality that you'd be interested in?
This is highly interesting and exciting (much more so than my own extension, which I consider to be basically a gimmick). I'm amazed to discover how people find new uses for ChatGPT. Its capabilities in terms of translation, grammar, spell check, and everything that has to do with languages in general are often overlooked in the press. Truthfully, when it comes to translations, ChatGPT performs incredibly well!
I'm sorry I haven't been making progress on keyboard shortcuts recently. I haven't had time to dedicate to improving Talk-to-ChatGPT lately. I'd love it if there were some pull requests from other developers; anyone is welcome to contribute.
I have added keyboard shortcuts in v2.0:
- ALT+SHIFT+S: start Talk-To-ChatGPT
- ALT+SHIFT+H: suspend/resume speech recognition (Hush)
- ALT+SHIFT+V: suspend/resume bot's voice (Voice)
- ALT+SHIFT+L: skip current message (Leap)
The choice of key combination was particularly difficult when you realize that Chrome already uses a LOT of these. All the ones I wanted were already taken, basically.
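For anyone who wants to experiment with a similar mapping, the sketch below shows one way the four ALT+SHIFT combinations could be matched in TypeScript. It is an illustration only, not the extension's actual implementation: the names `ShortcutAction`, `KeyCombo`, `SHORTCUTS` and `matchShortcut` are hypothetical. It matches on the physical key (`code`, e.g. `"KeyS"`) rather than the typed character, on the assumption that ALT+SHIFT may produce a different character on some keyboard layouts.

```typescript
// Hypothetical action names for the four shortcuts listed above.
type ShortcutAction = "start" | "toggleRecognition" | "toggleVoice" | "skipMessage";

// Structural subset of the DOM KeyboardEvent, so the matcher can be
// exercised without a browser.
interface KeyCombo {
  altKey: boolean;
  shiftKey: boolean;
  ctrlKey: boolean;
  metaKey: boolean;
  code: string; // physical key, e.g. "KeyS"
}

// ALT+SHIFT+<key> to action, keyed by KeyboardEvent.code.
const SHORTCUTS = new Map<string, ShortcutAction>([
  ["KeyS", "start"],
  ["KeyH", "toggleRecognition"],
  ["KeyV", "toggleVoice"],
  ["KeyL", "skipMessage"],
]);

// Returns the matching action, or undefined when the combination is not one
// of ours. Requires exactly ALT+SHIFT; combinations that also hold CTRL or
// META are left to the browser.
function matchShortcut(e: KeyCombo): ShortcutAction | undefined {
  if (!e.altKey || !e.shiftKey || e.ctrlKey || e.metaKey) return undefined;
  return SHORTCUTS.get(e.code);
}
```

A real `KeyboardEvent` satisfies `KeyCombo` structurally, so in a page the function could be called directly from a `keydown` listener. Making the shortcuts user-customisable would then amount to filling the map from saved settings instead of hard-coding it.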