
charisma-sdk-js's People

Contributors

bensalilijames, john-a-j, samuelbeard, szabinakov

charisma-sdk-js's Issues

Action required: Greenkeeper could not be activated 🚨

🚨 You need to enable Continuous Integration on all branches of this repository. 🚨

To enable Greenkeeper, you need to make sure that a commit status is reported on all branches. This is required by Greenkeeper because it uses your CI build statuses to figure out when to notify you about breaking changes.

Since we didn’t receive a CI status on the greenkeeper/initial branch, it’s possible that you don’t have CI set up yet. We recommend using Travis CI, but Greenkeeper will work with every other CI service as well.

If you have already set up a CI for this repository, you might need to check how it’s configured. Make sure it is set to run on all new branches. If you don’t want it to run on absolutely every branch, you can whitelist branches starting with greenkeeper/.

Once you have installed and configured CI on this repository correctly, you’ll need to re-trigger Greenkeeper’s initial pull request. To do this, please delete the greenkeeper/initial branch in this repository, and then remove and re-add this repository to the Greenkeeper App’s white list on GitHub. You'll find this list on your repo or organization’s settings page, under Installed GitHub Apps.

js sdk voice audio format smarter handling

“We’re noticing that when using a synthetic voice it sometimes doesn’t play correctly for the user. On Android, Gemelo voices aren’t playing and on iOS none of the voices are playing (Replica, Cereproc, Gemelo). Are there browser limitations to what synthetic voices we can use?”

We advised that this was likely an encoding mismatch, and to make sure the audio format is supported by the target browser.

The user then replied:

We managed to fix the issue by removing “ogg” from the encoding array. This probably means that your sdk is only using the first element of that array even if that encoding gives an error (which is what we were seeing).
conversation.setSpeechConfig({
  encoding: ["mp3"],
  output: "buffer",
});
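
Until the SDK handles this itself, one possible workaround is to filter the encoding list on the client before passing it to setSpeechConfig. The sketch below is only an illustration: the `MIME_BY_ENCODING` table, the `playableEncodings` helper and the `browserCanPlayType` wrapper are hypothetical names that are not part of the SDK, and the MIME types chosen for each encoding are assumptions. The only browser API it relies on is `HTMLMediaElement.canPlayType`, which returns an empty string when a type is not playable.

```javascript
// Hypothetical mapping from encoding names to MIME types (an assumption,
// not taken from the SDK).
const MIME_BY_ENCODING = {
  mp3: "audio/mpeg",
  ogg: "audio/ogg",
  wav: "audio/wav",
};

// Returns the encodings from `preferred` that `canPlayType` reports as
// playable, keeping the original order. Unknown encodings are dropped.
function playableEncodings(preferred, canPlayType) {
  return preferred.filter((encoding) => {
    const mime = MIME_BY_ENCODING[encoding];
    return mime !== undefined && canPlayType(mime) !== "";
  });
}

// In a browser, probe support with a throwaway audio element.
// Outside a browser there is no Audio constructor, so report "not playable".
function browserCanPlayType(mime) {
  return typeof Audio === "undefined" ? "" : new Audio().canPlayType(mime);
}

// Usage sketch (`conversation` comes from the SDK, as in the snippet above):
// conversation.setSpeechConfig({
//   encoding: playableEncodings(["ogg", "mp3"], browserCanPlayType),
//   output: "buffer",
// });
```

If the filtered list comes back empty, it is probably safer to fall back to a single widely supported format such as mp3 than to send an empty array.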

Add a new ChatMode for when other characters are talking.

There are currently two ChatModes that are sent from the React SDK: "Chat" and "Tap". When characters are talking and not expecting the player to talk, it would be good for there to be a ChatMode of "Character" (or a better name). That way, we can disable the chat when the player shouldn't be typing.
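
As a rough illustration of how a client could use such a mode, the sketch below treats "Character" as a third value alongside the two existing ones. The "Character" mode does not exist yet, and `CHAT_MODES` and `isTextInputDisabled` are hypothetical names, not part of either SDK.

```javascript
// "Chat" and "Tap" are the modes named in this issue.
// "Character" is the proposed mode and is purely hypothetical.
const CHAT_MODES = ["Chat", "Tap", "Character"];

// Returns true when the text input should be disabled because a
// character is talking and no player reply is expected.
function isTextInputDisabled(chatMode) {
  if (!CHAT_MODES.includes(chatMode)) {
    throw new Error(`Unknown chat mode: ${chatMode}`);
  }
  return chatMode === "Character";
}
```

A UI could call this whenever the mode changes and bind the result to the input's disabled state.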

add to docs about the differences between microphone.startListening() and playthrough.startSpeechRecognition()

There are two speech recognition systems available in the Charisma JS SDK.

  1. microphone.startListening() uses the built-in browser SpeechRecognition. It’s free, but it’s not available in every browser (such as Firefox). This is what the https://charisma.ai/ website editor uses in the chat tester/‘play’ page.
  2. playthrough.startSpeechRecognition() is our premium alternative, which leverages Deepgram (or AWS/Google) under the hood. It works in every browser, but it uses credits. The quality should be significantly better.
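
The choice between the two systems can be sketched as a small feature check. This is only an illustration: `hasBrowserSpeechRecognition` and `chooseSpeechRecognition` are hypothetical helpers, not SDK functions. The check relies on the Web Speech API exposing `SpeechRecognition` (or the prefixed `webkitSpeechRecognition`) as a global constructor in browsers that support it.

```javascript
// Returns true when the given global object exposes the browser's
// built-in speech recognition constructor (standard or webkit-prefixed).
function hasBrowserSpeechRecognition(globalObject) {
  return (
    typeof globalObject.SpeechRecognition === "function" ||
    typeof globalObject.webkitSpeechRecognition === "function"
  );
}

// Returns "microphone" to use the free browser recogniser, or
// "playthrough" to use the premium server-side recogniser.
function chooseSpeechRecognition(globalObject, preferPremium) {
  if (preferPremium || !hasBrowserSpeechRecognition(globalObject)) {
    return "playthrough";
  }
  return "microphone";
}

// Usage sketch (`microphone` and `playthrough` come from the SDK):
// if (chooseSpeechRecognition(globalThis, false) === "microphone") {
//   microphone.startListening();
// } else {
//   playthrough.startSpeechRecognition();
// }
```

Passing the global object in as a parameter keeps the helper easy to test outside a browser.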

Regarding method 2), we recently shipped a fix in @charisma-ai/sdk version 4.0.3 which resolves an issue with using an incorrect sample rate, leading to odd/garbled/no results in certain browsers.

Could I check you’re using method 2) and you’re on the latest version of the SDK (4.0.3)? If so, perhaps you could try either the AWS or Google engines by providing service: "unified:aws" or service: "unified:google" in the startSpeechRecognition options, and see if that works any better?

There is a message called speech-recognition-started which we send to the client when the connection to the STT service has been initialised successfully. You can use onSpeechRecognitionStarted on a Playthrough to react to it.
You can also see these messages in Chrome DevTools if you inspect the WebSocket messages.

The STT custom options are described in more detail here so there might be options that can be set for better compatibility with different browsers.

Also add a further ticket to address this in the React SDK, where it might be done slightly differently.
