Giter VIP home page Giter VIP logo

deepmultispeech's People

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar

deepmultispeech's Issues

TODO list

FUNDAMENTAL

  • Repair data: download clean, full dataset and merge data already available.
    It's been observed that there are barely any sentence common to all the speakers, contrary to what the website for noisy-VCTK corpus says. Therefore, for parallel VC it is important to have all the data. Otherwise the number of samples available will be ridiculous.
  • Make Noisy-VCTK automatic pipeline for downloading data.
  • New metadata processing - following original tree structure.

Input data:

  • SE training on 4 target speakers
  • SE test on 2 test speakers
  • VC training on 2? source speakers and same 4 target speakers
  • VC test on 2 test speakers and same 4 target speakers
  • VC as parallel-VC (same utterance source-target)
  • DTW between source Mel and target Mel
  • TTS training on same 4 target speakers
  • TTS test on 2 test speakers
  • User-defined number of source speakers in VC
  • User-defined number of target speakers in SE/VC

Architecture:

  • SE modality
  • VC modality
  • TTS modality
  • Body with several projection layers
  • Include non-linearities Body
  • Change dimensionality at different layers - Change of concept

Training:

  • Split training of modalities - rest of model (speedup?)
  • Train jointly
  • Extract feature maps obtained after modality nets
  • Extract feature maps obtained after body

Eval:

  • Run SNR on SE results

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.