Giter VIP home page Giter VIP logo

conditional-motion-in-betweening's People

Contributors

childtoy avatar dependabot[bot] avatar jihoonerd avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

conditional-motion-in-betweening's Issues

How to use the generated actions in Blender or Unreal?

Hi! Thanks for your amazing work!
The trained network works pretty well, however I'm puzzled about how to use the generated actions in Blender or Unreal.
The outputs of the network are: (1)global pos, (2)global rot(as quaternion), using function quat_ik we can get local_quaternion and local_positions. I use the generated local_quaternion to create a animation using Blender, more detailly, I use a fbx downloaded from maximo.com, I delete the original animation, and set every bone's rotation_quaternion frame by frame.
Sadly, the result looks weird and is diffrent from the ploted images, below is the result pose:
image
It is expected to look like the blue pose in the following image as ploted by the CMIB code :
gt0
I guess this is because the Lafan1 dataset do not use popular T-pose as rest_pose. The generated quaternion is aimed to rotate the vector defined in the offsets array (in skeleton.py) instead of the T-pose.
So I wander if there is a convenience way to use the generated actions in Blender or Unreal? I believe this would extremly helpful, looking forward to your reply! Thanks! :)

Use linear probed discriminator

Current unrolled state does not handle sequential data, which may lead to fail capture modality.
Consider using the last cell state as a motion descriptor and discriminator input.

Benchmark models show different l2p,l2q from the paper

I download the benchmark models from the site, and test it on lanfan dataset. But the l2p and l2q are diffrent from the paper. I wonder if something wrong with my setting. Or, the benchmark models are not the best setting trained models.

Unit length representation in global coordinates

As prediction in global coordinate system presented improved visual performance, converting input representation from local to global is in progress. However, global coordinate prediction has critical disadvantage that cannot guarantee the length of links.

Predicting the unit displacement and multiplication after it will provide link length preserving representation in global coordinate.

[Hotfix] Replace label infogan encoder to direct injection

Current InfoganCodeEncoder was found not effective on embedding label into a first hidden state of LSTM. It is suspected that idea of expanding labels to hidden state (LSTM's hidden state, mostly >512) was unsuccessful.

This calls for reverting to direct injection of label by concatenating them.

where I can find corresponding code about Motion data augmentation?

Based on my own understand, there are 3 parts process about traing.

  1. Randomized Shuffled Anchor Pose: corresponding to the random mask_start_frame.
  2. Semantic Embedding: in the network Sturcture, cond_embedding
  3. motion data augmentation? I can't find the corresponding code?

[Major Change] Use BERT-based Transformer Encoder / Transformer Decoder

Experiments conducted has shown that finding manifold of motion generation and disentanglement of it is difficult.

BERT suggested not only a transformer based representation learning, but also presented Masked Language Model(MLM) task, which considerably resembles our in-betweening work.

Similar approach is studied at https://arxiv.org/abs/2103.00776.

Any kinds of discriminator (1D Conv, Transformer Decoder, RNN-based Decoder) can be integrated to yield mutual information loss.

Test with continuous infogan code

Ongoing code has been tested for discrete case only.
Continuous code might extract meaningful information. This paper suggests to use proper number of latent codes with adding nose to real data for stabilization.

About other datasets

Thanks for your excellent work! In your paper, you used four datasets. I would like to know where I can find the data and testing code for reading the HumanEva, HUMAN4D, and MPI-HDM05 datasets. Thank you! I look forward to your reply.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.