Comments (5)
Hi, the sequence of mouth ROIs (before cropping) is saved as a grayscale video. data should have a shape of (T, H, W), where T is the number of frames and H and W denote the height and width, respectively. Can you print the filename self.list[idx][0] at dataset.py#L109 and data at dataset.py#L115 to identify the problematic file and check its shape?
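A minimal sketch of the shape check described above, assuming the loaded sequence is available as a NumPy array. The helper name describe_roi_sequence and the synthetic 29x96x96 arrays are illustrative only and are not taken from the repository:

```python
import numpy as np


def describe_roi_sequence(data):
    """Return (ok, message) for a mouth-ROI array expected to be (T, H, W)."""
    arr = np.asarray(data)
    if arr.ndim == 3:
        t, h, w = arr.shape
        return True, f"OK: {t} frames of {h}x{w} grayscale"
    if arr.ndim == 4 and arr.shape[-1] == 3:
        # A trailing axis of size 3 suggests the frames were saved in colour.
        return False, f"colour frames found, shape {arr.shape}; expected (T, H, W)"
    return False, f"unexpected shape {arr.shape}; expected (T, H, W)"


# Synthetic stand-ins for a loaded sequence (sizes are assumptions).
gray = np.zeros((29, 96, 96), dtype=np.uint8)
color = np.zeros((29, 96, 96, 3), dtype=np.uint8)

print(describe_roi_sequence(gray))
print(describe_roi_sequence(color))
```

Calling a check like this next to the filename print should make it easy to spot which file has an extra channel axis.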
from lipreading_using_temporal_convolutional_networks.
Hi, I had not used the --convert-grayscale option, which is why this issue arose. It is resolved now. Thank you so much for your help!
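For anyone who already has colour ROIs on disk, here is a rough sketch of what a grayscale conversion does to the array shape. It assumes uint8 frames of shape (T, H, W, 3) in RGB channel order and uses the common BT.601 luma weights. The function name to_grayscale is hypothetical, and this is not necessarily how the preprocessing script converts frames:

```python
import numpy as np


def to_grayscale(frames):
    """Collapse (T, H, W, 3) RGB uint8 frames to (T, H, W) grayscale."""
    frames = np.asarray(frames)
    if frames.ndim == 3:
        # Already (T, H, W); nothing to do.
        return frames
    if frames.ndim != 4 or frames.shape[-1] != 3:
        raise ValueError(f"expected (T, H, W, 3), got {frames.shape}")
    # BT.601 luma weights for R, G, B (assumed channel order).
    weights = np.array([0.299, 0.587, 0.114])
    gray = frames.astype(np.float64) @ weights
    return np.clip(np.rint(gray), 0, 255).astype(np.uint8)


color = np.full((5, 4, 4, 3), 200, dtype=np.uint8)
gray = to_grayscale(color)
print(gray.shape, gray.dtype)
```

Note that frames read with OpenCV are in BGR order, so the weights would need to be reversed in that case.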
Hey, can you tell me how to use the --convert-grayscale option?
Hello @rohith-crypto, you can pass the --convert-grayscale argument on the command line when running crop_mouth_from_video.py. Please check the example at preprocessing/README.md.
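To illustrate what passing such an argument does, here is a small argparse sketch. It assumes --convert-grayscale is a boolean switch that takes no value. The parser below is hypothetical: the actual definition in crop_mouth_from_video.py and its other arguments may differ, so preprocessing/README.md remains the reference for the full command:

```python
import argparse


def build_parser():
    """Hypothetical parser showing only a boolean grayscale switch."""
    parser = argparse.ArgumentParser(description="Sketch of a preprocessing CLI")
    parser.add_argument(
        "--convert-grayscale",
        action="store_true",  # flag is False unless given on the command line
        help="save the cropped mouth ROIs as grayscale frames",
    )
    return parser


# Simulate running the script with and without the flag.
args_on = build_parser().parse_args(["--convert-grayscale"])
args_off = build_parser().parse_args([])
print(args_on.convert_grayscale, args_off.convert_grayscale)
```

With a switch defined this way, the option is simply appended to the command and is off when omitted.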
"Hi, I did not use the --convert-grayscale option, hence this issue arose. The issue is resolved now. Thank you so much for your help!"
Could you please show how you solved this in code?