Comments (7)
What TensorFlow version is it happening on?
During development we used 1.12.0
from mixmatch.
My TensorFlow version is 1.14.0.
And I also want to ask how many iterations in training process? I am a bit confused about you code. It's seem to the number of iteration in your implement is not same to the paper "Realistic evaluation of deep semi-supervised learning algorithms" (500,000 iterations).
from mixmatch.
So try
pip install tensorflow-gpu==1.12.0
As to the iteration, this code counts them in number of images seen. For standard experiments, the number is 64 million roughly speaking. So in terms of iterations, the number would be divided by the batch size (64).
Yes there are many difference with the paper you mentioned, amongst them we also do not use a learning rate schedule, we also use less augmentations (just mirror and shift) to keep things simpler.
from mixmatch.
I know, I will try tensorflow with 1.12.0. Thank you for your reply.
from mixmatch.
I also pushed a commit to the requirements.txt
file so that it install 1.12.0
by default.
from mixmatch.
Did you have any success with making pseudo_label.py run @zhangbin0917 ? I am having a similar error while running mixup.py, vat.py, and pseudo_label.py, however, I am able to successfully run mixmatch.py, ict.py, and mean_teacher.py .
ValueError: Cannot feed value of shape (64, 2, 32, 32, 3) for Tensor 'y:0', which has shape '(?, 32, 32, 3)'
This is using version 1.13.1 of tensorflow.
from mixmatch.
That's a different issue - the problem there is the placeholder y_in (line 43 in mixup.py, 37 in vat.py, and 36 in pseudo_label.py) has the wrong shape:
[None] + hwc,
when the ClassifySemi code passes in
[None, 2] + hwc
A quick fix might be changing the y_in line to
y_in = tf.placeholder(tf.float32, [None, 2] + hwc, 'y')
y_stacked = tf.reshape(y_in, [-1] + hwc)
and then changing
logits_y = classifier(y_in, training=True)
to
logits_y = classifier(y_stacked, training=True)
in the relevant files, but I'm not sure that follows intended behavior of the respective algorithms.
from mixmatch.
Related Issues (20)
- When will Remixmatch (ICLR'20) be available? HOT 3
- A question about "post_ops" in mixmatch.py HOT 2
- Implemented on other models HOT 1
- What are the most important things to reproduce the result on my own dataset? HOT 2
- Use MixMatch on tabular data. HOT 6
- A question about lambda_u HOT 2
- Is there any reason why you chose to use Beta Distribution? HOT 1
- Reason for ramping up weight of unlabelled loss function(lambda_u). HOT 3
- Comparison of fully supervised models with MixMatch. HOT 2
- How to chose total number of training steps HOT 5
- how to save the train and test accuracies to disk HOT 2
- question about mixmatch/scripts/create_split.py line113-130
- Working with higher resolution images HOT 1
- Hello, can I use it for multi label classification? If so, what should I pay attention to in the process of tag prediction? For multi label classification, sigmoid is generally used as the loss function. In this case, can you change your loss function to sigmoid? HOT 3
- what is the proper behavior of consistency loss HOT 1
- how to recover performance when doing evaluation HOT 4
- why not using dropout in the wide resnet as done in the wide resnet paper? HOT 4
- In your implementation of Mean teacher, isn't the student model and the teacher model the same? HOT 1
- ModuleNotFoundError: No module named 'libml' HOT 1
- Project dependencies may have API risk issues HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from mixmatch.