Giter VIP home page Giter VIP logo

Comments (7)

HuangOwen avatar HuangOwen commented on May 28, 2024

Hi @BestSongEver

I'm not sure if you change some hyperparameter in ult.py. Make sure the learning rate, dropout rate, cosine learning rate decay is the same with our paper. If you want to use the model for inferencing, you can just download it via google drive link.

from transferable-interactiveness-network.

BestSongEver avatar BestSongEver commented on May 28, 2024

@HuangOwen Well, i didn't change any hyperparameter, just modify num_iteration from 20000 to 300000 (following iCAN)...By the way, do 20000 iterations perform good in ur code? 300000 are too much more than 20000.

I will try again and update the issue if i solve the problem.

from transferable-interactiveness-network.

HuangOwen avatar HuangOwen commented on May 28, 2024

@BestSongEver I got your point. We actually do not train our model from scratch. Instead, we finetune the model with some weight initialized from iCAN best model, you should download it and make sure you're initializing from the right weight. 20000 iter is enough for fine-tuning but apparently insufficient for state-of-the-art performance if you're training from scratch. : )

from transferable-interactiveness-network.

BestSongEver avatar BestSongEver commented on May 28, 2024

@HuangOwen Thanx for ur reply.
I tried the initialization with iCAN best model and got a better result.
Problem solved! Thanx again.

from transferable-interactiveness-network.

BestSongEver avatar BestSongEver commented on May 28, 2024

Hi @HuangOwen
Hi again. I am going to cite ur brilliant paper into my work.
Before that, I am devoting to reimplement ur result in "RPdCd" Mode with my GPU.
But i got some questions:
1.In my understanding, take VCOCO as an example, "60000_TIN_VCOCO_D.pkl" is the output result from binary discriminator in RPt2Cd Mode. It is the "Interactiveness Knowledge" in ur paper. Is that right? So,where did this file come from ur code?
2.with ur "60000_TIN_VCOCO_D.pkl", i can get the result in RPt2Cd Mode, but how can i get ".pkl" result of RPdCd Mode?
3. Besides, In Transfer Learning Modes, P can learn interactiveness knowledge across datasets, could u please tell me how the algorithm "across datasets"? Which means how to combine the Knowledge from Dataset 1 and Dataset 2 in training ?

Thank u many times!

from transferable-interactiveness-network.

HuangOwen avatar HuangOwen commented on May 28, 2024

Hi @BestSongEver sorry for the late reply

  1. This TIN_VCOCO_D is trained with another file with different network architecture, we are still sorting out the code of that.
    2/3 Our core insight is that we train P on various datasets because P is transferable and C is not transferable (action definition of VCOCO and HICO is different). To train a P 'across datasets' we mean you just enlarge the training data to more dataset, not restricted to VCOCO, when you're training P.

from transferable-interactiveness-network.

BestSongEver avatar BestSongEver commented on May 28, 2024

@HuangOwen Got it.
So, until now, I can only get the result in RPt2Cd Mode, not in RPdCd Mode.
Maybe u will release ur "TIN_VCOCO_D" training code within the next few days ?
I am excited and looking forword to reimplement the result with ur code.
Thx again.

from transferable-interactiveness-network.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.