Giter VIP home page Giter VIP logo

Comments (5)

dongwhfdyer avatar dongwhfdyer commented on August 10, 2024

now i know it. look at the llava project. you would find the two-stage weight-loading methods. if anyone still don't know, contact me

from geochat.

lx709 avatar lx709 commented on August 10, 2024

Thanks, @dongwhfdyer , I already figured it out.

from geochat.

kartikey9254 avatar kartikey9254 commented on August 10, 2024

now i know it. look at the llava project. you would find the two-stage weight-loading methods. if anyone still don't know, contact me

hi there , i am trying out this model and the demo worked but when i used the lora.sh script for training it displays OSError: Error no file named pytorch_ Model. bin, tf_ Model. h5, model. ckpt. index or flex_ Model. msgpack found in directory/home/LaVA/lava v1.5-13b lora . can you guide me how can i train this model ?

from geochat.

732259408 avatar 732259408 commented on August 10, 2024

@dongwhfdyer hi, In the finetune_lora.sh --pretrain_mm_mlp_adapter path/to/llava-v1.5-mlp2x-336px-pretrain-vicuna-7b-v1.5/mm_projector.bin, l have a issue. Is the mm_projector.bin file using weights from llava-v1.5-7b? I couldn't find mm_projector.bin in Geochat-7B.

from geochat.

Amazingren avatar Amazingren commented on August 10, 2024

hi @dongwhfdyer ,

It seems you already successfully reproduced this project.

I am still confused about the training procedure.

  • Do we only need to run the fintune_lora.sh and then merge the weight?
  • Or, we also need to do pretrain with pretrain.sh

It would be super nice to get some response from you.

Best and have a nice day,

from geochat.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.