
Comments (2)

Beomi commented on June 1, 2024

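When training in fp16, you can keep everything in fp16, including the model weights and gradients,
or you can keep the model weights in fp32 (mixed precision, e.g. AMP).

Choose whichever fits your training environment.
(Usually, if resources allow, the model weights are kept in fp32.)

In the case you mentioned, you were hitting OOM, so I suggested trying to load the model in fp16 as well :)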
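
As a rough illustration of why loading the weights in fp16 can help with OOM, here is a minimal sketch of the weight-memory arithmetic. It only counts the weights themselves (fp16 uses 2 bytes per value, fp32 uses 4); gradients, optimizer state, and activations are not included. The parameter count and the helper name `weight_memory_gib` are hypothetical, chosen only for this example.

```python
# Bytes per element for each floating-point format.
BYTES_PER_PARAM = {"fp16": 2, "fp32": 4}


def weight_memory_gib(num_params: int, dtype: str) -> float:
    """Return the memory needed to hold the model weights alone, in GiB."""
    return num_params * BYTES_PER_PARAM[dtype] / 2**30


# Hypothetical model size, used only for illustration.
num_params = 1_000_000_000

fp32_gib = weight_memory_gib(num_params, "fp32")
fp16_gib = weight_memory_gib(num_params, "fp16")

print(f"fp32 weights: {fp32_gib:.2f} GiB")
print(f"fp16 weights: {fp16_gib:.2f} GiB")
```

Under these assumptions the fp16 weights take half the memory of the fp32 weights, which is the saving referred to above. The cost is reduced numerical precision in the weights, which is why fp32 weights are usually preferred when memory allows.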

from koalpaca.

SeongBeomLEE commented on June 1, 2024

Thank you for the answer!

