Giter VIP home page Giter VIP logo

Comments (4)

hollance avatar hollance commented on May 30, 2024 1

These are the widths and heights of 5 "anchor" boxes (also known as prior boxes or default boxes). Recall that for every cell in the 13 x 13 grid we predict 5 bounding boxes? What we predict are actually widths and heights relative to these anchor boxes.

The reason YOLO uses these anchor boxes is as a hint to the neural network that most of the predictions will have one of these shapes. These particular anchor boxes were chosen by the authors of YOLO using clustering on the Pascal VOC dataset to find the most common object shapes. Using such anchor boxes is common in many object detection models.

from yolo-coreml-mpsnngraph.

hollance avatar hollance commented on May 30, 2024 1

It's constant for the training dataset used, in this case Pascal VOC. These anchors were calculated just once, before YOLO was trained. (The YOLO9000 paper explains how they did this in more detail.)

If you're training YOLO on a very different dataset with different shaped objects, you may need to calculate different anchors.

Turi Create, for example, also uses YOLO for object detection but has more and different anchors, presumably to deal with a larger variety of objects.

from yolo-coreml-mpsnngraph.

izdi avatar izdi commented on May 30, 2024

Is this constant value or somehow calculated? If the last then are you familiar with approaches to do so ?

from yolo-coreml-mpsnngraph.

izdi avatar izdi commented on May 30, 2024

Thanks, this is very useful information !

from yolo-coreml-mpsnngraph.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.