Giter VIP home page Giter VIP logo

complex-yolo's Introduction

Complex-YOLO

Complex-YOLO: Real-time 3D Object Detection on Point Clouds pytorch

Introduction

This is an unofficial implementation of Complex-YOLO: Real-time 3D Object Detection on Point Clouds in pytorch. A large part of this project is based on the work here:https://github.com/marvis/pytorch-yolo2

Point Cloud Preprocessing is based on:https://github.com/skyhehe123/VoxelNet-pytorch https://github.com/dongwoohhh/MV3D-Pytorch

Data Preparation

Download the 3D KITTI detection dataset.

Camera calibration matrices of object data set (16 MB)

Training labels of object data set (5 MB)

Velodyne point clouds (29 GB)

Train

python3 main.py

trained model(using DarkNet) download link :https://pan.baidu.com/s/1yeU3Q-Oyozv7qFqrVhzj0A

Result

complex-yolo's People

Contributors

ai-liu avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

complex-yolo's Issues

where is the cfg file

Hi, I want to convert your yolov2.weights to tensorflow .pb file, can you give me your .cfg file?

Confused with coordinate of 3D box

The function boxed_cam_to_velo in file "utils.py" projects the position of ground truth 3D bounding box to velo coordinate just with calibration matrix Tr_velo_to_cam. Does that mean the position is in camera 0 coordinate rather than rectified camera 0 coordinate? I have read some codes, such pointnet and frustum pointnets, which consider the position in rectified camera 0 coordinate.

Weight file has too few weights

Hi Al-liu,
thank you for your implementation!
According to the cfg file in your repository the model should have 46935494 trainable parameters. However, your weight file contains only 46931945 weights. That is a difference of 3549.
It would be great if you could explain.
Many thanks!

Overfitting

Hi @AI-liu ,
Using the same hyperparameters from the repo, I trained it for 200 epochs on KITTI. It overfits to the train data. After training, I checked the first 100 images output using eval.py. It only gives outputs for train dataset and not for validatation dataset. Did you use same code and hyperparams for your results in the repo?

RuntimeError

Hello everyone, in the middle of training i am facing the following issue. please help if anyone knows about this problem:
File "/media/imad/Windows/3d folder/Complex-YOLOv3/utils/utils.py", line 346, in build_targets
best_ious, best_n = ious.max(0)
RuntimeError: cannot perform reduction function max on tensor with no elements because the operation does not have an identity

Pre-trained model weights file

Could you please share a direct link to the pre-trained model weights file (for your PyTorch implementation)?
Thanks!

Error occurred when running eval.py

Hello,
Thank you for sharing such a good implentation of Complex-YOLO :-)

after run python3 main.py, I got 20's of ComplexYOLO_epochxxx
and I ran python3 eval.py after modifying data path but Error occurred like below

Traceback (most recent call last):
  File "eval.py", line 93, in <module>
    target, target_num= get_target(label_file, calib['Tr_velo2cam'])
ValueError: too many values to unpack (expected 2)

any idea to resolve it?

Custom dataset augmentation

hi @AI-liu thanks for the wonderful code . Just had a few queries

  1. are there any steps how to augment the data(custom)
  2. if i am using augmentation which feature of augmentation should i choose

Thanks in advance

something about the data

Thank for your source code and can we use the network directly to deal with the single-line radar point cloud data?

Evaluation

Hi @AI-liu ,
Did you evaluate mAP on the KITTI? were you able to get the scores mentioned in the paper?

Annotation Tool

Hello everyone, can someone please suggest me point cloud annotation tool to label my lidar data with output similar to Kitti dateset format....
Thank you.

Model files after training

Thank you very much for the code you provided, and I've been working on point cloud detection recently, so I sincerely hope to get the model files after your training. Thank you very much.

training on own data: classes

How do I train on own data, I already generated data/labels, but in my case I have 3 classes. So I decided to run and changed the

region_loss = RegionLoss(num_classes=8, num_anchors=5)
to
region_loss = RegionLoss(num_classes=3, num_anchors=5)
and got an error:
RuntimeError: shape '[12, 5, 10, 16, 16]' is invalid for input of size 230400

seems like second and third element of the output shape should be equal to 75 if multiplied together. But not sure what should I change to introduce different amount of classes.

Pre-trained model was canceled

Hi,
Thanks for sharing this amazing work,
but it seems the pre-trained model share link expired and was cancelled,
Is it convenient to provide a link?
Thank you very much!

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.