valeoai / woodscape Goto Github PK

View Code? Open in Web Editor NEW

597.0 47.0 122.0 30.12 MB

The repository containing tools and information about the WoodScape dataset.

Home Page: https://woodscape.valeo.com/dataset

Python 100.00%

dataset

woodscape's People

Contributors

Stargazers

Watchers

Forkers

osupcvlab hanahimi 3d-a xurannlpr gaoqiangwu mojojojooo helloparanoidandroid drajashe imdsafi09 ohhandsome xiaoyue-wang githubfragments rvarun7777 xingfuhoumei yujmo xjp99v5 liuguoyou yangcyz blun7 yongliang-qiao eungicho herobinbin lijihw08 jarsam weili1457355863 terryfic5 xuyanging pgdtgq huangwgang nada-baili hust-wayne zjsun7 yunhaoqiao hoang-phuong-nguyen pynancy rqbrother tanutarou keymanchen1215 atlasgooo2 diagonal404 you-old louisallen7 smashkoala fangchengji apollo5217 sichitong painkhw jquan7 zhangsongdmk xiaoyuxiaoer key-word phongphuhanam lukechencqu zhaoxiaolong2020 jie311 binianzjl atlasgooo yinianwujide ruijieren98 e18301194 greenni1988 mediumcore hiyx sinabaniasadazad david-zd blowfish9423 qwer9835 linh-huubui wpfhtl rangeliu shubhamchaudhari1996 navrobot streamwill k-me-j magicpixel-dataset neophack dannyway03 li-lindong churry-lee arslan-z blinkblade woniu0703 binary1215 briandeegan82 ms-krajesh ben00123789 dianimulan tommyanqi deepbehavier arctanbell sunsongmu zivzone ai-jie01 yuanxiaode rongmiq jinjinhong sadjadrz aluckyi chengwei920412 marenan

woodscape's Issues

Dataset Acquisition

Hello! I have followed the papers of your team, it was an excelltent work! Then, when will you release the dataset WoodScape? Hope for your reply!

How to convert bbox_2d_annotations and segmentation annotations from fisheye-style format to cylindrical format?

Hi:
I haved changed fisheye image to cylindrical image and then I need to change the annotations correspondingly. So I used this code
world_points = fisheye_cam.project_2d_to_3d(fisheye_points, norm=np.ones(fisheye_points.shape[0])) cyl_points = cylindrical_cam.project_3d_to_2d(world_points), which fisheye_points means the coordinates of 2d bounding boxes on raw fisheye images， cyl_points means ones on cylindrical images. But I got a unreasonable result when I visualized the annotations. Is there anything wrong with my code?
Hope for reply! Thank you!

About calibration code

Hi, I am trying your calibration code (projection.py) on fisheye images. I found the distortion coefficients k1-k4 are very different from the standard format in OpenCV (their number is smaller than 1, while yours is hundreds). Could your please share your calibration code and how to correspond your coeff with Opencv's?

some question in DirtyGAN

hello！Here is a question about your latest paper DirtyGAN.
In this paper, how does the mask generated by VAE guide cycleGAN to generate the stains corresponding to the mask？
Looking forward to your relpy !!!

Green strips

Hello,

Thank you very much for sharing this amazing dataset! I noticed that there are two green strips, one at the top and one at the bottom, in each RGB image. They are a few pixels wide and contain some differently colored pixels as well. In the instance annotations they are labeled as green_strip. Could you explain what the purpose of these strips is? Can I just crop them during training?
I’m looking forward to your answer.

Best,
Jan

TypeError: 'Tupperware' object is not callable

I am trying to run the program but keep getting a TypeError. You can see the logs down below. I am executing the code in Google colab and all files are on Google Drive. I also tried running it locally on my machine but ran into the same TypeError. Any hints or solutions?

{
    "train": "detection",
    "dataset_dir": "/content/gdrive/MyDrive/WoodScape-master/omnidet/WoodScape_ICCV19",
    "train_file": "/content/gdrive/MyDrive/WoodScape-master/omnidet/data/train.txt",
    "val_file": "/content/gdrive/MyDrive/WoodScape-master/omnidet/data/val.txt",
    "test_file": "/content/gdrive/MyDrive/WoodScape-master/omnidet/data/test.txt",
    "output_directory": "/content/gdrive/MyDrive/WoodScape-master/omnidet/data/output",
    "model_name": "res18_baseline",
    "dataset": "woodscape_raw",
    "input_height": 288,
    "input_width": 544,
    "network_layers": 18,
    "pose_network_layers": 18,
    "frame_idxs": [
        0,
        -1
    ],
    "pose_model_type": "separate",
    "pose_model_input": "pairs",
    "rotation_mode": "euler",
    "num_scales": 4,
    "crop": true,
    "disable_auto_mask": false,
    "ego_mask": true,
    "reconstr_weight": 0.15,
    "ssim_weight": 0.85,
    "smooth_weight": 0.001,
    "clip_loss_weight": 0.5,
    "semantic_num_classes": 10,
    "semantic_loss": "focal_loss",
    "semantic_class_weighting": "woodscape_enet",
    "motion_class_weighting": "motion_enet",
    "motion_loss": "focal_loss",
    "siamese_net": true,
    "num_classes_detection": 5,
    "classes_names": [
        "vehicles",
        "person",
        "bicycle",
        "traffic_sign",
        "traffic_light"
    ],
    "detection_conf_thres": 0.8,
    "detection_nms_thres": 0.2,
    "anchors1": [
        [
            24,
            45
        ],
        [
            28,
            24
        ],
        [
            50,
            77
        ]
    ],
    "anchors2": [
        [
            52,
            39
        ],
        [
            92,
            145
        ],
        [
            101,
            69
        ]
    ],
    "anchors3": [
        [
            52,
            39
        ],
        [
            92,
            145
        ],
        [
            101,
            69
        ]
    ],
    "batch_size": 22,
    "num_workers": 6,
    "epochs": 125,
    "learning_rate": 0.0001,
    "scheduler_step_size": [
        100,
        110
    ],
    "min_distance": 0.1,
    "max_distance": 100.0,
    "log_frequency": 300,
    "val_frequency": 300,
    "save_frequency": 20,
    "weighing": "enet",
    "num_classes": 10,
    "pretrained_weights": "/content/gdrive/MyDrive/res18",
    "models_to_load": [
        "detection"
    ],
    "onnx_model": "omnidet",
    "opset_version": 12,
    "model_summary": true,
    "init_weights": true,
    "video_name": "norm",
    "model_path": "/content/gdrive/MyDrive/res18",
    "onnx_export_path": "/content/drive/MyDrive/export/",
    "onnx_load_model": "/content/gdrive/MyDrive/res18/onnx/omnidet_float32_opset12.onnx",
    "device": "cuda:0",
    "cuda_visible_devices": "0",
    "use_multiple_gpu": false
}
=> Clean up the log directory?y
=> Cleaned up the logs!
=> Training on the WOODSCAPE_RAW dataset 
=> Training model named: res18_baseline 
=> Models and tensorboard events files are saved to: /content/gdrive/MyDrive/WoodScape-master/omnidet/data/output 
=> Training is using the cuda device id: 0 
=> Loading woodscape_raw training and validation dataset
/usr/local/lib/python3.7/dist-packages/torch/utils/data/dataloader.py:481: UserWarning: This DataLoader will create 6 worker processes in total. Our suggested max number of worker in current system is 4, which is smaller than what this DataLoader is going to create. Please be aware that excessive worker creation might get DataLoader running slow or even freeze, lower the worker number to avoid potential slowness/freeze if necessary.
  cpuset_checked))
=> Total number of training examples: 8029 
=> Total number of validation examples: 205
=> Loading model from folder /content/gdrive/MyDrive/res18
Loading detection weights...
Cannot find {} weights so {} is randomly initialized
Traceback (most recent call last):
  File "/content/gdrive/MyDrive/WoodScape-master/omnidet/main.py", line 109, in <module>
    main()
  File "/content/gdrive/MyDrive/WoodScape-master/omnidet/main.py", line 76, in main
    model = DetectionModel(args)
  File "/content/gdrive/MyDrive/WoodScape-master/omnidet/train_detection.py", line 277, in __init__
    self.pre_init()
  File "/content/gdrive/MyDrive/WoodScape-master/omnidet/train_detection.py", line 94, in pre_init
    self.save_args()
  File "/content/gdrive/MyDrive/WoodScape-master/omnidet/utils.py", line 135, in save_args
    yaml.dump(to_save, f)
  File "/usr/local/lib/python3.7/dist-packages/ruamel/yaml/main.py", line 1380, in dump
    block_seq_indent=block_seq_indent,
  File "/usr/local/lib/python3.7/dist-packages/ruamel/yaml/main.py", line 1321, in dump_all
    dumper._representer.represent(data)
  File "/usr/local/lib/python3.7/dist-packages/ruamel/yaml/representer.py", line 80, in represent
    node = self.represent_data(data)
  File "/usr/local/lib/python3.7/dist-packages/ruamel/yaml/representer.py", line 103, in represent_data
    node = self.yaml_representers[data_types[0]](self, data)
  File "/usr/local/lib/python3.7/dist-packages/ruamel/yaml/representer.py", line 321, in represent_dict
    return self.represent_mapping('tag:yaml.org,2002:map', data)
  File "/usr/local/lib/python3.7/dist-packages/ruamel/yaml/representer.py", line 214, in represent_mapping
    node_value = self.represent_data(item_value)
  File "/usr/local/lib/python3.7/dist-packages/ruamel/yaml/representer.py", line 107, in represent_data
    node = self.yaml_multi_representers[data_type](self, data)
  File "/usr/local/lib/python3.7/dist-packages/ruamel/yaml/representer.py", line 447, in represent_object
    reduce = data.__reduce_ex__(2)
TypeError: 'Tupperware' object is not callable

the usage of previous images

The readme says “previous images needed for geometric tasks”。
Can you give me some examples about the geometric tasks?

Depth ground-truth missing

Hi guys,

Thanks for releasing the dataset. It's very useful for my research with fisheye cameras.

I'm trying to implement your paper FisheyeDistanceNet on depth estimation with Woodscape Dataset, but I didn't find Fisheye Images' ground-truth depth map or sparse lidar ground-truth in the current dataset.

Will you release the GT for depth soon?

Best,
Weiheng

Detection and semantic task, the detection mAP is 11.3 and does not reach the mAP64 mentioned in the paper?

Hi,

the train setting is same as your params.yaml，the semantic task iou is 58.7. How should I do to imporve the performance?

Looking forward to your reply, thanks!

converting fisheye to rectilinear image

The code shared in projection.py gives cylindrical image as output. I am trying to convert the fisheye image to rectilinear image. Can someone guide me on how to do that.

Can you provide the camera intrinsic parameters ?

Cannot download rgb_images.zip from google drive

Hi guys

Thanks for providing the dataset!
I had a problem when I tried to download the rgb_images.zip from google drive.

I tried a few times and every time it failed with this forbidden error.

Best regards,
Xinchao

About OmniDet loss function

Thanks for your release of this dataset.

I read the OmniDet paper and find that you use auto-encoder to abstract feature to form feature-metric loss.

Question One:
figure2 in your paper show that this feature is abstracted from the last output layer of decoder instead of from that of encoder, which is confusing. Is this just a drawing mistake or this feature Ft is really taken from the decoder?

Question Two:
The loss function for autoencoder only contain L_dis and L_cvt, but not L_reconstruction for autoencoder, which is different from that of "Feature-metric Loss for Self-supervised Learning of
Depth and Egomotion". is this just a writing mistake?

Hope you can solve these two question that puzzle me a lot

Why shared encoder is resnet instead of self-attention?

To start with, Thanks for this innovative job;
i noticed the shared encoder in your paper is self-attention, but code is resnet, why replace it?
thanks a lot if you reply to me

When will leaderboards be open?

I hope you can provide a leaderboard, and researchers can show their result.

Is this dataset suitable for benchmarking visual slam?

Hi,

Thanks for your kindness in sharing the data. The paper says, it contains several videos for benchmarking visual odometry/slam. However, I could not find suitable video data in your current description. Am I missing something?

Deshun

Asking about Camera Geometry Tensor

First of all, many thanks for your work.

I have investigated your implementation for OmniDet but I couldn't find the usage of Camera Geometry Tensor Ct (cam_conv=False). Are there any reasons for it?

Thank you once again!

mismatch in quaternion data for extrinsic parameters

The sample data shared in sample application and calibration readme files mention extrinsic rotation data in quaternion whereas the calibration data available released contains extrinsics similar to euler format.

How can i use this new data for getting rotation matrix?

Best Regards
Tahera

Dataset for same instance or different

Does this dataset contain front, rear, left and right view images of same instance or different instances?

some doubts about your dirtygan

HI, Some doubts about the way you make use of generated masks. In fact, the C2S generator(you mentioned in paper) takes in both clean image and generated mask, but how does two inputs been trained with C2Sgenerator? concat them or dot or other ways?

Explanation of vehicle_data

Thanks for the great work.

I couldn't find any explanation of vehicle_data(*.json). Could you provide the units of those values? (e.g. ego_yawRate)

semantic segmentation questions

Thank you for the released data.
In woodscape paper, you provide semantic segmentation results of ENet as the baseline. Do you train and test only using 10 classes , consistent with the semantic tags you provided?
Besides, the github readme says the remaining 1.8K test samples are held out for a benchmark. Will it be an online testing platform like cityscape? When is it released?

Are there video data?

Dear authors,

Could you please provide fisheye videos data?

Many thanks!

Is this paper a scam

Group of bicycles & motorcycles annotated as vehicles

Hi, in the annotation, group of bicycles / motorcycles are annotated as vehicles. Do you guys plan to correct this, or move them to something like group of bicycles / motorcycles?

The confusion of real-world length calculations

How to calculate the length of the real world corresponding to 1 offset of the x-axis, y-axis, and z-axis in the world coordinate system (world_points) ？:<

And how to calculate the length of the real world corresponding to 1 pixel in the x-axis and y-axis in screen_points ?
:<

I'm going crazy :< It puzzles me for a long time.
Thank you !

CANNOT download semantic segmentation annotations

First of all, thank you for sharing this invaluable dataset in public.

I am trying to download the semantic segmentation annotations, but it fails.

I used this link 'https://drive.google.com/uc?export=download&id=1nNJMKz2Nx6pj6L2xlj_AWBhK6BGECSRD' and got the error like

Could you take a look at this problem?

Lidar Ground Truth

Hello Guys,

My current research is based upon Fisheye cameras and WoodSpace dataset seems to be totally aligned with my research.
I wanted to ask whether you guys are plannning to release the lidar data as well anytime soon?

Regards,
Hamza

the structure of "motion_annotations" directory download from google drive does not match with that described in readme file

the structure of "motion_annotations" directory download from google drive does not match with that described in readme file.
And I could you add more explanation to the motion data not only annotation but also the source paired images. For example, which directory is the training data in the Data organization?

Hi, single detection task, the mAP is very low,what's wrong?

For the single detection task, the mAP is very low and does not reach the mAP63 mentioned in the paper. May I ask what is wrong?

the train setting: single detection task, train_file is train.txt, val_file is val.txt, input height is 576, input_width is 1088, num_classes_detection is 5, batch_size 24, epoch 20, learning_rate 0.0001, models_to_load: encoder and detection, pretrained_weihts is res50, use two GPU of 48G, the frame_idxs is [0], the other setting is same.

the result is：mAP 0.199 at step 3000 on 8 epoch, mAP per class in order [0.303, 0.265, 0.17, 0.105, 0.154]. The final loss is 1.26.

Looking forward to your reply, thanks!

A paper proposing a dataset accepted by ICCV yet dataset not released after two years, what a joke.

Point cloud from fisheye depth image

Hi,

I found multiple "project_2d_to_3d" functions in scripts/calibration/projection. To convert the inferred fisheye depth image (like the image below in your data) into a point cloud, which camera model and function exactly should be used? Thanks!

Dataset release

It is going to be 2021Q1 several days later. When to release the fisheye dataset ? Thank you very much.

Lidar ground truth

Hi guys,
I am trying to implement your paper FisheyeDistanceNet. I found that there is no lidar ground truth information available.
When is the lidar ground truth likely to be released?

With regards,
Ajith

Dataset Release

Dear authors, is the Dataset released now? 2020.4.11

Thanks very much if you can provide the link of Dataset as we are very interested in it for further research.

3D bounding box detection annotations

Hi, may I know when will the annotations of 3D bounding boxes be released?

apply for dataset

The time is Q4 in 2020 . So when to release the fisheye dataset ? Thank you

dataset release

I'm very interested in your dataset. When will it be available? It was mentioned that it would be released in Q4 2020 in this Link .Will you release it on time?

Point cloud and camera pose

Hi @senthil-yogamani,

I am really fascinated by your data and I wondering if you are planning to share the point cloud and the camera pose soon?

Thank you

cylindrical to fisheye

Hi, thanks for contributing the dataset and sharing the utility codes.

The fisheye->cylindrical code works well, however, is it possible to convert the cylindrical image back to the fisheye image?

I have tried to project cylindrical cam to fisheye cam, but the conversion is super slow and the results are not good now.

I will appreciate any ideas regarding this issue. Thanks.

how can get intrinsic calibration k1、k2、k3、k4

how can get intrinsic calibration k1、k2、k3、k4?
thank you!

How to apply this dataset to the SLAM?

Hi, I wonder how I can apply the dataset cause there is no time stamp for each photo and the photo seems not be continuously taken in the time order. And I cannot find the video for the LSD-SLAM mentioned in your paper. Thank you.

Fisheye camera calibration method

Hi, thanks for the great work!
I have read the readme and example for calibration, and would like to produce the same calibration files/parameters as provided in the WoodScape dataset for my own collected fisheye images.
Could you provide more details on how the calibration is done, and if possible some links to the model and method used for the calibration. Thank you!

How can we get access to the data?

dataset

Hello,
I am interested to use the dataset for my research. When will it be available for use.

A small question about the fisheye calibration method

First of all, I really appreciate your great contribution for providing this dataset to the community!

Question: What is the meaning of adding +0.5, -0.5 in scripts/calibration/projection.py?

145        self._principle_point = 0.5 * self._size + np.array([principle_point[0], principle_point[1]], dtype=float) - 0.5

153    cx_offset = property(lambda self: self._principle_point[0] - 0.5 * self._size[0] + 0.5)
154    cy_offset = property(lambda self: self._principle_point[1] - 0.5 * self._size[1] + 0.5)

What is the timeline for dataset release?

two months past after the announcement of the release and iccv paper accepted...

Error in running semantic_map_generator

Trying to generate the semantic map. Getting this error.

!python /content/WoodScape/scripts/semantic_map_generator.py --src_path /content/instance_annotations/ --dst_path /content/semantic_annotations/ --semantic_class_mapping /content/WoodScape/scripts/configs/semantic_mapping_9_classes.json --instance_class_mapping /content/WoodScape/scripts/mappers/class_names.json

Error I am getting: