Giter VIP home page Giter VIP logo

picking-instruction's Introduction

PFN Picking Instructions for Commodities Dataset (PFN-PIC)

This dataset is a collection of spoken language instructions for a robotic system to pick and place common objects. Text instructions and corresponding object images are provided.

Download (dataset-main.zip)

We consider a situation where the robot is instructed by the operator to pick up a specific object and move it to another location: for example, Move the blue and white tissue box to the top right bin.

An example of image

This dataset consists of RGBD images, bounding box annotations, destination box annotations, and text instructions.

dataset
├── en.train.jsonl
├── en.validation.jsonl
├── ja.train.jsonl
├── ja.validation.jsonl
├── image_file/
    ├── 1.png
    ├── 2.png
    ├── ....
    └── 1180.png

All objects in each image are annotated with bounding boxes. Each bounding box is associated with a destination box and text instructions. In addition to RGB images, depth images are also available in PCD (Point Cloud Data) file format. Since the PCD files are relatively large (17GB), we provide them upon request. Please create a GitHub issue for the request.

The bounding box annotations, destination box annotations, and text instructions are provided in en.train.jsonl, en.validation.jsonl, ja.train.jsonl, ja.validation.jsonl which are all in JSON Lines text file format. Each line of these files represents the annotations for one image. We recommend to use jq or other JSON tools for pretty-printing.

$ jq -r '.' dataset/en.train.jsonl | head -30
{
  "image_file": "1.png",
  "pcd_file": "1.pcd",
  "objects": [
    {
      "dest_box": "tl",
      "bbox": {
        "x": 649.7302,
        "y": 654.038,
        "width": 171.1864,
        "height": 235.914
      },
      "instructions": [
        "Put the green package next to the mustard in the first box on the left with the white circle.",
        "pick up the green sachet and put it in the upper left box",
        "Move the green and white package with asian scripture to the top left box."
      ]
    },
...

Citation:

  • [English] Jun Hatori, Yuta Kikuchi, Sosuke Kobayashi, Kuniyuki Takahashi, Yuta Tsuboi, Yuya Unno, Wilson Ko, Jethro Tan. Interactively Picking Real-World Objects with Unconstrained Spoken Language Instructions, Proceedings of International Conference on Robotics and Automation (ICRA2018), 2018. ICRA Best Paper Award on Human-Robot Interaction (HRI), project page, paper content on arxiv (The first 6 authors are contributed equally and ordered alphabetically.)
  • [Japanese] 羽鳥 潤, 菊池 悠太, 小林 颯介, 高橋 城志, 坪井 祐太, 海野 裕也, Wilson Ko, Jethro Tan. 実世界におけるインタラクティブな物体指示, 言語処理学会第21回年次大会(NLP2018), 2018. (最初の6人は全員筆頭著者であり貢献度に差はない)

Statistics

file name #image #bounding box #instruction
en.train.json 1060 25500 71701
en.validation.json 20 353 898
ja.train.json 1060 25500 76551
ja.validation.json 20 383 1149

Note that since some of the annotations include misspelling and do not appropriately specify target objects in the English validation set, we manually reviewed all the text instructions in the validation set and removed inappropriate instructions.

Terms of Use

The images and annotations in this dataset belong to Preferred Networks, Inc. and are licensed under a Creative Commons Attribution 4.0 License.

creative commons logo

THIS IMAGES AND ANNOTATIONS ARE PROVIDED "AS IS" AND NO REPRESENTATIONS OR WARRANTIES OF ANY KIND CONCERNING THE IMAGES AND ANNOTATIONS, WHETHER EXPRESS, IMPLIED, STATUTORY, OR OTHER, ARE MADE. THIS INCLUDES, WITHOUT LIMITATION, WARRANTIES OF TITLE, MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE, NON-INFRINGEMENT, ABSENCE OF LATENT OR OTHER DEFECTS, ACCURACY, OR THE PRESENCE OR ABSENCE OF ERRORS, WHETHER OR NOT KNOWN OR DISCOVERABLE. IN NO EVENT SHALL THE COPYRIGHT HOLDER BE LIABLE ON ANY THEORY OR OTHERWISE FOR ANY DIRECT, INDIRECT, INCIDENTAL, SPECIAL, CONSEQUENTIAL, PUNITIVE, EXEMPLARY, OR OTHER LOSSES, COSTS, EXPENSES, OR DAMAGES (INCLUDING, BUT NOT LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES; LOSS OF USE, DATA, OR PROFITS; OR BUSINESS INTERRUPTION) ARISING OUT OF THIS PUBLIC LICENSE OR USE OF THE IMAGES AND ANNOTATIONS EVEN IF THE COPYRIGHT HOLDER HAS BEEN ADVISED OF THE POSSIBILITY OF SUCH LOSSES, COSTS, EXPENSES, OR DAMAGES.

picking-instruction's People

Contributors

soskek avatar yuutat avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Forkers

yuutat yinlang832

picking-instruction's Issues

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.