Giter VIP home page Giter VIP logo

vidat's People

Contributors

davidzhang73 avatar dependabot[bot] avatar sgould avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar

vidat's Issues

Detailed cursor shape

It would be good also if a symbol appears on a corner of the bounding box for selecting that box and after selecting the box, the color of the box change, and then one can use "Delete" on the keyboard to delete the box or use arrow keys to move it. Also, for resizing, it would be good if the mouse cursor changes to some diagonal arrow symbol on the corners.

Video tutorials.

Video tutorials on how to use the tool for different annotation tasks.

Transcripts with video

It would be nice to get the option of aligning transcripts with the video. Something like this, please take a look at the video or demo, https://berndhuber.github.io/bscript/. So there is a synchronization between the transcript or script and the time in the video when the person says that word. It would be nice not just for tasks similar to the multi-modal opinion mining in video reviews but also video dense captioning.

Exporting of keyframes

Export keyframes as jpeg or png images so that they can be processed independently and to ensure timeframe and frame index align (for these frames) in other tools.

SQL database

I would like to have the annotations in a SQL database. Although, in the end, we can share JSON files with the community, I would prefer SQL for the trackability of the annotation process, scalability, simplicity, and order. sqlite3 it is a good and easy option.

Flexible keyframes

I would like to import a list of keyframes from a file. Using one frame every second or so is just one alternative for keyframes selection, and it depends on what we want to do. For example, in the Spatio-temporal graph, I used Laplace variance for keyframe selection. Essentially, Keyframes needs to be flexible.

Cache frames instead of use video element

  • Use video info panel to cache frames.
    • A video element to load the video and a canvas element to save the frames to Vuex.
    • Code can be reused.
  • Delete the video in canvas panel, and re-layout.

General tidy up and error checking

Refactor javascript into modules. Report errors and show warnings as status text (flash messages). Warn when deleting. Warn when leaving page without saving.

jQuery and bootstrap

Should we use jQuery and bootstrap to simplify code, improve robustness and interface consistency? Or will this just add unnecessary complexity.

Amazon Mechanical Turk platform

I certainly believe that the current prototype of the annotation tool covers or will cover (after some iterations) everything that we need for doing the spatial, temporal, and natural-language annotation for the research on video analysis. However, I would like to point out some things about the use of this tool in the Amazon Mechanical Turk platform.

It is important to consider the annotation tool for two purposes. Of course, the annotation process itself and the evaluation of the annotators. Turkers want to get their payment as soon as they can. They want to receive their payment between two or three days. If we want to assess the annotations in time. We need a tool that facilitates the evaluation of annotations and annotators. It is for that reason that I believe this tool should have two purposes in mind.

Keyboard shortcuts support

  • h :: help (this window): Will not be implemented
  • p :: play video segment left panel only
  • comma, < :: advance to next keyframe
  • period, > :: retreat to previous keyframe
  • [, { :: select left slider: Cannot be focused due to it is Vue component and focus is not implemented
  • ], } :: select right slider: Cannot be focused due to it is Vue component and focus is not implemented
  • left-arrow :: previous frame (when slider is selected)
  • right-arrow :: next frame (when slider is selected)
  • page-up :: jump to previous 10% of video (when slider is selected)
  • page-down :: jump to next 10% of video (when slider is selected)
  • plus (+) :: add a new segment
  • delete :: delete currently active object
  • shift :: duplicate currently active object when mouse down
  • tab :: move to next field when editing objects

More compact layout

Snipaste_2020-08-03_20-17-28

  • Make the video info panel more compact.
  • Use dense mode in Quasar.
  • Remove the video element or hide to cache frames.

'+' cursor for ready to draw

According to the bounding box annotation, it would be good if the mouse cursor shape changes to + when it is ready to draw a bounding box.

Show/hide annotation types

Allow user to show and hide different annotation types (e.g., hide bounding boxes when labelling human pose).

Preference settings from URL query parameters

Related to #20, At least preferences for showing/hiding GUI components should be be included in the first release (e.g., hide bounding box annotations if we're asking workers to label video segments).

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.