Light

mintashkim / vad_rtfm Goto Github PK

View Code? Open in Web Editor NEW

0.0 1.0 0.0 173.25 MB

Python 8.95% Jupyter Notebook 91.05%

vad_rtfm's Introduction

RTFM

A famous weakly supervised learning model for video anomaly detection (VAD).

original repository

Implementation

ShanghaiTech i3d features
Revised few inconvenient codes
i3d features
- Extracted for every 16 frames
- 10 crops augmentation means torchvision TenCrop

Re-Tooling

3d features extraction: Use x3d rather than i3d, c3d which are 3D conv models
- Use torchvideo: pytorchvideo_x3d
- UCF_Crime x3d feature extraction (requires more than 24 hrs)
- Train RTFM with UCF_Crime + x3d dataset
  - Good performance!
  - Tested with new motion data (never seen): Got correct: white_check_mark:, Got correct: x:
    - normal:{drinking ✅, googling ❌, normal ✅, toilet ✅}
    - abnormal:{capture ❌, drawing ✅, writing ✅}
    - Things to notice: 1. Result changes even the data does not have big difference 2. Model works fairly good for webcam domain data even it is not trained with
    - If webcam domain data is given and weakly supervised, there is hope :)

TODO

Webcam abnormal face data
Webcam abnormal motion data

Troubleshooting

RuntimeError: Expected a 'cuda' device type for generator but found 'cpu'
- See cuda_troubleshooting.txt
No visdom error
- visdom is a web-based visualization tool
- pip install visdom
- (in a new terminal) python -m visdom.server
- can see the result by accessing via web

vad_rtfm's People

Contributors

Watchers

Recommend Projects

React

A declarative, efficient, and flexible JavaScript library for building user interfaces.
Vue.js

🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
Typescript

TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
TensorFlow

An Open Source Machine Learning Framework for Everyone
Django

The Web framework for perfectionists with deadlines.
Laravel

A PHP framework for web artisans
D3

Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

javascript

JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
web

Some thing interesting about web. New door for the world.
server

A server is a program made to process requests and deliver data to clients.
Machine learning

Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Visualization

Some thing interesting about visualization, use data art
Game

Some thing interesting about game, make everyone happy.

Recommend Org

Facebook

We are working to build community through open source technology. NB: members must have two-factor auth.
Microsoft

Open source projects and samples from Microsoft.
Google

Google ❤️ Open Source for everyone.
Alibaba

Alibaba Open Source for everyone
D3

Data-Driven Documents codes.
Tencent

China tencent open source team.