kavindie Goto Github PK
Type: User
Type: User
[VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
Open-source simulator for autonomous driving research.
Official pytorch repository for CG-DETR "Correlation-guided Query-Dependency Calibration in Video Representation Learning for Temporal Grounding"
Tensorflow implementation of embed CNN-LSTM network for sentiment analysis task.
Training framework for conditional imitation learning
ROS nodes for controlling and monitoring a differential drive robot.
Who Let The Dogs Out? Modeling Dog Behavior From Visual Data https://arxiv.org/pdf/1803.10827.pdf
Code release for "EgoVLPv2: Egocentric Video-Language Pre-training with Fusion in the Backbone" [ICCV, 2023]
EILEV: Efficient In-Context Learning in Vision-Language Models for Egocentric Videos
Software design & development with AI
Code for Data61's tutorial on Graph Representation Learning
A toolkit for developing and comparing reinforcement learning algorithms.
Assignments for Berkeley CS 285: Deep Reinforcement Learning (Fall 2022)
ImageBind One Embedding Space to Bind Them All
Repository to store conditional imitation learning based AI that runs on CARLA.
[NIPS 2017] InfoGAIL: Interpretable Imitation Learning from Visual Demonstrations
InternVideo: General Video Foundation Models via Generative and Discriminative Learning (https://arxiv.org/abs/2212.03191)
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
Python Implementations of the Kalman Filter with Conctant Velocity model
γICLR 2024π₯γ Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment
LAVIS - A One-stop Library for Language-Vision Intelligence
[NeurIPS'23 Oral] Visual Instruction Tuning: LLaVA (Large Language-and-Vision Assistant) built towards GPT-4V level capabilities.
MELTR: Meta Loss Transformer for Learning to Fine-tune Video Foundation Models (CVPR 2023)
Models and examples built with TensorFlow
Useful datasets and links
A declarative, efficient, and flexible JavaScript library for building user interfaces.
π Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. πππ
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google β€οΈ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.