ytep-zhi
Name: Jiazhi Yang
Type: User
Company: @OpenDriveLab
Bio: Researcher @OpenDriveLab.
Location: Shanghai, China
[ECCV 2022] This is the official implementation of BEVFormer, a camera-only framework for autonomous driving perception, e.g., 3D object detection and semantic map segmentation.
Implementation for CenterFormer: Center-based Transformer for 3D Object Detection (ECCV 2022)
PyTorch implementation of DiffusionDet (https://arxiv.org/abs/2211.09788)
PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
The official implementation of "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
Code release for "HAL: Improved Text-Image Matching by Mitigating Visual Semantic Hubs" at AAAI 2020.
Fine-tuning LLaMA to follow instructions within 1 Hour and 1.2M Parameters
Using Low-rank adaptation to quickly fine-tune diffusion models.
Unofficial PyTorch implementation of Masked Autoencoders Are Scalable Vision Learners
Code release for "Masked-attention Mask Transformer for Universal Image Segmentation"
MiniGPT-4: Enhancing Vision-language Understanding with Advanced Large Language Models
OpenMMLab Detection Toolbox and Benchmark
NeRF visualization library under construction
Self-Supervised Learning Toolbox and Benchmark
Source code of Universal Weighting Metric Learning for Cross-Modal Matching. The paper was accepted to CVPR 2020.
Position Focused Attention Network for Image-Text Matching
Official PyTorch Implementation of Proxy Anchor Loss for Deep Metric Learning, CVPR 2020
LLaMA: Open and Efficient Foundation Language Models
ResNeSt: Split-Attention Networks
PyTorch source code for "Stacked Cross Attention for Image-Text Matching" (ECCV 2018)
Set up a new machine without sudo!
[ECCV 2022] ST-P3, an end-to-end vision-based autonomous driving framework via spatial-temporal feature learning.
A full-fledged version of Pix2Seq
Code and documentation to train Stanford's Alpaca models, and generate the data.
[NeurIPS 2022] Trajectory-guided Control Prediction for End-to-end Autonomous Driving: A Simple yet Strong Baseline.
[PAMI'22] TransFuser: Imitation with Transformer-Based Sensor Fusion for Autonomous Driving, [CVPR'21] Multi-Modal Fusion Transformer for End-to-End Autonomous Driving
Goal-oriented Autonomous Driving
YOLOX is a high-performance anchor-free YOLO, exceeding yolov3~v5 with MegEngine, ONNX, TensorRT, ncnn, and OpenVINO supported. Documentation: https://yolox.readthedocs.io/