ytep-zhi Goto Github PK

followers: 105 following: 830 repos: 30 gists: 0

Name: Jiazhi Yang

Type: User

Company: @OpenDriveLab

Bio: Researcher @OpenDriveLab.

Location: Shanghai, China

Jiazhi Yang's Projects

bevformer

[ECCV 2022] This is the official implementation of BEVFormer, a camera-only framework for autonomous driving perception, e.g., 3D object detection and semantic map segmentation.

centerformer

Implementation for CenterFormer: Center-based Transformer for 3D Object Detection (ECCV 2022)

diffusiondet

PyTorch implementation of DiffusionDet (https://arxiv.org/abs/2211.09788)

dino

PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO

dit

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

groundingdino

The official implementation of "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

hal

Code release for "HAL: Improved Text-Image Matching by Mitigating Visual Semantic Hubs" at AAAI 2020.

llama-adapter

Fine-tuning LLaMA to follow instructions within 1 Hour and 1.2M Parameters

lora

Using Low-rank adaptation to quickly fine-tune diffusion models.

mae-pytorch

Unofficial PyTorch implementation of Masked Autoencoders Are Scalable Vision Learners

mask2former

Code release for "Masked-attention Mask Transformer for Universal Image Segmentation"

minigpt-4

MiniGPT-4: Enhancing Vision-language Understanding with Advanced Large Language Models

mmdetection

OpenMMLab Detection Toolbox and Benchmark

nerfvis

NeRF visualization library under construction

openselfsup

Self-Supervised Learning Toolbox and Benchmark

polyloss

Source code for "Universal Weighting Metric Learning for Cross-Modal Matching", accepted at CVPR 2020.

position-focused-attention-network

Position Focused Attention Network for Image-Text Matching

proxy-anchor-cvpr2020

Official PyTorch Implementation of Proxy Anchor Loss for Deep Metric Learning, CVPR 2020

pyllama

LLaMA: Open and Efficient Foundation Language Models

resnest

ResNeSt: Split-Attention Networks

scan

PyTorch source code for "Stacked Cross Attention for Image-Text Matching" (ECCV 2018)

setup

Set up a new machine without sudo!

st-p3

[ECCV 2022] ST-P3, an end-to-end vision-based autonomous driving framework via spatial-temporal feature learning.

stable-pix2seq

A full-fledged version of Pix2Seq

stanford_alpaca

Code and documentation to train Stanford's Alpaca models, and generate the data.

tcp

[NeurIPS 2022] Trajectory-guided Control Prediction for End-to-end Autonomous Driving: A Simple yet Strong Baseline.

transfuser

[PAMI'22] TransFuser: Imitation with Transformer-Based Sensor Fusion for Autonomous Driving, [CVPR'21] Multi-Modal Fusion Transformer for End-to-End Autonomous Driving

uniad

Goal-oriented Autonomous Driving

unified-io-inference

yolox

YOLOX is a high-performance anchor-free YOLO, exceeding yolov3~v5 with MegEngine, ONNX, TensorRT, ncnn, and OpenVINO supported. Documentation: https://yolox.readthedocs.io/
