github-sxiong Goto Github PK

followers: 0 following: 0 repos: 45 gists: 0

Type: User

github-sxiong's Projects

-a-deformable-model-based-supervised-learning-algorithm-for-grasping-unknown-occluded-objects

[2022 T-ASE] DSQNet: A Deformable Model-Based Supervised Learning Algorithm for Grasping Unknown Occluded Objects

3d-diffusion-policy

[RSS 2024] 3D Diffusion Policy: Generalizable Visuomotor Policy Learning via Simple 3D Representations

3d-object-classification-using-pointnet

This project highlights the use of deep learning for 3D point cloud classification of common objects using Stanford's ShapeNet dataset.

annotated_deep_learning_paper_implementations

🧑‍🏫 59 Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠

apc-vision-toolbox

MIT-Princeton Vision Toolbox for the Amazon Picking Challenge 2016 - RGB-D ConvNet-based object segmentation and 6D object pose estimation.

arc-robot-vision

MIT-Princeton Vision Toolbox for Robotic Pick-and-Place at the Amazon Robotics Challenge 2017 - Robotic Grasping and One-shot Recognition of Novel Objects with Deep Learning.

awesome-prompting-on-vision-language-model

This repo lists relevant papers summarized in our survey paper: A Systematic Survey of Prompt Engineering on Vision-Language Foundation Models.

consistency-policy

[RSS 2024] Consistency Policy: Accelerated Visuomotor Policies via Consistency Distillation

data4robotics

A Fresh Look at Human Data for Robotic Pre-Training

diffclip-leveraging-stable-diffusion-for-language-grounded-3d-classification

DiffCLIP: Leveraging Stable Diffusion for Language Grounded 3D Classification

fgcnet_detailed

ICRA 2022 "Hybrid Physical Metric For 6-DoF Grasp Pose Detection"

food-recognition

🍔🍟🍗 Food analysis baseline with Theseus. Integrate object detection, image classification and multi-class semantic segmentation 🍞🍖🍕

gazebo_dataset_generation

A pipeline to generate artificial RGB/depth image datasets using the Gazebo simulator

graspkpnet

grcnn_plane_robotic_grasping

GRCNN-based visual planar grasping for a robotic arm

grcnn_rgb

ieee-icra-2023---3dsgrasp-3d-shape-completion-for-robotic-grasp-youtube-video-

3DSGrasp: 3D Shape-Completion for Robotic Grasp

learning-equi-angular-representations-for-online-continual-learning

mamb

Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

mirage-cross-embodiment-zero-shot-policy-transfer-with-cross-painting

Mirage: a zero-shot cross-embodiment policy transfer method. Benchmarking code for cross-embodiment policy transfer.

ml-visuals

🎨 ML Visuals contains figures and templates which you can reuse and customize to improve your scientific writing.

multi-head-ensemble-of-smoothed-classifiers

Code for circular-teaching [work in progress...]

multi-label-image-classification

A Baseline for Multi-Label Image Classification Using Ensemble Deep CNN

multi-view-image-classification

This repository presents a couple of approaches to the problem of multi-view image classification. I faced this challenge during a hackathon in which I participated, and decided to share my code here. I've also written a Medium article to provide further details and explanations. Feel free to check it out!
