github-sxiong Goto Github PK
Type: User
Type: User
[2022 T-ASE] DSQNet: A Deformable Model-Based Supervised Learning Algorithm for Grasping Unknown Occluded Objects
[RSS 2024] 3D Diffusion Policy: Generalizable Visuomotor Policy Learning via Simple 3D Representations
This project highlights the use of deep learning for 3d point cloud classification of common objects using Standford's ShapeNet dataset
๐งโ๐ซ 59 Implementations/tutorials of deep learning papers with side-by-side notes ๐; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, ...), gans(cyclegan, stylegan2, ...), ๐ฎ reinforcement learning (ppo, dqn), capsnet, distillation, ... ๐ง
MIT-Princeton Vision Toolbox for the Amazon Picking Challenge 2016 - RGB-D ConvNet-based object segmentation and 6D object pose estimation.
MIT-Princeton Vision Toolbox for Robotic Pick-and-Place at the Amazon Robotics Challenge 2017 - Robotic Grasping and One-shot Recognition of Novel Objects with Deep Learning.
This repo lists relevant papers summarized in our survey paper: A Systematic Survey of Prompt Engineering on Vision-Language Foundation Models.
[RSS 2024] Consistency Policy: Accelerated Visuomotor Policies via Consistency Distillation
A Fresh Look at Human Data for Robotic Pre-Training
DiffCLIP: Leveraging Stable Diffusion for Language Grounded 3D Classification
ICRA 2022 "Hybrid Physical Metric For 6-DoF Grasp Pose Detection"
๐๐๐ Food analysis baseline with Theseus. Integrate object detection, image classification and multi-class semantic segmentation ๐๐๐
A pipeline to generate artificial rgb/depth image datasets using gazebo Simulator
ๅบไบGRCNN็ๆบๆขฐ่่ง่งๅนณ้ขๆๅ
3DSGrasp: 3D Shape-Completion for Robotic Grasp
Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model
Mirage: a zero-shot cross-embodiment policy transfer method. Benchmarking code for cross-embodiment policy transfer.
๐จ ML Visuals contains figures and templates which you can reuse and customize to improve your scientific writing.
Codes for circular-teaching [working in progress...]
A Baseline for Multi-Label Image Classification Using Ensemble Deep CNN
This repository presents a couple of approaches to the problem of multi-view image classification. I faced this challenge during a hackathon in which I participated, and decided to share my code here. I've also written a Medium article to provide further details and explanations. Feel free to check it out !
ๅพๅๅคๆ ็ญพๅ็ฑปๆ ๆณจๅทฅๅ ท
Octopi: Object Property Reasoning with Large Tactile-Language Models
This is the ROS Wrapper of OD-GraspNet
PoinTramba: A Hybrid Transformer-Mamba Framework for Point Cloud Analysis
OpenNI
A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.