fucking-algorithm's Introduction

Hi there 👋 I am ShengYun (Anthony) Peng, a CS PhD student @Georgia Tech

Research interest:

My research strengthens the generalization and safety of generative AI, spanning vision models, LLMs, and VLMs. As steps towards this goal, I work on:

  • Generalizable multimodal representation learning: foundation models for table recognition (UniTable, Table Transformer, Self-supervised Pretraining), RGB-infrared fusion object tracking (DsiamMFT, SiamFT), structural health monitoring (system identification).
  • Safe and robust machine learning models: LLM loss landscape (coming soon!), robust CNN design principles (#1 on RobustBench CIFAR-10), multi-task person tracking (SkeleVision), and defending against LLM attacks (LLM Self Defense).

Papers

  • UniTable: Towards a Unified Framework for Table Structure Recognition via Self-Supervised Pretraining, preprint - [paper] [code]
  • Self-Supervised Pre-Training for Table Structure Recognition Transformer, AAAI'24 Workshop Oral - [paper] [code]
  • High-Performance Transformers for Table Structure Recognition Need Early Convolutions, NeurIPS'23 Workshop Oral - [paper] [code]
  • Robust Principles: Architectural Design Principles for Adversarially Robust CNNs, BMVC'23 Best Poster Award - [paper] [code]
  • SkeleVision: Towards Adversarial Resiliency of Person Tracking with Multi-Task Learning, ECCV'22 Workshop - [paper] [code]

fucking-algorithm's People

Contributors

1097452462, brucecat, cchroot, chenjiexu, csguojin, dekunma, enrilwang, eric496, gowufang, happyvictorwu, jasonlu0117, jasper-joe, jodyz0203, kalok87, kepler-zc, kingkong1111, kptnewler, l-wweeii, labuladong, leodpen, littlecry, lixiandea, lo-tp, marinejoker, miraclemin, tianzhongwei, tonytang731, zakanun, zhangxiann, zhengpj95
