Giter VIP home page Giter VIP logo

Hey 👋🏽, I'm cpuimage

Hi, I'm ZhiHan Gao and I live in Guangzhou, China.

I write open source projects about audio and image algorthms in github.

If you like my open source projects, and useful for you,

please consider buying me a coffee.

Thank you for your support!

Talking about Personal Stuffs:

  • 👨🏽‍💻 I have worked for Baidu, KingSoft, etc.
  • 🌱 I’m currently working on
    • Deep Learning

      • A Trimap-Free Solution for Real-Time Automatic Portrait Matting on Mobile Devices
      • A Robust Optimizer for Accelerated Training Convergence in Deep Learning => Normalization Is No Longer Needed
      • A General and Adaptive Robust Loss Structure Scheme
      • A Robust Loss Weighting Solution For Learning Long-Tail Data
      • Image Synthesis and Semantic Manipulation Using Stable Diffusion Networks
      • Stable Diffusion Architecture Optimization And Deployment On Mobile Devices
      • A Robust Solution For Accelerated Training Convergence And Learning Long-Tail Data
      • A Arbitrary Resolution Super Resolution Solution for Real World
      • Accelerate Stable Diffusion FP16 Inference Deployment Optimization with TensorRT
      • Port Stable Diffusion X4 Upscaler To TensorFlow And Support FP16 Inference Deployment
      • Port Stable Diffusion PromptGen (GPT2) To TensorFlow And Support ONNX Inference Deployment
      • Improve Batch Normalization for Robust Training and Inference => Normalization Is No Longer Needed
      • Stable Diffusion Architectural Distillation
      • Content-aware 3-view synthesis based on Stable Diffusion
      • Super Resolution Solution based on Stable Diffusion
      • Video Editing techniques based on Stable Diffusion
      • Port Stable Diffusion XL 1.0 To TensorFlow And Support FP16 Inference Deployment
      • Stable Diffusion V1.5 And XL Inference With PyTorch Weights And More Features Like Stable Diffusion Web UI In TensorFlow
      • LoRA TensorFlow 2 Implementation For Fine-Tuning Stable Diffusion
    • Statistical Algorithms

      • Real time and embedded implementation of speech enhancement algorithms based on Minimum Mean-Square Error Short-Time Spectral Amplitude estimation (MMSE-STSA)
  • 👯 I’m looking to collaborate on audio and image algorithms
    • 🤔 Reach me on
    1. WeChat: DbgMonks
    2. QQ: 200759103
    3. Telegram: cpuimage
  • 💬 Any paid technical service or solution consulting

cpuimage's Projects

adapnm icon adapnm

The Unofficial Tensorflow 2 Implementations of Positive-Negative Momentum Optimizers.

albumentations icon albumentations

Fast image augmentation library and an easy-to-use wrapper around other libraries. Documentation: https://albumentations.ai/docs/ Paper about the library: https://www.mdpi.com/2078-2489/11/2/125

angulargrad icon angulargrad

The Unofficial Tensorflow 2 Implementations of AngularGrad Optimizers.

audiodenoise icon audiodenoise

c implementation of 《Audio Denoise by Time-Frequency Block Thresholding》

beeps icon beeps

Fast Bi-Exponential Edge-Preserving Blur Filter Implementation In C

bpm icon bpm

tempo analysis (bpm detection) algorithms for dealing with beats per second detection

bpm-detector icon bpm-detector

A simple versions of Scheirer's beat detection algorithm

celebahairmask-hq icon celebahairmask-hq

A large-scale face dataset for hair segmentation, hair recognition, and GANs for hair generation and editing.

cldice icon cldice

Tensorflow implementation of clDice loss

cpufft icon cpufft

A Simple and Efficient FFT Implementation in C

dct_8x8 icon dct_8x8

float data loss compression algorithm base DCT 8X8

deblurring icon deblurring

Estimating an Image's Blur Kernel Using Natural Image Statistics, and Deblurring it: An Analysis of the Goldstein-Fattal Method

deskew icon deskew

Detect image skew angle and deskew image

dualattentionguideddropout icon dualattentionguideddropout

Unofficial Tensorflow Implementation of Dual-attention Guided Dropblock Module https://arxiv.org/abs/2003.04719

fftresampler icon fftresampler

A Simple and Efficient Implementation Of Fast Fourier Transform For Audio Resampler

fftw3 icon fftw3

FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions, of arbitrary input size, and of both real and complex data (as well as of even/odd data, i.e. the discrete cosine/sine transforms or DCT/DST).

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.