Giter VIP home page Giter VIP logo

baidu_asr's Introduction

Hey 👋🏽, I'm cpuimage

Hi, I am ZhiHan Gao, living in Shantou, China.

I write open source projects about audio and image algorthms in github.

If you like my open source projects, and useful for you,

please consider buying me a coffee.

Thank you for your support!

Talking about Personal Stuffs:

  • 👨🏽‍💻 I have worked for Baidu, KingSoft, etc.
  • 🌱 I’m currently working on
    • Deep Learning
      • A Trimap-Free Solution for Real-Time Automatic Portrait Matting on Mobile Devices
      • A Robust Optimizer With Normalized Accelerated Convergence Capability in Deep Learning
      • A General and Adaptive Robust Loss Structure Scheme
      • A Robust Loss Weighting Solution For Learning Long-Tail Data
      • Image Synthesis and Semantic Manipulation Using Stable Diffusion Networks
      • Stable Diffusion Architecture Optimization And Deployment On Mobile Devices
      • A Robust Solution For Accelerated Training Convergence And Learning Long-Tail Data
      • A Arbitrary Resolution Super Resolution Solution for Real World
      • Accelerate Stable Diffusion FP16 Inference Deployment Optimization with TensorRT
      • Port Stable Diffusion X4 Upscaler To TensorFlow And Support FP16 Inference Deployment
      • Port Stable Diffusion PromptGen (GPT2) To TensorFlow And Support ONNX Inference Deployment
      • Improve Batch Normalization for Robust Training and Inference
      • Stable Diffusion Architectural Distillation
      • Content-aware 3-view synthesis based on Stable Diffusion in Game Art
      • Super Resolution Solution based on Stable Diffusion
      • Video Editing techniques based on Stable Diffusion
      • Port Stable Diffusion XL 1.0 To TensorFlow And Support FP16 Inference Deployment
      • A Plug-And-Play Algorithm For Asynchronous Inference With Frequency-Domain Decomposable Reconstruction For Arbitrary Visual Scenes
      • Stable Diffusion Inference With PyTorch Weights And More Features Like Stable Diffusion Web UI In Keras 3.x
      • Port FLUX.1 To Keras 3.x
      • FLUX.1 Support FP16 Inference Deployment and Low Memory Lora Training In PyTorch
    • Statistical Algorithms
      • Real time and embedded implementation of speech enhancement algorithms based on Minimum Mean-Square Error Short-Time Spectral Amplitude estimation (MMSE-STSA)
  • 👯 I’m looking to collaborate on audio and image algorithms
    • 🤔 Reach me on
      • Telegram Badge
      • Wechat Badge
      • QQ Badge
  • 💬 Any paid technical service or solution consulting
    • 📫 Reach me on mail:
      • mail Badge

baidu_asr's People

Contributors

cpuimage avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.