Summary

  • Experience building an engineering team from scratch to about 10 engineers;
  • Software developer with management experience and solid programming skills;
  • Experienced researcher in computer vision, machine learning and deep learning.

Contact Information

Skills

  • Programming: C, Java, Python, JavaScript, MongoDB, React
  • Computer vision: motion analysis, action recognition, face recognition, object recognition
  • Machine learning: sparse learning, multi-task learning, statistical learning
  • Deep learning: CNN, RNN, LSTM, GAN, Caffe, TensorFlow, Keras
  • Others: Linux, Docker, MongoDB, ARM NEON

Education

  • Bachelor, Beijing Normal University, 2005~2009, EE;

  • Ph.D., Arizona State University, 2009~2014, CS;

    • Sparse learning and face recognition;
    • Multi-task learning and attribute learning;
    • Video analysis and motion skill analysis.
    • Dissertation and book.

Experience

  • Manager, Facebook, 2019~now

    • Lead a team developing face tracking technology that drives AR/VR products, the Facebook app, Instagram and Messenger.
  • Founder & CTO, Button, 2018~2019.

    • Lead the development of an asset discovery platform for pharmaceuticals and medical devices;
    • Build a team of six engineers and several data scientists with cross-business collaborations;
    • Skills: JavaScript, MongoDB, React, Natural Language Processing, Recommendation Systems.
  • Software Engineer, Google, 2016~2018.

    • Work on the camera of Google's 2016 and 2017 Pixel Phones;
    • Tech lead in video pipeline (stability, power consumption, codec tuning, audio performance);
    • Lead the development of hyper-lapse mode of Pixel Camera;
    • Contributor to the new camera framework for Pixel 2018;
    • Improve Panorama, Photosphere and Refocus modes of Pixel camera;
    • Skills: Java, C, Android, JNI, Python, TensorFlow, convolutional neural networks.
  • Staff Software Engineer, Samsung, 2014~2016.

    • Lead algorithm and software development for 3D camera (time of flight, ToF);
      • Improve depth quality over noise and motion blur (5 mm accuracy within 1 to 5 meters);
      • Optimize the code for real time performance on multiple platforms;
    • Research and develop deep learning algorithms;
      • Develop deep learning algorithms for various tasks, including image recognition, object detection, image quality assessment, image processing, image captioning, speech recognition and machine translation;
      • Optimize deep learning algorithms for mobile platforms for better speed and a reduced memory footprint, including sparse pruning, quantization (bit reduction) and compression;
      • Guide the design of Samsung's mobile neural processor and identify hot spots in neural network inference;
      • Rich experience with both convolutional and recurrent neural networks, and familiar with popular deep learning packages, including Caffe and Theano (Keras);
    • Develop an algorithm for high-quality multi-frame super-resolution optimized for dynamic scenes;
    • Several patents pending and papers under review;
    • Skills: C, Android (NDK), X86 SSE, ARM NEON, OpenCL, CEVA Vector processing, Python, CUDA, Java, OpenMP, Multi-threading, Socket (TCP/UDP).
  • Intern, Qualcomm, 2012.

    • Algorithm design and development for an optical-based multi-touch system;
    • Winner of 2nd prize in Qualcomm Qtech 2012 and two patents (US20140264034 A1, WO2014158946 A1, US9109949);
  • Intern, Sharp Laboratory of America, 2011.

    • Algorithm design and development for defect detection;
    • Algorithm development for template based image matching.
    • Code deployed in commercial products and filed as a US patent (US8705839 B2);
  • Research Assistant, Arizona State University, 2009~2015.

    • Automatically evaluate the motion skills in surgical simulation with tracking-free approaches.

      • Learn discriminative models from relative labels, which relaxes the labeling requirement.
      • Capable of extracting the temporal dynamics from the video to facilitate detailed analysis;
    • Propose a discriminative dictionary learning algorithm to improve the sparse representation based face recognition system.

      • Reduce the dictionary size without compromising the accuracy.
      • Further reduce the dictionary size and improve the accuracy via learning a decomposition of face images into physically meaningful components;
    • Learn multiple classifiers jointly via multi-task learning.

      • Exploit relative labels to remove the need for traditional labeling.
      • Applied successfully to various applications, including motion analysis, image classification and image co-segmentation;
    • Build vision-based human-robot interaction in indoor environments

      • Capable of following a target subject in complex environments
      • Interact via vision-based gesture recognition in real time;
    • Enhance non-reference image quality assessment with guidance from visual saliency;

Honors and Awards

  • Samsung Best Paper Award 2015 Merit Award (50 out of 1100 candidates)
  • Samsung Best Patent Review Committee Member Golden Awards 2015
  • University Graduate Fellowship Award, 2013.4, 2009.9
  • Qualcomm Qtech 2012 2nd Prize, 2012.6
  • Outstanding Volunteer Service Award by ACM MM 2011, 2011.11

Patents

  • US-2016379352-A1, Label-free non-reference image quality assessment via deep neural network, "Samsung Electronics Co., Ltd.", "Qiang Zhang, Zhengping Ji, Lilong SHI, Ilia Ovsiannikov"
  • US-2013129188-A1, Electronic devices for defect detection, "Sharp Laboratories Of America, Inc.", "Qiang Zhang, Xinyu Xu, Chang Yuan, Hae-Jong Seo, Petrus J.L. van Beek"
  • US-2014264034-A1, Near-field optical sensing system, "Qualcomm Mems Technologies, Inc.", "Xiquan Cui, Muhammed I. Sezan, Russell Wayne Gruhlke, Qiang Zhang"
  • US-9934557-B2, Method and apparatus of image representation and processing for dynamic vision sensor, "Samsung Electronics Co., Ltd.", "Zhengping Ji, Kyoobin Lee, Qiang Zhang, Yibing Michelle Wang, Hyun Surk Ryu, Ilia Ovsiannikov"
  • US-2016309135-A1, Concurrent rgbz sensor and system, "Ilia Ovsiannikov, Yibing Michelle Wang, Gregory Waligorski, Qiang Zhang"
  • US-2016358314-A1, Method and apparatus of multi-frame super resolution robust to local and global motion, "Zhengping Ji, Qiang Zhang, Lilong SHI, Ilia Ovsiannikov"
  • US-2017185871-A1, Method and apparatus of neural network based image signal processor, "Qiang Zhang, Zhengping Ji, Yibing Michelle Wang, Ilia Ovsiannikov"
  • US-2016350649-A1, Method and apparatus of learning neural network via hierarchical ensemble learning, "Qiang Zhang, Zhengping Ji, Lilong SHI, Ilia Ovsiannikov"
  • US-2017213105-A1, Method and apparatus for event sampling of dynamic vision sensor on image formation, "Zhengping Ji, Qiang Zhang, Kyoobin Lee, Yibing Michelle Wang, Hyun Surk Ryu, Ilia Ovsiannikov"

Publications

Book

  • Qiang Zhang, Baoxin Li, "Dictionary Learning in Visual Computing", Morgan & Claypool, doi:10.2200/S00640ED1V01Y201504IVM018

Dissertation

  • Qiang Zhang, "Semantic Sparse Learning in Images and Videos", Doctoral Dissertation, Ph.D. Computer Science, Arizona State University, 2014

Deep Learning

  • Z Ji, I Ovsiannikov, Y Wang, L Shi, Q Zhang, Reducing weight precision of convolutional neural networks towards large-scale on-chip image recognition, SPIE Sensing Technology+ Applications, 94960A-94960A-9

Motion Analysis

  • Y Wang, Q Zhang, B Li, Efficient unsupervised abnormal crowd activity detection based on a spatiotemporal saliency detector, Applications of Computer Vision (WACV), 2016 IEEE Winter Conference on, 1-9
  • Lin Chen, Qiang Zhang, Peng Zhang and Baoxin Li. Instructive Video Retrieval for Surgical Skill Coaching Using Attribute Learning. IEEE International Conference on Multimedia and Expo (ICME) 2015, Torino, Italy.
  • Qiang Zhang, Baoxin Li, "Relative Hidden Markov Models for Video-based Evaluation of Motion Skills in Surgical Training," Pattern Analysis and Machine Intelligence, IEEE Transactions on
  • Qiang Zhang and Baoxin Li, Relative Hidden Markov Models for Evaluating Motion Skills, IEEE Computer Vision and Pattern Recognition (CVPR) 2013, Portland, OR
  • Lin Chen, Qiongjie Tian, Qiang Zhang and Baoxin Li. Learning Skill-Defining Latent Space in Video-Based Analysis of Surgical Expertise – A Multi-Stream Fusion Approach. NextMed/MMVR20. San Diego, CA, 2013.
  • Qiongjie Tian, Lin Chen, Qiang Zhang and Baoxin Li. Enhancing Fundamentals of Laparoscopic Surgery Trainer Box via Designing A Multi-Sensor Feedback System. NextMed/MMVR20. San Diego, CA, 2013.
  • Qiang Zhang, Lin Chen, Qiongjie Tian and Baoxin Li. Video-based analysis of motion skills in simulation-based surgical training. SPIE Multimedia Content Access: Algorithms and Systems VII. San Francisco, CA, 2013.
  • Qiang Zhang and Baoxin Li. Video-based motion expertise analysis in simulation-based surgical training using hierarchical Dirichlet process hidden Markov model. In Proceedings of the 2011 international ACM workshop on Medical multimedia analysis and retrieval (MMAR '11). ACM [oral], New York, NY, USA, 19-24.
  • Qiang Zhang and Baoxin Li, Towards Computational Understanding of Skill Levels in Simulation-Based Surgical Training via Automatic Video Analysis, International Symposium on Visual Computing (ISVC) 2010, Las Vegas, NV

Face Recognition

  • Qiang Zhang and Baoxin Li. Mining Discriminative Components With Low-Rank And Sparsity Constraints for Face Recognition. The 18th ACM SIGKDD International Conference On Knowledge Discovery and Data Mining (SIGKDD 2012).
  • Qiang Zhang and Baoxin Li, Joint Sparsity Model with Matrix Completion for an Ensemble of Images, IEEE International Conference on Image Processing (ICIP) 2010, Hong Kong, China
  • Qiang Zhang and Baoxin Li, Discriminative K-SVD for Dictionary Learning in Face Recognition, IEEE Computer Vision and Pattern Recognition (CVPR) 2010, San Francisco, CA

Multi-task Learning

  • Qiang Zhang, Jiayu Zhou, Yilin Wang, Jieping Ye and Baoxin Li, Image Cosegmentation via Multi-task Learning, BMVC 2014, Nottingham, UK
  • Lin Chen, Qiang Zhang and Baoxin Li, Predicting Multiple Attributes via Relative Multi-task Learning, IEEE Computer Vision and Pattern Recognition (CVPR) 2014, Columbus, OH
  • Qiang Zhang, Baoxin Li, "Max Margin Multi-Attribute Learning with Low Rank Constraint," Image Processing, IEEE Transactions on

Image Processing and Others

  • Z Ji, Q Zhang, L Shi, I Ovsiannikov, Multi-frame Super Resolution Robust to Local and Global Motion, SPIE Medical Imaging, 101370Y-101370Y-7
  • Yilin Wang, Qiang Zhang and Baoxin Li, Structure-Preserving Image Quality Assessment, IEEE International Conference on Multimedia and Expo (ICME) 2015, Torino, Italy
  • Yilin Wang, Qiang Zhang and Baoxin Li, Semantic Saliency Weighted SSIM for Video Quality Assessment, International Workshop on Video Processing and Quality Metrics for Consumer Electronics (VPQM) 2014, Chandler, AZ
  • Qiang Zhang, Chang Yuan, Xinyu Xu, Peter Van Beek, Hae-Jong Seo, and Baoxin Li. Efficient defect detection with sign information of Walsh Hadamard transform. IS&T/SPIE Image Processing: Machine Vision Applications VI. San Francisco, CA, 2013
  • Jin Zhou, Qiang Zhang, Baoxin Li and Ananya Das, Synthesis of Stereoscopic Views from Monocular Endoscopic Videos, IEEE Computer Vision and Pattern Recognition (CVPR) 2010 workshop on Mathematical Methods in Biomedical Image Analysis, San Francisco, CA
  • Qiang Zhang, Pengfei Xu, Wen Li, Zhongke Wu and Mingquan Zhou, Efficient Edge Matching Using Improved Hierarchical Chamfer Matching, IEEE International Symposium on Circuits and Systems (ISCAS) 2009, Taipei, Taiwan
  • Qiang Zhang, Hua Li, Yan Zhao and Xinlu Liu, Exploration of Event-Evoked Oscillatory Activities during a Cognitive Task, The 4th International Conference on Natural Computation and The 5th International Conference on Fuzzy Systems and Knowledge Discovery (ICNC-FSKD) 2008, Jinan, China

Services

  • Volunteer at IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2010, 2013, and ACM Multimedia 2011
  • Reviewer of IEEE International Conference on Computer Vision (ICCV) 2015, International Joint Conference on Artificial Intelligence (IJCAI) 2015, 2016, IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2015, 2016
  • Reviewer of International Journal of Machine Learning and Cybernetics (JMLC)
  • Reviewer of IEEE Transactions on Pattern Analysis and Machine Intelligence, IEEE Signal Processing Letters, IEEE Transactions on Circuits and Systems for Video Technology, IEEE Transactions on Knowledge and Data Engineering
  • Reviewer of Elsevier Journal of Visual Communication and Image Representation, Elsevier Robotics and Autonomous Systems, Elsevier Pattern Recognition, Elsevier Information Fusion, Elsevier Signal Processing: Image Communication, Elsevier Journal of Systems and Software
  • Reviewer of SPIE Journal of Electronic Imaging
  • Reviewer of Journal of Multimedia, Journal of Computers
  • Reviewer of Springer International Journal of Machine Learning and Cybernetics, Springer Computational and Applied Mathematics

Qiang Zhang's Projects

android

This serves as my Android programming learning repository.

android-keyboard-gadget

Convert your Android device into USB keyboard/mouse, control your PC from your Android device remotely, including BIOS/bootloader.

android-ndk-profiler

A library you can compile into your Android NDK code to generate gprof-compatible profile information.

autofocus

This is the repository for my Android app AutoFocus.

automatic-api

A list of software that turns your database into a REST/GraphQL API

button_constant

This package holds the constants shared between the front end and back end of Button Tech.

chat-via-protobuf

This repository holds my project for an iOS chatting app with Protobuf, AFNetworking and JSQMessageController, which supports chatting via text, picture, voice and video. Only client side code is hosted here.

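chinese-poetry

The most comprehensive database of classical Chinese poetry: nearly 14,000 poets of the Tang and Song dynasties, close to 55,000 Tang poems and 260,000 Song poems, plus 21,050 ci poems by 1,564 ci poets of the Northern and Southern Song.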

chinese-xinhua

Chinese Xinhua Dictionary database, including xiehouyu (two-part allegorical sayings), idioms, words and characters. Provides a Xinhua Dictionary API.

deeplearntoolbox

Matlab/Octave toolbox for deep learning. Includes Deep Belief Nets, Stacked Autoencoders, Convolutional Neural Nets, Convolutional Autoencoders and vanilla Neural Nets. Each method has examples to get you started.

dermnet

This repository includes the code for crawling images from [Dermnet](http://www.dermnet.com/dermatology-pictures-skin-disease-pictures/). Please obtain a license to use those images from [Dermnet](http://www.dermnet.com/contacts.php).

docker-ngrok-server

Docker image for an ngrok server, enabling NAT traversal into a private network. You can also download and use the ngrok client from the attachments.

docker-squid

Dockerfile to create a Docker container image for Squid proxy server

enterprise-registration-data-of-chinese-mainland

This repository is a dataset of more than 10,000,000 enterprise registration records from 31 provinces of mainland China, covering 1978 to 2019. Each record includes details such as company name, registered address, unified social credit code, region, registration date, business scope, legal representative, registered capital and enterprise type. Tags: business big data, enterprise information, enterprise registration data.

esp-adf

Espressif Audio Development Framework for ESP32-A1S

fb.resnet.torch

Torch implementation of ResNet from http://arxiv.org/abs/1512.03385 and training scripts

followme

This is the repository for the project Follow ME
