ANCHOR

08/2018 update: Our single-model top-1 test accuracy is now 97.68%. We changed the model slightly and applied a few optimization techniques. As of 08/2018, this result is again the best in the world, i.e., the state of the art. (End of update)


This repo contains the model, weights, and test samples for our handwritten Chinese character recognition (HCCR) development.

The model is VGG-like. We tried other CNN architectures such as ResNet and DenseNet, but so far this VGG-like model performs best. We also tried wider nets (more conv filters per layer); their results are on par with the smaller one, but they take much longer to train.

As of now (04/2017), this model achieves better test accuracy (97.25%) than any other published result, including ensemble results. (With an ensemble, we actually got even better accuracy!)

The test data provided are a subset (about 1/10th) of the test data from the ICDAR 2013 Chinese Handwriting Recognition Competition. There are 3755 characters, and each character has 6 samples, so there are 3755*6=22530 samples in total.
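The sample count can be checked with quick arithmetic, or by counting files on disk. The sketch below assumes the samples are PNG files stored somewhere under data/test; the helper name count_png_samples is hypothetical and not part of this repo.

```python
import os

NUM_CLASSES = 3755        # characters in the test subset
SAMPLES_PER_CLASS = 6     # samples per character
EXPECTED_TOTAL = NUM_CLASSES * SAMPLES_PER_CLASS  # 22530


def count_png_samples(root):
    """Count .png files under root, recursing into subdirectories."""
    total = 0
    for _dirpath, _dirnames, filenames in os.walk(root):
        total += sum(1 for name in filenames if name.lower().endswith(".png"))
    return total


if __name__ == "__main__":
    # Assumption: run from the repo root, where data/test lives.
    found = count_png_samples("data/test")
    print(f"expected {EXPECTED_TOTAL}, found {found}")
```

If the two numbers differ, the clone is probably incomplete or the directory layout differs from what is assumed here.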

(The data is copyrighted by the Institute of Automation, Chinese Academy of Sciences; see: C.-L. Liu, F. Yin, D.-H. Wang, Q.-F. Wang, CASIA online and offline Chinese handwriting databases, Proc. 11th International Conference on Document Analysis and Recognition (ICDAR), Beijing, China, 2011, pp. 37-41.)

Install

Just do:

pip install keras tensorflow-gpu h5py pillow

If you don't have a GPU, replace tensorflow-gpu with tensorflow. Note that it takes significantly longer to run without a GPU.

Usage

Clone this repo. (Warning: this is a big repo, over 100 MB; clone with caution.)

With no arguments:

python model_test.py

runs all test samples in data/test (22530 total) and prints the loss and accuracy.
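For readers unfamiliar with the two numbers: "loss" and "accuracy" here presumably mean categorical cross-entropy and top-1 accuracy, as is common for Keras classifiers, though this README does not say so explicitly. A minimal pure-Python sketch of both metrics, with hypothetical function names and made-up probabilities:

```python
import math


def top1_accuracy(probs, labels):
    """Fraction of samples whose highest-probability class equals the label."""
    correct = 0
    for p, y in zip(probs, labels):
        predicted = max(range(len(p)), key=p.__getitem__)
        if predicted == y:
            correct += 1
    return correct / len(labels)


def mean_cross_entropy(probs, labels, eps=1e-7):
    """Average negative log-probability assigned to the true class."""
    total = 0.0
    for p, y in zip(probs, labels):
        # Clamp to eps so a zero probability does not produce log(0).
        total += -math.log(max(p[y], eps))
    return total / len(labels)


# Toy data: three samples, three classes (illustrative values only).
probs = [[0.7, 0.2, 0.1], [0.1, 0.8, 0.1], [0.5, 0.3, 0.2]]
labels = [0, 1, 2]
print(top1_accuracy(probs, labels))       # two of the three are correct
print(mean_cross_entropy(probs, labels))
```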

With the -p argument:

python model_test.py -p

it prints the top 3 predictions for each sample whose first prediction is wrong.
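The idea behind -p can be sketched as follows: rank the class probabilities for each sample, and report the top 3 only when the top-ranked class differs from the label. This is an illustration under assumed inputs (a list of probability vectors and integer labels), not the code in model_test.py; top_k and report_errors are hypothetical names.

```python
def top_k(probs, k=3):
    """Return (class_index, probability) pairs for the k most likely classes."""
    ranked = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
    return [(i, probs[i]) for i in ranked[:k]]


def report_errors(all_probs, labels, k=3):
    """Yield (sample_index, label, top_k) for samples whose top-1 is wrong."""
    for idx, (probs, label) in enumerate(zip(all_probs, labels)):
        best = top_k(probs, k)
        if best[0][0] != label:
            yield idx, label, best


# Toy data: sample 0 is classified correctly, sample 1 is not.
all_probs = [[0.6, 0.3, 0.05, 0.05], [0.1, 0.2, 0.3, 0.4]]
labels = [0, 2]
for idx, label, best in report_errors(all_probs, labels):
    print(f"sample {idx}: labeled {label}, top 3 = {best}")
```

In the real script the class indices would be mapped back to characters before printing.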

If you give it a filename:

python model_test.py sample.png

It will print 3 top predictions for the single sample.
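The three invocations above map onto a small command-line interface. The sketch below shows one way such an interface could be parsed with argparse; it is a guess at the shape of the CLI, not the actual contents of model_test.py, and build_parser and choose_mode are hypothetical names.

```python
import argparse


def build_parser():
    """Parser accepting an optional -p flag and an optional sample filename."""
    parser = argparse.ArgumentParser(
        description="Run the HCCR model on test samples.")
    parser.add_argument(
        "-p", action="store_true", dest="show_errors",
        help="print the top 3 predictions for misclassified samples")
    parser.add_argument(
        "filename", nargs="?", default=None,
        help="a single sample image to classify")
    return parser


def choose_mode(argv):
    """Return (mode, filename) for the given argument list."""
    args = build_parser().parse_args(argv)
    if args.filename is not None:
        return "single", args.filename   # python model_test.py sample.png
    if args.show_errors:
        return "errors", None            # python model_test.py -p
    return "all", None                   # python model_test.py


print(choose_mode([]))
print(choose_mode(["-p"]))
print(choose_mode(["sample.png"]))
```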

Fun

峻 C054-f-f.png Model is 100% sure it is , unfortunately it is .

拨 C044-f-f.png Our model is 100% sure it is , but it is .

尹 C017-f-f.png Model is 100% sure it is , but it is .

挚 C016-f-f.png Model is 100% sure it is , but it is .

AFAICT, the model is correct in all these cases; the test data are simply mislabeled (or the writer wrote the wrong character).
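Cases like these can be found systematically by looking for samples where the model disagrees with the label while being very confident. The sketch below is one way to do that, assuming per-sample probability vectors and integer labels are available; the function name and the 0.99 threshold are illustrative choices, not something taken from this repo.

```python
def flag_suspect_labels(all_probs, labels, threshold=0.99):
    """Return samples where a confident top-1 prediction contradicts the label.

    Each result is (sample_index, label, predicted_class, confidence).
    """
    suspects = []
    for idx, (probs, label) in enumerate(zip(all_probs, labels)):
        predicted = max(range(len(probs)), key=probs.__getitem__)
        if predicted != label and probs[predicted] >= threshold:
            suspects.append((idx, label, predicted, probs[predicted]))
    return suspects


# Toy data: only sample 0 is both wrong and confident.
all_probs = [[0.999, 0.001], [0.55, 0.45], [0.002, 0.998]]
labels = [1, 1, 1]
print(flag_suspect_labels(all_probs, labels))
```

Flagged samples still need a human look: a confident wrong prediction can be a labeling error, a writer's mistake, or a genuine model failure.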

Now for some unsure predictions by the model:

雀 C046-f-f.png Model is about 50/50 on and , I am too. (labeled )

卡 C017-f-f.png Model is 50/50 on and , so am I. (labeled )

荡 C044-f-f.png Model is torn between and . (labeled )

队 C017-f-f.png Top 3 are 义戏叉, it's messed up. (labeled )

Name

ANCHOR stands for "ANgzhou Chinese Handwriting Optical Recognition"

Contributors

wensheng
