SCUT-EnsText

The SCUT-EnsText dataset for scene text removal research is released by the Deep Learning and Visual Computing Lab of South China University of Technology. The dataset can be downloaded through the following links:

train set - Baidu Cloud (Password : 5xwi) - Google Drive

test set - Baidu Cloud (Password : u0ks) - Google Drive

The code for EraseNet is available in the EraseNet repository.

Note: The SCUT-EnsText dataset can only be used for non-commercial research purposes. The training set and testing set are available now, but the training set is encrypted with an additional code. Scholars or organizations who want to use the SCUT-EnsText database should first fill in this Application Form and send it to us via email ([email protected], or [email protected]). When submitting the application form, please list or attach 1-2 of your publications from the last 6 years to show that you (or your team) do research in related fields such as OCR, image inpainting, and text editing. We will give you the decompression password after your letter has been received and approved.

Dataset Description

The SCUT-EnsText benchmark aims to motivate more advanced deep learning models for the scene text removal task. All of the images in our dataset are collected from several public real-world scene text reading benchmarks, including ICDAR-2013, ICDAR-2015, MS COCO-Text, SVT, MLT-2017, MLT-2019, and ArTs.

SCUT-EnsText contains a total of 3,562 images with diverse text characteristics, including text shapes (horizontal text, arbitrary quadrilateral text, and curved text) and languages (English and Chinese). It is split into a training set and a testing set. To ensure that both have the same data distribution, we randomly select approximately 70% of the images for training and use the remainder for testing. In total, the training set contains 2,749 images with 16,460 words, while the testing set contains 813 images with 4,864 words.
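
The reported split can be cross-checked with a few lines of Python. This is only a sanity-check sketch: the numbers are copied from the description above, and the variable names are illustrative and not part of the dataset release.

```python
# Counts as reported in the dataset description (not read from the dataset files).
train_images, test_images = 2749, 813
train_words, test_words = 16460, 4864

# Totals implied by the two subsets.
total_images = train_images + test_images
total_words = train_words + test_words

# Share of all images that ended up in the training set.
train_fraction = train_images / total_images

print(f"images: {total_images}, words: {total_words}")
print(f"training share of images: {train_fraction:.1%}")
```

The image total matches the 3,562 stated above; the training share computed this way comes from the reported counts rather than from the nominal split ratio.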

Citation and Contact

Please consider citing our paper when you use our dataset:

@ARTICLE{Erase,
  author={Liu, Chongyu and Liu, Yuliang and Jin, Lianwen and Zhang, Shuaitao and Luo, Canjie and Wang, Yongpan},
  journal={IEEE Transactions on Image Processing},
  title={EraseNet: End-to-End Text Removal in the Wild},
  year={2020},
  volume={29},
  pages={8760-8775}
}

For any questions about the dataset, please contact the authors by email: Chongyu Liu ([email protected]) or Prof. Jin ([email protected]).

Copyright

Copyright © 2020 SCUT-DLVC. All Rights Reserved.
