
Comments (13)

dmMaze commented on May 9, 2024

Thanks for the suggestion. MangaInpainting is fantastic work; I would like to read its paper (and its related work) before adding it to this project, perhaps later this month.

from ballonstranslator.

Snowad14 commented on May 9, 2024

I was thinking of adding it at the same time as EasyOCR (better results for English text). I'll probably do it during my next vacation if nobody else has added it by then.


bropines commented on May 9, 2024

I would add it myself, but I don't know PyQt or, more generally, how to embed it all into an application. I'm still learning.


Snowad14 commented on May 9, 2024

It has nothing to do with PyQt. Everything is in the dl folder, and you already have examples.
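
A minimal sketch of how a new inpainter might be plugged in without touching any GUI code, assuming a registry-style layout. All names here (`INPAINTERS`, `register_inpainter`, `InpainterBase`, `FillInpainter`) are hypothetical and are not taken from the project's dl folder; the existing modules there are the real reference.

```python
from typing import Callable, Dict, List

Image = List[List[int]]  # grayscale image as nested lists (assumption for the sketch)

# Hypothetical registry: maps a name to an inpainter class.
INPAINTERS: Dict[str, type] = {}


def register_inpainter(name: str) -> Callable[[type], type]:
    """Class decorator that records the class under `name`."""
    def decorator(cls: type) -> type:
        INPAINTERS[name] = cls
        return cls
    return decorator


class InpainterBase:
    """Hypothetical base class: subclasses implement inpaint()."""

    def inpaint(self, image: Image, mask: Image) -> Image:
        raise NotImplementedError


@register_inpainter("fill")
class FillInpainter(InpainterBase):
    """Stand-in for a real model: paints masked pixels with a constant."""

    def __init__(self, fill_value: int = 255) -> None:
        self.fill_value = fill_value

    def inpaint(self, image: Image, mask: Image) -> Image:
        # Build a new image; pixels where the mask is non-zero are replaced.
        return [
            [self.fill_value if m else px for px, m in zip(img_row, mask_row)]
            for img_row, mask_row in zip(image, mask)
        ]
```

With a layout like this, the application would only need to look the class up by name, e.g. `INPAINTERS["fill"]().inpaint(image, mask)`, so a model such as MangaInpainting would be wrapped in one more registered class.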


bropines commented on May 9, 2024

Well, I'll see what I can do.


bropines commented on May 9, 2024

> There is nothing to do with pyqt, everything is in the dl folder and you already have examples

When will you be free?
I didn't understand at all how to add it :)


rspreet92 commented on May 9, 2024

@dmMaze any plans for EasyOCR?


Snowad14 commented on May 9, 2024

I looked at it, but the results were in fact not very good either (though still a little better than the one currently used). One of the problems is that comic text detector can't separate two bubbles that are quite close together.


bropines commented on May 9, 2024

> I looked at it but I saw that in fact the result was not very good either (even if still a little better than the one used). One of the problems is that comic text detector can't separate two bubbles that are quite close

Could the user be given this feature? For example, before launching, you place a marker wherever there should be two, three, or more bubbles. Or train the neural network on a hundred manga. Tell me how; I have a lot of manga, and I'll train it.


Snowad14 commented on May 9, 2024

I was talking about EasyOCR, and I'm not good at deep learning or PyQt.


bropines commented on May 9, 2024

> I was talking about EasyOCR, and I'm not good at deep learning / pyqt

Understood. I also found out that EasyOCR does not cope well with Japanese/Chinese characters. More precisely, it often does not recognize them as text.


dmMaze commented on May 9, 2024

> any plan for EasyOCR ?

I'm not very convinced by its results. Its detected bounding boxes are actually a bit messy compared to CTD's. Also, it performs worse on manga.

Colab demo:
https://colab.research.google.com/drive/1etdG7_mpoSi5ekhbek53Ex6UJO6B3hgk?usp=sharing#scrollTo=mOUcpfxMAfKV

EasyOCR: [2 images]

CTD: [2 images]

The detector is more troublesome to replace than the recognizer. The main difference between CTD and other text detectors is that CTD performs (1) text mask segmentation and (2) text line grouping (detecting a block of text).

To integrate other detectors, we could design some rule-based algorithms to do (1) and (2), or even use CTD to do them and combine the results. I don't think it is worth the effort, and I would rather retrain CTD on more diverse data than try other OCR engines or online APIs. It would be helpful if you could provide some failure cases (for analysis, not training).
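
For illustration, a rough sketch of what such rule-based stand-ins for the two steps could look like, assuming the other detector only returns axis-aligned line boxes as `(x1, y1, x2, y2)`. The function names and the `gap` threshold are hypothetical, this is not how CTD works internally, and a box-level mask is much coarser than a real per-pixel text segmentation.

```python
from typing import Dict, List, Tuple

Box = Tuple[int, int, int, int]  # (x1, y1, x2, y2), assumed detector output


def boxes_are_close(a: Box, b: Box, gap: int) -> bool:
    """True if the boxes are within `gap` pixels of each other on both axes."""
    return not (
        a[2] + gap < b[0] or b[2] + gap < a[0]
        or a[3] + gap < b[1] or b[3] + gap < a[1]
    )


def group_lines(boxes: List[Box], gap: int = 10) -> List[List[int]]:
    """Step 2 stand-in: merge nearby line boxes into blocks (union-find).

    Returns lists of indices into `boxes`, one list per block.
    """
    parent = list(range(len(boxes)))

    def find(i: int) -> int:
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path halving
            i = parent[i]
        return i

    for i in range(len(boxes)):
        for j in range(i + 1, len(boxes)):
            if boxes_are_close(boxes[i], boxes[j], gap):
                parent[find(i)] = find(j)

    groups: Dict[int, List[int]] = {}
    for i in range(len(boxes)):
        groups.setdefault(find(i), []).append(i)
    return sorted(groups.values())


def boxes_to_mask(boxes: List[Box], height: int, width: int) -> List[List[int]]:
    """Step 1 stand-in: rasterize boxes into a binary mask (1 = text)."""
    mask = [[0] * width for _ in range(height)]
    for x1, y1, x2, y2 in boxes:
        for y in range(max(y1, 0), min(y2, height)):
            for x in range(max(x1, 0), min(x2, width)):
                mask[y][x] = 1
    return mask
```

Note that a pure distance rule like this has the same weakness discussed in this thread: two balloons that sit close together would be merged into one block unless `gap` is tuned per page.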

> comic text detector can't separate two bubbles that are quite close

I believe the main reason it performs step 2 (text line grouping) worse on manga is that part of its training data comes from Manga109-s, which treats connected balloons as one block. Some postprocessing has been deployed to address this, but things can still go wrong.


bropines commented on May 9, 2024

By the way, are we going to start implementing this inpainting model, or is there no point in it?

