Giter VIP home page Giter VIP logo

aasimsani / artificial_manga_panel_dataset Goto Github PK

View Code? Open in Web Editor NEW
42.0 2.0 8.0 6.79 MB

There's no publicly available free-to-use Manga Dataset, so I decided to make one artificially!

Home Page: https://aasimsani.github.io/artificial_manga_panel_dataset

License: MIT License

Python 99.25% Jupyter Notebook 0.75%
manga-panels japanese-fonts kaggle dataset artificial-data artificial-dataset layouting-engine jesc-dataset manga

artificial_manga_panel_dataset's People

Contributors

aasimsani avatar jlog3 avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar

artificial_manga_panel_dataset's Issues

Kaggle datasets/speech_bubbles_dataset/writing_area_labels.csv file, the 'imagename' entries contain a "~" character in their paths, whereas the actual file paths in the speech_bubbles_dataset/files/ directory use a "-" character instead

In the Kaggle datasets/speech_bubbles_dataset/writing_area_labels.csv file, the 'imagename' entries contain a "~" character in their paths, whereas the actual file paths in the speech_bubbles_dataset/files/ directory use a "-" character instead.

wrong: "datasets/speech_bubbles_dataset/files/white~friendlystock-chat-bubble-47.png"
correct: "datasets/speech_bubbles_dataset/files/white-friendlystock-chat-bubble-47.png"

As a workaround, I've implemented a simple string replacement operation in my local script to replace the "~" character with "-", and the function is working as expected.
main.py 128 and 191 you can add, after the initial speech_bubble_tags declaration so that:

speech_bubble_tags = pd.read_csv(speech_bubbles_path +
                                         "writing_area_labels.csv")
# Replace "~" with "-" in the 'imagename' column to match the file paths
speech_bubble_tags['imagename'] = speech_bubble_tags['imagename'].str.replace('~', '-') # fix kaggle path error

Error logs:
Testing:
FAILURES test_speech_bubble_dumping
speech_bubble_writing_area = speech_bubble_tags[area_idx]['label']

      speech_bubble_writing_area = speech_bubble_writing_area.values[0]

E IndexError: index 0 is out of bounds for axis 0 with size 0

preprocesing/layout_engine/page_dataset_creator.py:1546: IndexError

Traceback (most recent call last):
File "main.py", line 203, in
speech_bubble_tags
File "/home/john/Desktop/translate/artificial_manga_panel_dataset/preprocesing/layout_engine/page_dataset_creator.py", line 2365, in create_page_metadata
speech_bubble_tags
File "/home/john/Desktop/translate/artificial_manga_panel_dataset/preprocesing/layout_engine/page_dataset_creator.py", line 1660, in populate_panels
minimum_speech_bubbles
File "/home/john/Desktop/translate/artificial_manga_panel_dataset/preprocesing/layout_engine/page_dataset_creator.py", line 1546, in create_single_panel_metadata
speech_bubble_writing_area = speech_bubble_writing_area.values[0]
IndexError: index 0 is out of bounds for axis 0 with size 0

P.S. This project is really awesome thank you!

Amazing job!

Hello! I'm Felipe.

Do you have a pretrained IA for Object Detection for Comics using Manga109 Annotations?
I read README.md and you used this paper to benchmark your dataset.

I want to create a autogenerated videos from Manga or Manhwa (get panels, show panels instead whole page, get text of bubbles speech, create narrations, add sfx, etc.). The idea, you will input manga's page and it will return a video

Disclaimer: I don't too much about IA. I'm a frontend developer and its my first time at IA

Conclusion: If you don't have a pretrained or source code for implementation, would you like to give me an idea to start build my own?

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.