Giter VIP home page Giter VIP logo

skotz / captcha-breaking-library Goto Github PK

View Code? Open in Web Editor NEW
81.0 81.0 25.0 70.82 MB

Neural network, contour analysis, bitmap vector subtraction CAPTCHA solving library and scripting language with perceptive color space segmentation

License: GNU General Public License v3.0

C# 98.02% Inno Setup 1.85% Batchfile 0.13%
c-sharp captcha captcha-breaking colorspace contour-analysis neural-network scripting-language

captcha-breaking-library's People

Contributors

skotz avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar

captcha-breaking-library's Issues

Create walk-through

There's currently a massive learning curve to get your first project going. Create a walk-through to help new users.

Images name to train

Hi Scott, during the training I see that:
Image name will be the value in captcha but if there is no other way to name the file we will not able to train multiple image with same value but different in format. Say I have image: ABC.bmp & if another image name ABC.bmp with different format, I will not able to train the solver with 2nd one if not overwrite 1st.

Do you see any way we can solve this?

Thank you.

Please support to provide document on how to train/generate db for solver!

Thank Skotz for your very good lib.

I'm working on an captcha which I can work on the filter to have clean final image but currently I don't see how to train the solver and get db?

Actually I have just read previous Issue ticket but not sure how to implement the training properly.

Such as we need to change script to Train & Save instead of Solve?
TRAIN, "I:\qqq"
SAVE, I:\qqq\new.db"

I do see we need a test pool of as big as possible images but how can tell Solver which is good which is bad?

On your script in previous Issue:
TRAIN, "qqq"
FULLTEST, "qqq", "test.txt"

I even see the result: TOTAL CORRECT: 213 (213/266 = 80.08%)

Is that we are manually calculate or the lib support that?

Thank for your support in advanced.

My script.
SetMode, all
SetupSegmenter, BLOB, 30, 40, 4
SetupSolver, BVS, "ABCDEFGHIJKLMNOPQRSTUVWXYZ", 50, 60

DefinePreconditions
Resize, 400, 140
RemoveSmallBlobs, 10, 4, 4
BlackAndWhite
Grow
Grow
Median
EndPreconditions

Solve, %IMAGE%

Create solver creation wizard tool

You currently need to continuously play around with image manipulation functions to correctly segment an image, saving intermediate steps to image files for examination. Create an integrated environment with a wizard-like tool for tweaking segmentation settings.

Seems TRAIN is not working

SETMODE, ALL
SETUPSEGMENTER,  BLOB, 3, 15, 6
SETUPSOLVER,     MNN, "0123456789", 20, 20
DEFINEPRECONDITIONS
    REMOVENONCOLOR, 60
    MEDIAN, 3
    BINARIZE, 80
    REMOVESMALLBLOBS, 180, 4, 14
ENDPRECONDITIONS
TRAIN, "I:\qqq"
SAVE, I:\qqq\new.db"

new.db file too small (less then one capthca file 60kb bmp)
Captcha images very good segmented.
FULLTEST show 0 from 266 images recognized

Merge training and production script execution

Right now you basically need to create two CBL scripts to solve a CAPTCHA: one for training the solver and one for solving individual images. Create some way to merge these two ideas together to prevent confusion.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.