Giter VIP home page Giter VIP logo

kevincobain2000 / email_extractor Goto Github PK

View Code? Open in Web Editor NEW
65.0 9.0 34.0 417 KB

Yes it works! God Speed. Email Extractor by Full Url Crawl. Extract emails and web urls from a website with full crawl or option limit, depth of urls to crawl using terminal.

Home Page: https://email-extractor.coveritup.app/extract

Shell 1.43% Go 33.29% JavaScript 0.43% Astro 64.84%
email email-extractor url-crawler crawl-all-urls email-extraction email-marketing online-email-extractor

email_extractor's People

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

email_extractor's Issues

Add to save

I will like this script to result able to save to file.
Also if possible to save interval by internal not until it crawl all urls Eg. if we are to crawl 40,000 pages website. it will crawl all before it grep one email to file

[Feature request] Domain exclusion list

It would be nice to have a way of excluding a list of URLs or base domains such that the crawler, if it comes to one of these domains, doesn't end up clicking through it and continuing the crawl.

This could be useful for sites that you know are not useful or valid for the search.

save interval by internal from #1

creating separate issue from #1

@sirolele

Also if possible to save interval by internal not until it crawl all urls Eg. if we are to crawl 40,000 pages website. it will crawl all before it grep one email to file

[Feature request] Parse email templates and reform

For example:

  • email(at)domain(dot)com
  • email[at]domain[dot]com
  • first(dot)last(at)domain(dot)com

If the full text is being parsed for email matching then I think it would be good to match and then reform the email based on substitution.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.