Giter VIP home page Giter VIP logo

scheduller-crawl-action-dashbaoard's Introduction

one of my goals is implementing a crawl dashbard with some action and scheduler. When I started to find anybody else who thinks of it I found some good repos:

https://github.com/DmitryKey/url-crawler https://github.com/gurtejrehal/FALCON---AI-Data-Crawler https://github.com/Lifeni/crawler-test

Exactly what I need as web app is [Web Alert link:https://webalert.me] android apk.

Although It can be something like [Airbyte link:airbyte.io ??] as [ETL links:wiki ?? ] but I've never seen ETL with crawller. Also like [processmaker link:processmaker.com ??] it can be a BPMN, but in this case you can't design yourself and it's single responsible for crawlling and schedulling a crawl with approach of extending in feauture as global scheduller.

so I preffer name it scheduller-crawl-action-dashbaoard

Proposal

At first I beleive that it can do with laravel shcedule and basic [Gazzel link: ??] curl to gather data from any url. After that it may concern what will happen if the IP banned or return some error. So it's better to start with a simple idea that I need just a dashboard to list some schdeduled job as curl and then improve it in [Roadmap link:#roadmap ??] section.

Roadmap

  • dashboard to view crawlled urls
  • changing intervall in dashboard
  • show response of crawlled urls in dashbard
  • add static action to save crawlled urls in database as a Model
  • try to make action dynamic in help of [processmake link: processmaker.com ??]
  • try to make specific action based on older crawls and responses
  • work on DOM responses
  • implement API to expose gathered and structured data
  • try to demo the repo gllobally and get feedback
  • debug
  • design new roadmap

scheduller-crawl-action-dashbaoard's People

Watchers

H Shariati avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.