
lightcrawler's Introduction

lightcrawler

Crawl a website and run each page through Google Lighthouse.

npm install --save-dev lightcrawler

lightcrawler --url https://atom.io/ --config lightcrawler-config.json

where lightcrawler-config.json looks something like this:

{
  "extends": "lighthouse:default",
  "settings": {
    "crawler": {
      "maxDepth": 2,
      "maxChromeInstances": 5
    },
    "onlyCategories": [
      "Accessibility",
      "Performance",
      "Best Practices"
    ],
    "onlyAudits": [
      "accesskeys",
      "aria-allowed-attr",
      "external-anchors-use-rel-noopener",
      "geolocation-on-start",
      "no-document-write",
      "no-mutation-events",
      "no-old-flexbox",
      "time-to-interactive",
      "user-timings",
      "viewport",
      "without-javascript"
    ]
  }
}
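Before starting a long crawl it can help to sanity-check the config file. The following is a minimal sketch, not part of lightcrawler: `check_config` is a hypothetical helper, and the rules it applies (the two crawler keys are positive integers when present, `onlyCategories` and `onlyAudits` are lists of strings when present) are assumptions inferred from the example above. lightcrawler itself may accept other shapes or apply defaults.

```python
import json


def check_config(text):
    """Return a list of problems found in a lightcrawler-style config string.

    Hypothetical helper: the rules below are assumptions based on the
    example config, not checks that lightcrawler is known to perform.
    """
    config = json.loads(text)
    settings = config.get("settings", {})
    crawler = settings.get("crawler", {})
    problems = []

    # Crawler limits: only checked when the key is present.
    for key in ("maxDepth", "maxChromeInstances"):
        if key not in crawler:
            continue
        value = crawler[key]
        # bool is a subclass of int in Python, so reject it explicitly.
        if isinstance(value, bool) or not isinstance(value, int) or value < 1:
            problems.append(
                f"settings.crawler.{key} should be a positive integer, got {value!r}"
            )

    # Category and audit filters: expected to be lists of strings.
    for key in ("onlyCategories", "onlyAudits"):
        if key not in settings:
            continue
        value = settings[key]
        if not isinstance(value, list) or not all(isinstance(v, str) for v in value):
            problems.append(f"settings.{key} should be a list of strings")

    return problems


if __name__ == "__main__":
    with open("lightcrawler-config.json", encoding="utf-8") as handle:
        for problem in check_config(handle.read()):
            print(problem)
```

An empty result only means the file passed these assumed checks; Lighthouse may still reject audit or category names it does not know.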

Enjoy!

lightcrawler's People

Contributors

kevinsawicki, kuychaco, robertjgabriel


lightcrawler's Issues

Ability to use the extra-headers feature of lighthouse

Lighthouse has an --extra-headers CLI flag that lets us pass additional headers (e.g. cookies) with each request. Is there a way of doing this with lightcrawler?

I have looked through the code, but I cannot see any way of doing this at the moment.
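For reference, this is roughly how the flag is used with Lighthouse on its own. The sketch below only builds the command line for a single URL and does not add anything to lightcrawler, which (per this issue) has no equivalent option. `lighthouse_command` is a hypothetical helper name, and the cookie value is a placeholder. It relies on `--extra-headers` accepting a JSON string, which the Lighthouse CLI documents.

```python
import json


def lighthouse_command(url, headers):
    """Build an argument list that runs Lighthouse on one URL with extra headers.

    Hypothetical helper: it calls the Lighthouse CLI directly and is not
    a lightcrawler feature.
    """
    # Lighthouse parses the value of --extra-headers as a JSON object
    # mapping header names to header values.
    return ["lighthouse", url, "--extra-headers", json.dumps(headers)]


if __name__ == "__main__":
    command = lighthouse_command("https://atom.io/", {"Cookie": "session=PLACEHOLDER"})
    print(command)
    # To actually run it (requires the lighthouse CLI on PATH):
    #   import subprocess
    #   subprocess.run(command, check=True)
```

Passing the arguments as a list avoids shell quoting problems with the JSON string.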

Error: spawn UNKNOWN

I created a test project, but an error occurred when I tried to run it:

Error: spawn UNKNOWN
at exports._errnoException (util.js:1020:11)
at ChildProcess.spawn (internal/child_process.js:328:11)
at Object.exports.spawn (child_process.js:369:9)
....

Any ideas?

Feature: pass credentials to login

Web apps often have a large chunk of their functionality hidden behind a login screen. It would be useful to have a way to supply credentials while crawling so that those pages can be audited too. Simple crawler supports this, so all that's needed is support in the API. Thanks!

Are there page by page results?

If the site is crawled with Lighthouse, where are the URLs and the per-page scores? I was expecting CSV or JSON file output.

Use case: site report in HTML

A Lighthouse user sent me a use case, and I thought it might make sense on the lightcrawler side:

I have a clean WP site, I install a new theme, and we want to run a set of audits across the pages: crawl the pages, record LH reports for each page, take final screenshots of each page, and spit out a summary report with links to individual pages.

Is that UX something you'd consider in scope for lightcrawler? It takes a somewhat different approach to reporting (HTML vs. stdout), so I'm curious what you think.
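To make the summary-report idea concrete, here is a minimal sketch of the last step only: turning per-page results into an HTML index with links to individual reports. Everything in it is hypothetical. `summary_html`, the `url`/`score`/`report` keys, and the file names are made up for illustration and are not lightcrawler output. Crawling, running Lighthouse, and taking screenshots are left out.

```python
from html import escape


def summary_html(pages):
    """Render an HTML table linking each crawled URL to its report.

    `pages` is a list of dicts with hypothetical keys:
    'url' (page address), 'score' (a number), 'report' (path to the
    per-page report file).
    """
    rows = []
    for page in pages:
        rows.append(
            '<tr><td>{url}</td><td>{score}</td>'
            '<td><a href="{report}">report</a></td></tr>'.format(
                # escape() also escapes quotes by default, which keeps the
                # href attribute well-formed.
                url=escape(page["url"]),
                score=escape(str(page["score"])),
                report=escape(page["report"]),
            )
        )
    header = "<tr><th>URL</th><th>Score</th><th>Report</th></tr>"
    return "<table>\n" + header + "\n" + "\n".join(rows) + "\n</table>"


if __name__ == "__main__":
    example = [
        {"url": "https://example.com/", "score": 91, "report": "reports/home.html"},
        {"url": "https://example.com/about", "score": 78, "report": "reports/about.html"},
    ]
    print(summary_html(example))
```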
