Giter VIP home page Giter VIP logo

Comments (3)

Dexterp37 avatar Dexterp37 commented on June 11, 2024

The probe scraper currently produces 3 JSON files:

  • revisions.json
  • general.json
  • probes.json

The general.json only contains the lastUpdate property, which is the date the files were generated.

The revisions.json file contains a list of revision hash -> Firefox version mapping for each channel.

Finally, the probes.json file contains the data for all the probes. It has the following format:

{
  probe_id: {
    type: 'histogram',
    name: 'some name',
    history: {
      "nightly": [
        {
          optout: True,
          ... other probe details ...
          revisions: {first: ..., last: ...}
        },
        ...
      ],
      ... other channels ...
    }
  }
}

My proposal is to keep the current format, writing a separate probes.json file for each channel, and only list the relevant channel data in the history section of the probe.

@sunahsuh and @fbertsch, since you are the main consumers of the probe data I can think of right now, what do you think of this proposal?

from probe-scraper.

fbertsch avatar fbertsch commented on June 11, 2024

Quick recap from discussion in IRC:

  • This will include historic probes. Yay!
  • We can remove revisions.json and just sub in the FF version
  • Long-term we may need to rethink how we deal with historic revisions

from probe-scraper.

Dexterp37 avatar Dexterp37 commented on June 11, 2024

This happened, the output is already being written to S3 and the documentation changes landed. Closing this off!

from probe-scraper.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.