
Comments (4)

rviscomi commented on September 15, 2024
  1. All the old URLs are dead and gone now? (for desktop)

Of the 1.3M desktop URLs we've been testing for ~6 months, 1M of them are still in the corpus. We considered including the remaining 300k URLs for consistency across crawls but opted not to for simplicity.

(we = myself, @igrigorik @pmeenan @paulcalvano)

  2. Presumably you're doing the same thing for mobile?

Yes. We plan to roll that out next month.

  3. @brendankenny said there weren't a lot of HTTPS URLs in the old HA URL list. What's the balance in this new URL list?

This list is 60% HTTPS:

SELECT
  SUM(IF(STARTS_WITH(url, 'https'), 1, 0)) / COUNT(0) AS pct_https
FROM
  `httparchive.urls.2018_12_15_desktop`

  4. We thought the 1.5M run was barely finishing in under 2 weeks. Is there new WPT capacity being added to handle this load?

Yeah, we used to run each page 3 times on desktop, 3 times on mobile, and once on Lighthouse. As of December 1 we run each page once in all 3 configurations. That frees up capacity to increase the desktop corpus. To afford the mobile increase, we're going to reduce the crawl frequency from every 15 days to monthly. I wrote up a short summary / sanity check in this doc.
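
For anyone who wants to sanity-check the capacity argument, here is a rough back-of-the-envelope sketch in Python. It is not taken from the linked doc. It assumes crawl time scales linearly with the number of test runs, that desktop capacity is budgeted on its own, and that "monthly" means roughly 30 days; the function name `max_corpus` is made up for illustration.

```python
def max_corpus(old_urls, old_runs_per_url, new_runs_per_url, window_scale=1.0):
    """Number of URLs that fit in the old run budget.

    Assumes crawl time is proportional to total test runs (an assumption,
    not something stated in the thread). window_scale > 1 models a longer
    crawl window, e.g. moving from a 15-day to a 30-day cadence.
    """
    budget = old_urls * old_runs_per_url * window_scale
    return int(budget // new_runs_per_url)


# Desktop: 1.3M URLs at 3 runs each, now 1 run each, same crawl window.
desktop_capacity = max_corpus(1_300_000, old_runs_per_url=3, new_runs_per_url=1)
print(desktop_capacity)  # 3900000

# Mobile: 3 runs -> 1 run, and the window goes from 15 days to ~30 days.
mobile_scale = (3 / 1) * (30 / 15)
print(mobile_scale)  # 6.0
```

Under those assumptions, dropping from 3 runs to 1 leaves room for about 3x as many desktop URLs in the same window, and the longer mobile cadence roughly doubles that headroom again.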

from httparchive.org.

rviscomi commented on September 15, 2024
mysql> select count(0) from urlsdev;
+----------+
| count(0) |
+----------+
|  3880557 |
+----------+
1 row in set (0.00 sec)

mysql> select count(0) from urlsmobile;
+----------+
| count(0) |
+----------+
|  1294054 |
+----------+
1 row in set (0.44 sec)


paulirish commented on September 15, 2024

Random questions:

  1. All the old URLs are dead and gone now? (for desktop)
  2. Presumably you're doing the same thing for mobile?
  3. @brendankenny said there weren't a lot of HTTPS URLs in the old HA URL list. What's the balance in this new URL list?
  4. We thought the 1.5M run was barely finishing in under 2 weeks. Is there new WPT capacity being added to handle this load?


paulirish commented on September 15, 2024

Sounds great! Thanks for the answers.

