Trying to figure out if there's a way that <a href="https://github.com/DataDog/datadog

Yes, Datadog evaluates the idea to provide some for knapsack's edge case to ret

<a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/us

Improve handling of skipped tests and dynamic parallelism about knapsack_pro-ruby HOT 10 OPEN

devinburnette commented on June 23, 2024

Improve handling of skipped tests and dynamic parallelism

from knapsack_pro-ruby.

Comments (10)

anmarchenko commented on June 23, 2024 1

Yes, Datadog evaluates the idea to provide some script for knapsack's edge case to return percentage of skipped tests (the best we can do on library's side), we will see if this is going to be enough for @devinburnette's case

from knapsack_pro-ruby.

anmarchenko commented on June 23, 2024

From Datadog side, I would be happy to provide a list of skipped tests to knapsack library at some point in the lifecycle; unfortunately right now we instrument the knapsack:queue:rspec_go command that is running after the parallel runners are created. The runners are created outside of knapsack (in static config of Circle CI), so I am curious what could be done in this case.

from knapsack_pro-ruby.

ArturT commented on June 23, 2024

@devinburnette Please ensure you use the latest version of the knapsack_pro gem and datadog-ci.

knapsack determines subsets and breaks the list up but because most things are being skipped, some parallel nodes end up with no tests to run and it throws the entire split timings out of wack and the job takes too long to complete (in some cases longer than without skipping any tests).

Could you share a link to the problematic CI build in the knapsack_pro dashboard or commit hash?

@anmarchenko said:

The runners are created outside of knapsack (in static config of Circle CI), so I am curious what could be done in this case.

@devinburnette Does it mean you use CircleCI and the Rerun only failed tests feature?
https://docs.knapsackpro.com/ruby/circleci/#rerun-only-failed-tests

Is the problem related to retried CI nodes that run only a subset of the test suite? In such case CircleCI reruns all the nodes but only some would run tests when there is less test files than the number of parallel nodes. Number of parallel nodes would have to be configured in CircleCI yml config file even if Knapsack API could provide the suggested number of parallel nodes to use.

datadog's ITR examines the test coverage and determines that most of the tests can actually be skipped, but we still pass the full list of tests to knapsack in our CI config.

@devinburnette How do you pass list of tests to Knapsack?

from knapsack_pro-ruby.

anmarchenko commented on June 23, 2024

This is not a retry they are doing but a normal test run: Datadog is able to skip tests based on commit data to run only relevant tests. So for example instead of 9500 tests it runs 100 of them and skips 9400: dozens of parallel workers waste time in this case.

from knapsack_pro-ruby.

ArturT commented on June 23, 2024

@anmarchenko Are tests skipped by DataDog during the runtime of the RSpec tests?
Is it visible as skipped (pending tests) in the RSpec summary?

Or does DataDog return a list of tests to skip up front and then Knapsack is fed with the list using KNAPSACK_PRO_TEST_FILE_LIST_SOURCE_FILE?

from knapsack_pro-ruby.

devinburnette commented on June 23, 2024

Yes. They are skipped during the runtime of rspec marked as pending. The source file passed to knapsack contains the entire set of tests including the skipped ones.

from knapsack_pro-ruby.

ArturT commented on June 23, 2024

Do you skip individual test cases within a test file or do you always skip all test cases within a test file?

from knapsack_pro-ruby.

anmarchenko commented on June 23, 2024

Individual tests within a test file. The tests are visible as pending in RSpec summary.

from knapsack_pro-ruby.

ArturT commented on June 23, 2024

DataDog could provide the total execution time of tests that are going to run (without skipped tests) because it tracks individual test cases.

Note

Knapsack Pro does not track RSpec individual test cases unless they belong to bottleneck test files that must be split by examples with this feature.

The customer could use the total execution time of tests to estimate the number of parallel nodes needed. Perhaps it could be a Ruby or bash script that pulls execution time from DataDog, estimates the parallel nodes number, and configures CI.

For example, DataDog wants to run 100 test cases. Their total execution time is 30 minutes.
The customer desires to run tests under 5 minutes per parallel CI node.

30 minutes / 5 minutes = 6 nodes are needed.

The customer dynamically configures their CI to run 6 nodes instead.
Knapsack does the rest without any changes.

Would that make sense? Do you see any issue with this approach?

from knapsack_pro-ruby.

ArturT commented on June 23, 2024

The percentage of skipped tests might be good enough. If you run only 20% of the test suite, then you could also run 20% of the parallel nodes, or a bit more if these 20% of tests are very slow.

from knapsack_pro-ruby.

Improve handling of skipped tests and dynamic parallelism about knapsack_pro-ruby HOT 10 OPEN

Comments (10)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent