
zfs-buildbot's Introduction

The OpenZFS Buildbot Configuration

Welcome, this is the buildbot CI infrastructure used by the OpenZFS project. It's used to automate the process of testing pull requests. If you would like to contribute to improving our testing infrastructure please open a pull request against this GitHub repository.

Build and Test Strategy

Pull Requests

The OpenZFS project relies on GitHub pull requests to track proposed changes. Your pull request will be automatically tested by the buildbot and updated to reflect the results of the testing. As you fix the code and push revisions to your branch at GitHub, those changes will be queued for testing. There is no need to close a pull request and open a new one. However, it is strongly recommended that when refreshing a pull request you rebase it against the latest code to avoid merge conflicts.

Once a pull request passes all the automated regression testing it will be reviewed for inclusion by at least two OpenZFS developers. Normally they will provide review feedback on the change and may ask you to modify your pull request. Please respond to reviewers in a timely manner (within weeks, not months), otherwise your submission could be postponed or even rejected.

When all of the required builders report as passed, and the reviewers have added their signed-off-by, the pull request can be merged. After being merged the buildbot tests the change again to ensure nothing was overlooked. This is done to ensure the buildbot always stays green.

In the unlikely event that this final round of testing reveals an issue the merge may be reverted and the pull request reopened. Please continue iterating with the OpenZFS developers in a new pull request until the issue is resolved and the changes can be merged.

By default, the topmost commit in your PR will be functionally tested on a subset of required builders. Furthermore, the top five commits in your PR will be compile tested.

Individual builders may be requested on a per-commit basis by including the Requires-builders: directive in the commit message. When requesting specific builders they should be enumerated as a comma separated list.

  • Supported Builders (Platforms / Distributions):

    • amazon2: Amazon Linux 2 (x86_64)
    • centos6: CentOS 6 (x86_64)
    • centos7: CentOS 7 (x86_64)
    • debian10: Debian 10 (x86_64)
    • fedora30: Fedora 30 (x86_64)
    • ubuntu16: Ubuntu 16.04 LTS (x86_64)
    • ubuntu18: Ubuntu 18.04 LTS (x86_64)
    • freebsd12: FreeBSD 12 (x86_64)
    • freebsd13: FreeBSD 13 (x86_64)
    • freebsd14: FreeBSD 14 (x86_64)
  • Additional Builders:

    • arch: Build for all supported architectures.
    • builtin: Build OpenZFS into the latest (unreleased) Linux kernel.
    • coverage: Perform a code coverage analysis (ztest, ZTS).
    • none: Disable testing on all builders.
    • perf: Perform baseline performance testing.
    • style: Perform all required style checks.
  • Examples:

    • Requires-builders: arch,style,amazon2,coverage
    • Requires-builders: none

Builder Types

When a new pull request is opened it is queued up for testing on all of the available builders. There are four primary types of builders:

  • STYLE: These builders are responsible for performing static analysis. Currently, style checking and linting is performed for each submitted change. Changes are required to pass static analysis before being accepted.

  • BUILD: These builders are responsible for verifying that a change doesn't break the build on a given platform. Every commit in the pull request is built and reported on individually. This helps guarantee that developers never accidentally break the build.

    To maximize coverage there are builders for the major Linux distributions and FreeBSD. This allows us to catch distribution-specific issues and to verify the build on a wide range of kernels.

    Additional builders are maintained to test alternate architectures. If you're interested in setting up a builder for your distribution or architecture see the 'Adding a Builder' section below.

    No elevated permissions are required for this type of builder. However, it is assumed that all required development tools and headers are already installed on the system.

  • TEST: These builders are responsible for testing a change. This is accomplished by first building new packages, installing those packages, and finally running a standard battery of tests. Due to the time involved in running the entire suite of tests only the last commit in the pull request will be tested.

  • PERF: These builders are responsible for running performance regression tests for a change. These builders are not used by default and are only used when perf is provided as an option to Requires-builders in a commit message.

Reliable test results are obtained by using ec2 latent builders. This is important because when testing kernel modules it is not uncommon for a flaw in the patch to cause the system to panic. In this case the build slave will be suddenly disconnected and the master must be able to terminate the instance and continue on testing the next pull request.

Therefore, for each pull request a new pristine ec2 instance is created and the system is bootstrapped to start a new build slave. This new slave needs to be configured such that the buildbot user is allowed to run the sudo command without a password. This ensures the build slave can install packages, load the new kernel modules, and run other administrative commands. Once all testing has completed the instance is immediately terminated.

Build Steps and the runurl Utility

The OpenZFS project's buildbot makes extensive use of the runurl utility. This small script takes as its sole argument the URL of a script to execute. This allows us to configure a build step which references a trusted URL with the desired script. This means the logic for a particular build step can be separated from the master.cfg, which has some advantages:

  • Minimizes the disruption caused by restarting the buildbot to make changes live. This is only required when modifying the master.cfg. For example, when adding/removing a builder or adding a test suite.

  • Build and test scripts can be run independently, making it easy for developers to locally test proposed changes before submitting them.

  • Allows for per-builder and per-build customization via the environment. Each script can optionally source the following files to influence its behavior.

    • /etc/buildslave - This file is dynamically generated by the bb-bootstrap.sh script, which is run at boot time by the ec2 user data facility. It includes all the information required to configure and start a latent buildslave. Most importantly for scripts, this includes the BB_NAME variable, which is set to the build slave name.

    • <builddir>/TEST - This file is dynamically generated by the bb-test-prepare.sh script which is run before any of the test suites. It contains the contents of the TEST file from the ZFS source tree being tested. Additionally, all lines in your commit message which begin with TEST_ are appended to it. This allows for the commit message to control the exact testing being performed.

  • Provides a consistent way to trap and handle signals from the buildbot master. This is particularly helpful when attempting to collect debug information prior to terminating an instance.
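The behavior of runurl described above can be sketched as a small shell function, assuming only that curl is available; the real utility is a similarly small script and may differ in details.

```shell
# A minimal sketch of what the runurl utility does: fetch a trusted
# script from a URL and execute it, returning the script's exit code.
runurl() {
    url="$1"; shift
    script=$(mktemp) || return 1
    # Fetch the script, then execute it with any remaining arguments.
    if curl -fsSL "$url" -o "$script"; then
        sh "$script" "$@"
        rc=$?
    else
        rc=1
    fi
    rm -f "$script"
    return $rc
}
```

A build step then amounts to something like `runurl <scripts-url>/bb-test-ztest.sh`, and a developer can run the same script by hand to reproduce a step locally.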

Test Suites

To perform comprehensive testing, binary packages are built, installed, and used by all the test suites. This helps catch packaging mistakes and ensures we are testing in a realistic environment. However, some customization of the environment is required and that is the job of the bb-test-prepare.sh script. It dynamically generates the TEST file described above and may further customize the environment as needed.

After the environment has been prepared all of the registered test suites are run as separate build steps using runurl. By convention each of these scripts is named bb-test-* and is expected to exit with zero on success. An exit value of 1 indicates failure, 2 indicates a warning, and 3 indicates the step was skipped. If a build step fails the entire build is considered a failure. However, individual steps may exit with a warning or skipped exit code without failing the build. These steps are color coded in the waterfall and console views so they are easy to identify. A test script is expected to attempt to clean up the environment so subsequent test suites can be run. Depending on the severity of the failure this may not be possible, and additional test results should be treated skeptically.

A bb-test-* script should be designed to be as self-contained, flexible, and resilient as possible. A good example of this is the bb-test-ztest.sh script which can be found in the scripts/ directory. It is laid out as follows:

  • Source the /etc/buildslave and TEST files to pick up the build slave configuration and any per-build customization to the environment.

  • Check if the test suite should be skipped. By convention a TEST_*_SKIP variable is used and when set to "Yes" the test is skipped. Environment variables which can be overridden should try to follow the same naming convention: TEST_<test-suite>_<variable>.

  • Conditionally apply default values for environment variables which have not been explicitly specified. This makes it easy to skim the script and determine what options are available and what the default settings are.

  • Add a trap which attempts to clean up the test environment on EXIT. This way, if the script unexpectedly exits, subsequent tests may still be able to run successfully. This is also a good opportunity to collect useful debug output such as the contents of the dmesg log. Trapping SIGTERM is useful because the build master is configured to raise this signal before terminating a script which reaches its maximum timeout.

  • At its core, the test script should configure the environment as needed and then run the tests. This will likely involve loading the kernel modules, setting up disks or files to be used as vdevs, and invoking the actual test suite. It's advisable to run the test suite as a subprocess and wait for it to complete. This allows the parent process to continue to handle any signals it receives.

  • Finally, make sure your test script leaves the system as it found it. In many cases this can be handled by the existing EXIT trap, but this should be kept in mind.
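The layout above can be sketched as a skeleton script. It is written as a function so the conventional exit codes (0 success, 1 failure, 2 warning, 3 skipped) show up as return values; a real bb-test-* script would call exit directly. All TEST_EXAMPLE_* names are placeholders.

```shell
#!/bin/sh
# Hypothetical skeleton of a bb-test-* script.

bb_test_example() {
    # Pick up the build slave configuration and per-build overrides.
    test -f /etc/buildslave && . /etc/buildslave
    test -f ./TEST && . ./TEST

    # Skip check: a TEST_*_SKIP variable set to "Yes" skips the suite.
    if [ "$TEST_EXAMPLE_SKIP" = "Yes" ]; then
        return 3
    fi

    # Apply defaults only for variables not set by the files above.
    TEST_EXAMPLE_RUNTIME=${TEST_EXAMPLE_RUNTIME:-300}

    # Clean up on EXIT so later suites can still run; the master raises
    # SIGTERM when the maximum run time is reached, so trap it to exit
    # cleanly after collecting debug output (dmesg, logs).
    trap 'echo "cleanup: restoring environment"' EXIT
    trap 'exit 1' TERM

    # Run the real suite as a subprocess and wait for it, so this shell
    # keeps handling any signals it receives.
    sh -c "echo \"running suite for ${TEST_EXAMPLE_RUNTIME}s\"" &
    wait $! || return 1
    return 0
}

bb_test_example
echo "result=$?"
```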

Configuring the Master

Credentials

The master/passwords.py file contains the credentials required for the buildbot to interact with ec2 and GitHub. It stores static passwords for non-ec2 build slaves, the web interface and buildbot try. See the master/passwords.py.sample file for a complete description.

Adding a Builder

The process for adding a new builder varies slightly depending on the type (BUILD or TEST) and on whether it's a standard or latent builder. In all cases the process begins by adding the new builder and slave to the master.cfg file. One important thing to be aware of is that each builder can potentially have multiple build slaves. However, the OpenZFS project configures each builder to have exactly one build slave.

The first step is to determine what kind of slave you're setting up. Both standard and latent build slaves are supported, but only latent build slaves are suitable for use by TEST builders. Again, this is because test slaves expect a pristine test environment for every build and this can only be reliably accomplished by using a new instance for every test run.

Once you've settled on the type of slave add a line to the c['slaves'] array in the BUILDSLAVES section of the master.cfg. Use the functions mkEC2BuildSlave, mkEC2TestSlave, or mkBuildSlave as appropriate.

  • mkEC2BuildSlave(name, ami) - Takes a user defined name and an available ami identifier. When using this type of slave a new on-demand ec2 instance will be instantiated and bootstrapped using the ec2 user data interface. Make sure the bb-bootstrap.sh script has been updated to know how to install and start the build slave based on the name provided. The name normally includes the distribution and version, which makes it easy to install and start the correct build slave. Once running, this type of slave won't be terminated until the builder's pending queue has been empty for 30 minutes. This allows for multiple builds to be handled in quick succession.

  • mkEC2TestSlave(name, ami) - An ec2 test slave is virtually the same as an ec2 build slave but they do differ in a few important ways. Most importantly a test slave creates a large ec2 spot instance. Large instances with multiple processors aren't required for building but they are needed to expose concurrency issues. Test instances are terminated and re-instantiated between every build to guarantee reproducible results.

  • mkBuildSlave(name) - Used to create a normal dedicated build slave with the given name. Build slaves of this type must be manually configured to connect to the build master at build.openzfs.org:9989 using a prearranged password. This password is stored along with the slave name in the master/passwords.py file. The build slave must be available 24/7 to run jobs and have all required ZFS build dependencies installed. This kind of builder is best suited for testing non-x86 architectures which are unsupported by ec2.
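Putting the three helpers together, the c['slaves'] entries might look like the following sketch; the slave names and AMI identifiers here are placeholders, not real entries from master.cfg.

```python
# Hypothetical sketch of the BUILDSLAVES section of master.cfg.
c['slaves'] = [
    # Latent ec2 on-demand slave, bootstrapped via ec2 user data.
    mkEC2BuildSlave("Fedora-30-x86_64-build", "ami-0123456789abcdef0"),
    # Large ec2 spot instance, re-created for every test run.
    mkEC2TestSlave("Fedora-30-x86_64-test", "ami-0123456789abcdef0"),
    # Dedicated non-ec2 slave; its password lives in master/passwords.py.
    mkBuildSlave("Debian-10-arm64-build"),
]
```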

Now that you've created a build slave, a builder needs to be created which owns it. Jump down to the c['builders'] array in the BUILDERS section and add a BuilderConfig entry to the appropriate section. Each builder must have a unique name and your slave must be added to the list of slavenames. Set the factory to build_factory for BUILD type builders and to test_factory for TEST builders. Then set the properties, tags, and mergeRequests options as appropriate.

Next add the builder by name to either the build_builders or test_builders list in the SCHEDULERS section. These describe which builders should react to incoming changes.
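The two steps above might look like the following master.cfg fragment; the builder and slave names are placeholders.

```python
# Hypothetical BUILDERS entry: one builder owning exactly one slave.
c['builders'].append(BuilderConfig(
    name="Fedora 30 x86_64 (BUILD)",
    slavenames=["Fedora-30-x86_64-build"],  # one slave per builder
    factory=build_factory,                  # test_factory for TEST builders
    properties={},
    tags=["BUILD"],
    mergeRequests=False))

# SCHEDULERS: make the scheduler react to changes on the new builder.
build_builders.append("Fedora 30 x86_64 (BUILD)")
```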

Finally, you must restart the build master to make it aware of your new builder. It's generally a good idea to run buildbot checkconfig to verify your changes. Then wait until the buildbot is idle before running buildbot restart in order to avoid killing running builds.

Updating an EC2 Build Slave to Use a Different AMI

New AMIs for the latest release of a distribution are frequently published for ec2. These updated AMIs can be used by replacing the current AMI identifier used by the build slave with the new AMI identifier. All build slaves are listed in the BUILDSLAVES section of the master.cfg file. Remember the buildbot will need to be restarted to pick up the change.

Adding a Test Suite

Test suites are added as a new test_factory build step in the FACTORIES section of the master.cfg file. As described in the 'Test Suites' section each test suite consists of a wrapper script which is executed by the runurl utility.

All test scripts should be named bb-test-* and placed in the scripts/ directory. When the build step runs, the script will be fetched and executed. At a minimum each build step should run in a different workdir and specify a maximum run time using maxTime. By default buildbot will log all output to stdio. Optionally, the contents of specific files can be logged by adding them to logfiles. Finally, set a clear description and descriptionDone message to be reported by the web interface.
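A build step with these options might look like the following sketch; the script name and logged file path are placeholders, and bb_url is the trusted scripts/ URL already defined in master.cfg.

```python
# Hypothetical test_factory build step in the FACTORIES section.
test_factory.addStep(ShellCommand(
    command=["runurl", bb_url + "bb-test-example.sh"],
    workdir="build/test-example",       # each step uses its own workdir
    maxTime=3600,                       # seconds before the step is killed
    logfiles={"test": "test-example/log"},  # extra files to capture
    description=["testing", "example"],
    descriptionDone=["test", "example"]))
```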

To activate the new build step the build master must be restarted. It's generally a good idea to run buildbot checkconfig to verify your changes. Then wait until the buildbot is idle before running buildbot restart in order to avoid killing running builds.

Running a Private Master

The official OpenZFS buildbot can be accessed by everyone at http://build.openzfs.org/ and it is integrated with the project's GitHub pull requests. Developers are encouraged to use this infrastructure when working on a change. However, this code can be used as a basis for building a private build and test environment. This may be useful when working on extending the testing infrastructure itself.

Generally speaking, to do this you will need to create a passwords.py file with your credentials, then list your builders in the master.cfg file, and finally start the build master. It's assumed you're already familiar with Amazon ec2 instances and their terminology.

Private master setup example on Ubuntu 18.04 AWS instance

As "ubuntu" user:
sudo apt-get -y update
sudo apt-get -y upgrade
sudo apt-get -y dist-upgrade
sudo reboot
sudo apt-get -y install python python-pip gcc nginx virtualenv
sudo adduser --home /home/buildbot buildbot
sudo cp -a ~/.ssh /home/buildbot
sudo chown -R buildbot:buildbot /home/buildbot/.ssh
Create a new /etc/nginx/nginx.conf to proxy the buildbot webserver on port 8010 to port 80:
user www-data;

worker_processes 4;
pid /run/nginx.pid;

events {
	worker_connections 768;
	# multi_accept on;
}

http {
	server {
		listen 80 default_server;
		listen [::]:80 default_server;

		location / {
			proxy_set_header Host $http_host;
			proxy_set_header X-Real-IP $remote_addr;
			proxy_set_header X-Forwarded-For $proxy_add_x_forwarded_for;
			proxy_set_header X-Forwarded-Proto $scheme;

			proxy_pass http://localhost:8010;
		}

		location /scripts/ {
			root /home/buildbot/zfs-buildbot;
		}

		# Server sent event (sse) settings
		location /sse {
			proxy_buffering off;
			proxy_pass http://localhost:8010;
		}

		# Websocket settings
		location /ws {
			proxy_http_version 1.1;
			proxy_set_header Upgrade $http_upgrade;
			proxy_set_header Connection "upgrade";
			proxy_pass http://localhost:8010;
			proxy_read_timeout 6000s;
		}

		location /change_hook/github {
			proxy_set_header Host $http_host;
			proxy_set_header X-Real-IP $remote_addr;
			proxy_set_header X-Forwarded-For $proxy_add_x_forwarded_for;
			proxy_set_header X-Forwarded-Proto $scheme;
			proxy_pass http://localhost:8010/change_hook/github;
		}
	}
}
Enable nginx
sudo systemctl enable nginx
sudo systemctl start nginx
Now login as "buildbot" user and run:
virtualenv sandbox
source sandbox/bin/activate
pip2 install boto
pip2 install requests
pip2 install txgithub
pip2 install service_identity
pip2 install buildbot==0.8.14

# Patch our pip version of buildbot
#
# Note, the pip version of buildbot doesn't come with cfg-buildslaves.rst
# which some of our patches require.  Create a fake cfg-buildslaves.rst
# so that patch doesn't fail.
mkdir -p ~/sandbox/lib/python2.7/site-packages/docs/manual/
curl https://raw.githubusercontent.com/buildbot/buildbot/eight/master/docs/manual/cfg-buildslaves.rst -o ~/sandbox/lib/python2.7/site-packages/docs/manual/cfg-buildslaves.rst
cd ~/zfs-buildbot/master/patches/
for i in *.patch ; do patch -l -d ~/sandbox/lib/python2.7/site-packages -p2 < $i ; done

# Do initial buildbot setup and start the server
cd ~/zfs-buildbot/master
buildbot create-master .
buildbot start .

Here are some more tweaks you may want for your local buildbot server:

  1. Update buildslaves.py with a custom builder name (like "ZFSBuilder-devel")
  2. Set your name as the owner of the build workers.
  3. Use a different security_name firewall rule that allows you to login to a builder. That way, if it doesn't start correctly you can login to the builder and troubleshoot. You will also need to update ec2_default_keypair_name in passwords.py to point to your SSH key.
  4. Update the lines that reference 'build.zfsonlinux.org' and 'raw.githubusercontent.com' to point to your buildbot server's IP.
  5. If you're using a test branch from your own repo to kick off a buildbot build, you will want to add your repo to repository.py.

(see the diff below for examples of 1-5).

diff --git a/master/buildslaves.py b/master/buildslaves.py
index abc3911..2310dfd 100644
--- a/master/buildslaves.py
+++ b/master/buildslaves.py
@@ -133,7 +133,7 @@ esac
     def __init__(self, name, password=None, master='', url='', mode="BUILD",
                 instance_type="c5d.large", identifier=ec2_default_access,
                 secret_identifier=ec2_default_secret,
-                keypair_name=ec2_default_keypair_name, security_name='ZFSBuilder',
+                keypair_name=ec2_default_keypair_name, security_name='ZFSBuilder-can-ssh',
                 subnet_id=None, security_group_ids=None,
                 user_data=None, region="us-west-1", placement='a', max_builds=1,
                 build_wait_timeout=60, spot_instance=False, max_spot_price=0.10,
@@ -147,18 +147,20 @@ esac
         if not tags or tags is None:
             tags={
                 "ENV"      : "DEV",
-                "Name"     : "ZFSBuilder",
+                "Name"     : "ZFSBuilder-devel",
                 "ORG"      : "COMP",
-                "OWNER"    : "behlendorf1",
+                "OWNER"    : <your name here>,
                 "PLATFORM" : self.name,
                 "PROJECT"  : "ZFS",
             }

         if master in (None, ''):
-            master = "build.zfsonlinux.org:9989"
+            master = "<your buildbot server IP address>:9989"

         if url in (None, ''):
-            url = "https://raw.githubusercontent.com/openzfs/zfs-buildbot/master/scripts/"
+            url = "http://<your buildbot server IP address>/scripts/"

         if password is None:
             password = ZFSEC2Slave.pass_generator()


index 8082691..cf06f10 100644
--- a/master/master.cfg
+++ b/master/master.cfg
@@ -18,8 +18,10 @@ import re

 bb_slave_port = 9989
 bb_web_port = 8010
-bb_master = "build.zfsonlinux.org:9989"
-bb_url = "https://raw.githubusercontent.com/openzfs/zfs-buildbot/master/scripts/"
+bb_master = "<your buildbot server IP address>:9989"
+bb_url = "http://<your buildbot server IP address>/scripts/"


diff --git a/master/repository.py b/master/repository.py
index 8306bcc..d23d85c 100644
--- a/master/repository.py
+++ b/master/repository.py
@@ -1,7 +1,8 @@
 # -*- python -*-
 # ex: set syntax=python:

-zfs_repo = "https://github.com/openzfs/zfs.git"
+zfs_repo = "https://github.com/<your repo here>/zfs.git"
 linux_repo = "https://github.com/torvalds/linux.git"

 all_repositories = {
@@ -9,4 +10,8 @@ all_repositories = {
     "https://github.com/openzfs/zfs" : 'zfs',
     "https://github.com/torvalds/linux.git" : 'linux',
     "https://github.com/openzfs/zfs.git" : 'zfs',
+    "https://github.com/<your repo here>/zfs" : 'zfs',
+    "https://github.com/<your repo here>/zfs.git" : 'zfs',

  1. In your GitHub settings, set your GitHub webhook to point to: http://<your buildbot server IP>/change_hook/github

    Also set your github webhook secret to your github_secret value in password.py.

Licensing

See the LICENSE file for license rights and limitations.

zfs-buildbot's People

Contributors

allanjude, behlendorf, bunder2015, conan-kudo, crawfxrd, cyphar, dehacked, dinatale2, felixdoerre, gmelikov, jbergler, loli10k, nabijaczleweli, nedbass, ofaaland, pcd1193182, rincebrain, ryao, tonyhutter


zfs-buildbot's Issues

priority of building

I've noticed that the build system builds commits in the order that they were pushed to the pull requests. It also seems that all commits in a pull request are rebuilt, even if only a new commit is pushed at the end.

If possible, it would be nice to:

  • First build all 'final' commits, i.e. the ones where currently the TEST builds are performed.
    Then single-commit pull requests are not so easily stalled by multi-commit requests.
  • Prefer to build commits that have most other successfully built commits.
    Then code that fails style check and later fast builds will not hold the slower builders.

Avoid rebuilding previously built commits when a pull request is refreshed.

When a pull request is refreshed, the ZFS buildbot may rebuild commits that have already been built and haven't changed. It appears that the pull request event w/ synchronize action points us to a full list of commits. So to avoid duplicates, we would need to add some form of mechanism to the buildbot to keep track of what has been recently built.

We began discussing this in #40. Creating an issue so we don't lose track of it.

Suggestions from @inkdot7:

Regarding duplicates: https://developer.github.com/v3/activity/events/types/#pushevent makes me suspect that there could be a distinct member of each commit that may be of use.

Otherwise, a dictionary might be clearer about what it does. One could keep a list of e.g. the 100 last unique commit shas that have been propagated through CustomGitHubEventHandler. Then it will not grow too large. With the current turn-around in the buildbots, that should be plenty. The main purpose is to prevent back-to-back rebuilds when additional fix-up commits are pushed. Each time handle_pull_request is invoked, a local dictionary for quick lookup could be made from the list.
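The bounded-history idea suggested above could be sketched as follows; the class name and its use inside CustomGitHubEventHandler are hypothetical.

```python
from collections import deque

class RecentCommits(object):
    """Remember the last N commit shas queued, to skip duplicates."""

    def __init__(self, limit=100):
        self._order = deque()   # insertion order, for eviction
        self._seen = set()      # fast membership lookup
        self._limit = limit

    def add_if_new(self, sha):
        """Return True if sha was not built recently, recording it."""
        if sha in self._seen:
            return False
        self._order.append(sha)
        self._seen.add(sha)
        if len(self._order) > self._limit:
            self._seen.discard(self._order.popleft())
        return True
```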

keepalive_interval never used when making a test slave

mkEC2TestSlave passes keepalive_interval=60 to mkEC2UserDataSlave, but mkEC2UserDataSlave does not handle keepalive_interval. The solution is simple: add handling for keepalive_interval in mkEC2UserDataSlave. The only question is, what's an appropriate default value for keepalive_interval?
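A minimal sketch of the suggested fix, assuming simple keyword passthrough; the real helpers construct EC2 latent build slaves, and these stand-ins only model the argument plumbing (the default of 3600 is an arbitrary placeholder).

```python
def mkEC2UserDataSlave(name, ami, keepalive_interval=3600, **kwargs):
    # Accept keepalive_interval instead of silently dropping it and
    # forward it onward (the slave constructor is modeled as a dict).
    return dict(name=name, ami=ami,
                keepalive_interval=keepalive_interval, **kwargs)

def mkEC2TestSlave(name, ami):
    # mkEC2TestSlave already passes keepalive_interval=60.
    return mkEC2UserDataSlave(name, ami, keepalive_interval=60)
```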

Allow ZFS commit message to specify SPL dependency

As described in openzfs/zfs#3935 it would be useful if a ZFS commit message could specify either a SPL git hash or branch name as a dependency. In the short term this would allow us to automate the testing of merging the SPL into the ZFS source tree. Perhaps something like this:

Example commit

Example commit message.

Requires-spl: refs/pull/PR/head

I don't think bb-build-linux.sh actually builds ZFS in

I had a long bug report written, but deleted it, so here's the short one.

I was wondering how openzfs/zfs#12056 could happen, so I tried to reproduce it and found another incompatibility, which I'll be filing shortly. Wondering how that could happen as well, I checked the output from the kernel.org builtin bot and noticed that none of the CC/AS/LD/etc steps mentioned zfs. So I ran bb-build-linux.sh locally, and the resulting .config after it completed said "CONFIG_ZFS is not set" and did not contain ZFS.

I'll be opening a PR once I debug this, I just thought you probably should know.

List of public push webhooks

It would be nice to have a list of the public (if any) push webhooks in the ZoL repos.

I'm considering setting up a Jenkins auto builder at home, but I need (would like) the URLs to subscribe to.

Add install-fedora-on-zfs

The project https://github.com/Rudd-O/install-fedora-on-zfs has the ability to build a bootable disk image that boots from ZFS (and LUKS beneath it, if so desired). I do not know how buildbot works so I can't at this time produce a pull request to do it, but here are the details:

  • script source: master branch from the above URL
  • script inputs: --with-prebuilt-rpms=path/to/splandzfs/rpms/ --releasever=fedora-releasever
  • script output: root.img (positional parameter)

The script will boot-cycle the root.img two times: once to test that the no-hostonly initrd works fine, and once to test that the hostonly initrd works fine. This happens automatically within QEMU and requires no intervention (but, in the event that the boot process fails, QEMU will be killed after 12 minutes by the script).

Everything the script does is logged in detail, so diagnosing and reproducing any issue is trivial after that.

Fix `buildbot sighup` password authentication

When starting a latent ec2 build slave a random password is generated for that slave to use. This works fine until buildbot sighup is run to pick up configuration changes on the master. This somehow results in the master and the slave expecting different passwords so the slave is no longer able to connect. We'd like to be able to use buildbot sighup to pick up configuration changes because it doesn't require us to stop any currently running builds.

Upgrade buildbot to >= 2.7.x

We're still running buildbot 0.8.x, which requires python2 and is super old. We should upgrade to 2.7.x or higher. I've already started work on this.

Update xfstests

The zfs branch in our fork of xfstests needs to be rebased on the latest version of xfstests. This should be done in a way which is acceptable to the upstream xfstests maintainers. This would allow us to use a stock version of xfstests.

Debian 8 x86_64 (TEST) reports bc not found

Build step 13 (xfstests) failed. The stdio output shows that it is failing to find 'bc' (the math program) when checking its environment, before running any actual tests.

+ sudo -E ./check -zfs -x dio -x sendfile -x user
bc not found
+ RESULT=1
+ exit 1

This string appears in the xfstests-zfs code, in common.config:

130 export SED_PROG="`set_prog_path sed`"
131 [ "$SED_PROG" = "" ] && _fatal "sed not found"
132
133 export BC_PROG="`set_prog_path bc`"
134 [ "$BC_PROG" = "" ] && _fatal "bc not found"
135
136 export PS_ALL_FLAGS="-ef"
137

not specifying "Requires-spl:" fails to clone spl-0.7-release branch

Somehow a stray " gets appended to the git clone branch parameter:

 watching logfiles {}
 argv: ['git', 'clone', '--branch', 'spl-0.7-release"', 'https://github.com/zfsonlinux/spl.git', '.']
 using PTY: False
Cloning into '.'...
fatal: Remote branch spl-0.7-release" not found in upstream origin

check build logs from openzfs/zfs#8227

and a workaround by adding Requires-spl: spl-0.7-release
openzfs/zfs#8256

Not sure how this " gets there or what leaks it.

Automate OpenZFS patch porting

We recently discovered that it is possible to cherry-pick patches directly from the OpenZFS tree. Git will automatically attempt to find the right file to apply a hunk to if it is given enough time to process a change.

To allow git the time to search, you need the following in your git config

[merge]
    renameLimit = 999999

Now, in order to cherry-pick, clone your fork of zfs and add both zfsonlinux/zfs and openzfs/openzfs as remotes. Perform a git fetch --all. Now you can perform a git cherry-pick <OpenZFS commit> to port patches to ZoL.

It would be nice to have a script that would attempt to perform the above process automatically given an OpenZFS commit.

Support multiple buildslaves

The buildbot configuration should be updated to intelligently handle multiple buildslaves per builder. Specifically, we would want the builder to prefer an on-demand t2.micro slave for builds unless the pending builder queue was more than half a dozen deep, at which point it should be allowed to power up a larger spot instance to speed things up.

This can be done with buildbot adding multiple slaves per builder and providing a custom nextSlave() function which includes the above logic.
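The selection policy could be sketched as below. This models only the decision, with the pending queue depth supplied by the caller, since the exact buildbot hook plumbing is elided here; all names are made up.

```python
def pick_slave(small_slaves, large_slaves, pending_count, threshold=6):
    """Prefer a cheap on-demand slave until the queue gets deep."""
    # Power up the larger spot instance only when the pending builder
    # queue is more than half a dozen deep.
    if pending_count > threshold and large_slaves:
        return large_slaves[0]
    if small_slaves:
        return small_slaves[0]
    return large_slaves[0] if large_slaves else None
```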

Update the Instance Type to M5.Large

You're still using the m3.large instance type, which is $0.14 an hour on-demand with 2 vCPUs and 7.5 GB RAM. You may be able to get better and cheaper performance by migrating to the newer m5.large instances, which are $0.096 an hour with 2 vCPUs and 8 GB RAM.

Add EBS volumes for testing

All testing is currently performed against file vdevs since block storage wasn't easily available prior to migrating to EC2. However, the EC2 latent build slaves support adding volumes to instances. The master.cfg should be updated to attach at least one volume and the test scripts updated to use it. This will allow us to test the vdev_disk.c implementation.

SPL build not used during subsequent ZFS build

ZFS was built against the incorrect SPL, even though the SPL build showed the correct revision hash and reported success.

See build 51 on CentOS 7.1 x86_64: http://build.zfsonlinux.org/builders/CentOS%207.1%20x86_64%20%28TEST%29/builds/51

From the SPL build stdio:

+ test -f /etc/buildslave
+ . /etc/buildslave
++ BB_MASTER=build.zfsonlinux.org:9989
++ BB_NAME=CentOS-7.1-x86_64-testslave
++ BB_PASSWORD=PRMZQJZJV9NCI6GEFFRNW1C0
++ BB_ADMIN='Automated latent BuildBot slave <[email protected]>'
++ BB_DIR=/var/lib/buildbot/slaves/zfs
+ CONFIG_LOG=configure.log
+ case "$BB_NAME" in
+ CONFIG_OPTIONS='--enable-debug --with-spec=redhat'
+ MAKE_LOG=make.log
+ MAKE_OPTIONS=
+ INSTALL_LOG=install.log
+ ./autogen.sh
+ ./configure --enable-debug --with-spec=redhat
+ make pkg
+ case "$BB_NAME" in
+ sudo rm spl-0.6.5-8_ga3f5cc4.el7.centos.src.rpm spl-dkms-0.6.5-8_ga3f5cc4.el7.centos.src.rpm spl-kmod-0.6.5-8_ga3f5cc4.el7.centos.src.rpm spl-dkms-0.6.5-8_ga3f5cc4.el7.centos.noarch.rpm
+ sudo yum -y localinstall kmod-spl-0.6.5-8_ga3f5cc4.el7.centos.x86_64.rpm kmod-spl-devel-0.6.5-8_ga3f5cc4.el7.centos.x86_64.rpm spl-0.6.5-8_ga3f5cc4.el7.centos.x86_64.rpm spl-debuginfo-0.6.5-8_ga3f5cc4.el7.centos.x86_64.rpm
+ exit 0
program finished with exit code 0

From the ZFS build config.log:

configure:21277: checking spl source directory
configure:21326: result: /usr/src/spl-0.6.5
configure:21337: checking spl build directory
configure:21373: result: /usr/src/spl-0.6.5/3.10.0-229.20.1.el7.x86_64
configure:21385: checking spl source version
configure:21408: result: 0.6.5-11_ge7b75d9
configure:21419: checking spl file name for module symbols

Note that the ZFS config.log reports a different SPL source version (0.6.5-11_ge7b75d9) than the 0.6.5-8_ga3f5cc4 packages that were just built and installed.

Add builders for 13-RELEASE and without --enable-debug per platform?

Differences between FBSD 13-RELEASE and 13-STABLE and --enable-debug versus not resulted in openzfs/zfs#13145, where building on 13-RELEASE was broken in two places and the CI hadn't noticed.

It'd be nice if non-debug builds didn't break without the CI noticing.

(While I'm asking for ponies, maybe a Debian 11 or sid builder so things like #13083 and #13103 are found sooner?)

Buildbot could use a distro version update

(I'm happy to twiddle this in my still-for-a-bit-copious free time if anyone likes...)

Currently, we build against:

  • CentOS 7 {x86_64} (build+test)
  • CentOS 8 {x86_64} (build+test)
  • CentOS 8 Stream {x86_64} (build+test)
  • Debian 8 {armel, ppc64} (build)
  • Debian 10 {arm64, x86_64} (build)
  • Fedora 33 {x86_64} (build+test)
  • FBSD {12,13,14}-CURRENT snapshots {x86_64} (build+test)
  • Kernel.org git tip builtin {x86_64} (build)
  • Ubuntu 16.04 {i386} (build)
  • Ubuntu {18.04, 20.04} {x86_64} (build+test)

I would suggest at a minimum adding:

  • Some CentOS 8 alternative that will keep getting updates after 2021, be it Rocky or Alma or w/e {x86_64} (build+test) Whoops, missed #234. Nice.
  • Debian 11 {arm64,x86_64} (build+test)
  • Fedora 35 {x86_64} (build+test)
  • FreeBSD something {not little-endian} (build+test)

It might also be useful to add a couple weirder things like:

  • Some Linux (kernel and OpenZFS built with Clang) (build and test)
  • Some Linux (kernel + OpenZFS built with KASAN) (build and test)

Though unless you could source somewhere for regularly built Clang kernels, that could be a significant burden, and building the whole kernel and then running the test suite each time could suck...maybe a bot that runs once a day/week against master, or doesn't block merges (e.g. best effort) would be better for one or both of those?

I'd also like to know whether there's a reason not to just make at least the non-x86_64 buildbots testbots too? Does it just take far too long?

I don't know when people want to drop support for older things, so I don't know when it would make sense to drop the Debian 8 trees. (It might also make sense to try using ELTS for armel if we want to see what some users might likely be running and not just "old kernel", though it's not available for ppc64, and that should probably involve a recurring payment to Freexian if using it in a non-personal setting...)

The FBSD/something BE suggestion is because it'd be nice to make sure the FBSD codepaths don't somehow have BE-specific issues. I'd suggest sparc64 for the variety (and because of openzfs/zfs#12008 not being noticed for a long time), but sourcing a build slave for that that would finish runs before the sun burns out could prove tricky, and qemu is...not amazing at sparc64. (I also don't know how the ppc64 slaves are done from quickly looking at the config, so I don't know if that's just paying the qemu cost or actual machines hosted somewhere or if there's a source for ppc64 VMs somewhere...)

Just some thoughts. Not exactly the highest priority, but keeps coming to mind periodically.

Is there a known-issues.html with PRs?

I see that known-issues.sh knows how to generate the page with PR failures included, but that option presumably isn't being used here.

Is there, or could there be, a version of that page that includes PRs added to whatever cron job generates it, perhaps with a shorter window (e.g. 7/14 days)? I suspect it would make things like the issues behind openzfs/zfs#12663 or #238 flamingly obvious...

Fix Requires-spl syntax

After submitting a PR with the Requires-spl: keyword, all subsequent build requests use the same SPL commit instead of defaulting back to master. Restarting the build master resets the SPL dependency, which is the current workaround for the issue.

Introduce a rebase check to the STYLE builder

As pull requests age, it becomes increasingly likely that bug fixes have landed in master. Some of these fixes/improvements address issues found by the test suite, or change the test suite itself (i.e. disabled/enabled tests). To combat this, we can introduce a check in the STYLE builder that tries to estimate how "stale" a pull request is.
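One cheap way to estimate staleness is to count how many commits the upstream branch has gained since the PR head diverged from it. A sketch, where the function name, ref names, and warning threshold are all assumptions:

```shell
# Hypothetical staleness check sketch for the STYLE builder.
# Prints the number of commits the upstream ref has gained since the
# PR head diverged from it; a large number suggests a rebase is due.
staleness() {
    upstream="$1"          # e.g. origin/master
    head="${2:-HEAD}"      # PR head, defaults to the current checkout
    base=$(git merge-base "$upstream" "$head") || return 1
    git rev-list --count "$base..$upstream"
}

# Example policy: warn once the PR is more than 200 commits behind.
# if [ "$(staleness origin/master)" -gt 200 ]; then
#     echo "WARNING: pull request looks stale, please rebase" >&2
# fi
```

The builder could emit this as a warning step, or flip to failure above some higher threshold.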
