Skip to content
Dale Wijnand edited this page Feb 20, 2020 · 32 revisions

Maintaining the community build

This is partly for community build maintainers on the Scala team, and partly for project authors interested in having their projects in the community build and keeping the version included current.

The Troubleshooting a failure page is more specifically targeted at project authors and external contributors. (It overlaps somewhat with this page.)

Background

You'll want to be familiar with the basics of our CI infrastructure, as documented at https://github.com/scala/scala-jenkins-infra/blob/master/doc/overview.md. Most community build maintenance tasks only involve making commits in the scala/community-build repo, but some types of improvements may require scala/scala-jenkins-infra changes.

Credentials

You'll need an account on our Jenkins instance (https://scala-ci.typesafe.com) with enough power to start jobs, cancel jobs, and edit configs. (Permanent config changes are made through scala-jenkins-infra, but it's sometimes helpful to make temporary changes on the fly.)

Getting set up to make scala-jenkins-infra is harder; you might wait and see if you need it. See the doc in that repo.

dbuild

dbuild is the meta-build-tool that is the foundation of the community build. (It was also used to build version 1 of Lightbend's Reactive Platform.)

You'll need at least a basic grounding in what dbuild is about; so at least skim through the dbuild manual at https://lightbend.github.io/dbuild/ . If it isn't clear 1) why something like dbuild is needed for this, and 2) basically how dbuild does its job, talk to someone on the Scala team about it.

Jenkins

Don't forget to login into Jenkins, or you won't see all of the options that are available. Jenkins is exceptionally eager to log you out again, so prepare to find yourself clicking that "login" link a lot. (Help wanted: find out why that is and fix it?)

There are multiple jobs, entries in a matrix of Scala versions (2.11/2.12/2.13) and JDK versions (8/11/14).

All the builds run nightly. When you're actively maintaining the builds, you'll be triggering a lot of runs manually. The "rebuild last" button is handy for this.

Each build has parameters, with an appropriate set of defaults. When manually triggering a build, you'll often want to change some of the following:

  • repo_user: if you're testing a change in your fork of the community-build repo, change this from scala to your own GitHub user name
  • repo_ref: your change in your fork is presumably in a feature branch, so put the branch name here
  • version: this is a Scala version number such as 2.13.2-bin-abcd123 (nightly Scala) or 2.13.2-bin-abcd123-SNAPSHOT (Scala from PR validation)
    • dbuild caches results from previous runs of the community build, so when testing changes, your first choice will to be use a version number for which a recent fully-green build exists.

Manually aborting a community build through the Jenkins UI doesn't lose any completed and cached builds; so if you have additional changes you want to test, it's OK to abort an existing run and start a new one with your additional changes.

Branches

As mentioned already, there are multiple builds by Scala version and JDK version.

Changes are merged as follows: 2.11.x -> 2.12.x -> 2.13.x. Since the 2.11 and 2.12 builds are pretty much frozen these days, this will normally just be things like sbt version bumps and things like improvements to the various scripts.

There isn't any schedule or automation for the merging, it's just something we do when we remember to.

Particular builds are assigned to particular Jenkins workers ("behemoth" 1, 2, or 3) to get the greatest advantage of dbuild's build caching.

Quality standard

You'll want to look over the issues in https://github.com/scala/community-build/issues to get an idea of what shortcomings currently exist. The community build is capable of absorbing a practically unlimited amount of time and effort if you let it, so it's often necessary to make cost/benefit tradeoffs between getting things right and getting them "good enough".

In an ideal world:

  • every significant Lightbend and Scala Center library would be included
  • every popular third-party open-source Scala library would be included
    • and perhaps also some important Scala codebases that aren't themselves libraries
  • every included project would be tracked from a recent yet stable branch
  • every time we forked a project to make it work, we would notify the maintainers and make sure they incorporate the needed changes, so we can eliminate the fork and go back to tracking a branch
  • every included project's test suite would be run

and so on. In practice, counterexamples are abundant.

Regardless, as long as we include lots of projects, with lots of code, in reasonably recent versions, we're doing pretty well. We're not trying to provide the entire Scala community with CI.

Scala and sbt versions

Every so often, check to see if we're using the latest Scala and the latest sbt to do extractions and builds, and bump the versions if not.

To bump the Scala version, you only need to edit nightly.properties.

For sbt, use git grep to find all the places that need to be updated. The main one is in community.conf, but it's also good to use the latest version in the other places.

Project versions

All projects are built from fixed SHAs.

Every so often, we move them all to new SHAs, by using the latest commit on the branch or tag specified in the first line of each proj/*.conf file.

How often? More often is good because we catch problems earlier. Less often is good because it avoids unnecessary effort and churn. What's a good compromise? Monthly or so, probably.

To advance all the SHAs, run ./advance. There is a high likelihood something will break, so you want to do this on a branch (and probably a pull request), run Jenkins on the branch, and make sure everything is green before merging the changes.

Note that the ./advance script can also be a run for one or more more specific target names, e.g. ./advance akka playframework lagom, that kind of thing. This is useful in various circumstances, but it's especially common if you want to use a different branch or tag. Editing the URL in the comment at the top of a project's config file doesn't affect dbuild directly; you have to ./advance myproject to make the change take effect.

Troubleshooting

What typically goes wrong with the community build?

Build times out

Over time, the builds get slower: as we add more projects, as the projects themselves grow, as we add more resolvers to the config file, etc.

In PRs such as https://github.com/scala/scala-jenkins-infra/pull/129 we try to set the timeouts high enough so that the builds will normally pass, but low enough so that if builds start taking abnormally long, we are made aware of it, and are forced to make a conscious decision about whether to just increase the timeouts again, or try to figure out whether there's some performance problem we ought to tackle directly.

If a build does time out, usually you can just run it again with the same parameters, and the new run will finish much faster thanks to dbuild's caching.

Servers run out of disk space

Keep an occasional (weekly?) eye on the free space numbers at https://scala-ci.typesafe.com/computer/. (You need to be logged in to Jenkins to see the numbers. Use "Launch slave agent" to fire up the behemoths if needed.)

It is normal for disk usage to creep up slowly over time, and eventually a behemoth will need cache clearing, but if we're running out of disk space more than very occasionally, the root cause should be found and fixed.

Upstream changes cause regression

As you'll see in the URLs referenced in proj/*.conf, we actively track current development branches of many projects, by running the advance script to move to newer SHAs. Advancing the SHAs is very likely to cause failures, so we normally make a PR for the new SHAs and make sure it's green before merging the changes. When advancing many SHAs at once, it may be difficult to determine the true source of a failure. If a build fails, the cause isn't always that repo. It might be because some upstream project's code changed.

In response, you can either:

  • advance the SHA but add something to community.conf to address the problem (e.g. by disabling a subproject, changing the build settings, etc.)
  • freeze at an older rev (perhaps also forking to make a needed change)
    • when depending on an old rev it's better to depend on a version tag than a raw SHA
    • if you fork, include in your comment what the fork point was and why; be nice to future you and future maintainers

Project's tests are flaky

We encourage upstream maintainers not to leave intermittently failing tests in their suites, but it happens sometimes. Try to report the problem upstream.

If you must disable tests, try to do it just for an offending subproject, if there is one, rather than the whole build.

extra.run-tests: false disables not only running tests, but also compiling them. If there are test failures, instead first try extra.test-tasks: ["compile"], so we at least compile the tests.

Project includes an unnecessary or unstable subproject

Some people's builds are a grab bag of stable vs unstable, official vs experimental stuff. If a failure can be isolated to a particular subproject, and if other people don't depend on it, you can just skip it (see examples in common.conf).

Project's branch isn't stable enough to track

If a particular project regresses often, it may be worth contacting the project maintainer and asking (or just looking for yourself on GitHub) if there's a less volatile branch we could track. We're not trying to provide the entire Scala community with CI, after all.

Project's build is too picky

e.g., they might be using -Xfatal-warnings. It's rarely worth the effort to maintain that high a standard in the context of the community build. There are examples in proj/*.conf of how to disable this.

or the build might be picky about Scala version numbers and get confused by the unusual/unexpected scalaVersion and scalaBinaryVersion values that are used in the community build.

or the build might be sensitive to sbt version. normally we use the latest sbt version to build everything, but that might fail for certain projects. common.conf has examples of how to force a lower sbt version for a certain project

Missing resolver

If a dependency of a project's build definition (typically an sbt plugin, or a library that plugin depends on) can't be found, you'll need to add a resolver to the Artifactory config on scala-ci.

You might also need to add a resolver if a project has a binary Java dependency that isn't on Maven Central.

"both visible" error

if you see an error message like "org.scalacheck#scalacheck from scalacheck and scalacheck, both visible in space “default”” a typical cause is that you have an upgraded to a version of a library that has added Scala.js support, and dbuild is seeing both JVM and JS artifacts. usual fix: override extra.projects and/or extra.exclude

Version conflicts

It's often possible to resolve a version incompatibility by forking a project and patching its source a little bit. If that happens, try to contribute the change upstream, so your fork won't be needed anymore.

It's difficult to know in advance whether upgrading library A will trigger a cascade of upgrades and regressions. Sometimes upgrades multiply, problems multiply, and you end up stuck. If you fail, tell the story of your failed attempt in a ticket, in case someone wants to try again eventually.

If an intractable version conflict occurs, the build may need to split into multiple spaces. See next section.

Spaces

We try to use dbuild's "spaces" feature as little as possible, since it makes the build configuration a lot harder to understand and troubleshoot, and it increases build times.

Sometimes, though, it may be necessary to support two different versions of the same project, if the versions are source-incompatible and both used by important projects. In that case, putting the different versions in separate spaces is the only way forward.

Currently (January 2020) the 2.13.x build doesn't use spaces. In the 2.12.x build, the only such split we have is a jawn 0.10 versus 0.11 split, which you'll see in the config file.

Binary dependencies

If some project is needed as a dependency but building it from source is too difficult or annoying, consider adding it as a binary dependency. dbuild supports this with system: "ivy"; there are a few examples in the history. (This only works for Scala versions where binary compatibility is frozen; during the milestone and release candidate phases, everything must be built from source.)

General advice

Advance the SHAs reasonably often

Run the advance script and PR the changes it makes, and test them before merging. It's hard to know how often makes sense, but monthly seems like a bare minimum.

Use the narrow script for local runs

This can really save you a lot of time. See Local runs.

Put forks in the scalacommunitybuild organization

https://github.com/scalacommunitybuild was created so we don't end up using forks scattered around various personal accounts. (You'll see such personal forks in the 2.11 builds, where we haven't bothered to clean up.) We don't want to use forks that might be lost, and we don't want to depend on commits that might disappear, either.

(In the future we might use the organization for freezing, too, not just forking. Anytime we freeze either an individual project, or freeze all of the projects at release time, there is a danger of the commit we froze at disppearing from GitHub later. If we only froze to commits stored in the scalacommunitybuild organization, that danger would go away.)

Don't trust old forks

They gradually become out of date, since advance doesn't advance them.

Get help from project maintainers

It's not your job to troubleshoot every failure yourself. Open a ticket, paste in relevant section of the error log (with a link to the full log on Jenkins), @-mention the maintainer, and ask for their help. It's their code, so they'll probably spot the problem faster, and usually maintainers are happy to be included in the community build and happy to help.

Talk to other maintainers

Toni (dbuild author), Eugene and Dale (sbt wizards), etc. They know more about dbuild, sbt, resolvers, Artifactory, etc than you ever will. Everyone on the Scala team has a history with these things, too.

Be patient

The community build is important, but it is rarely urgent. It only becomes urgent when a Scala release is impending.

The rest of the time, it's fine if things move slowly: because you're waiting for a project maintainer, because you're waiting for slow builds to finish, because a change needs multiple rounds of testing to get right, etc. It's common for this to drag on for days and weeks. It's fine. The community build is a moving target.

Leave bread crumbs

When you fix an issue, leave clues for future-you by including the error message in the commit comment. And if an error message seems mysterious, look for it in git log, a previous maintainer might have seen it before and left clues for you.

Pretend you're the main character of "Memento"

Every Jenkins run begins with extracting the dependencies of all of the projects, even if none of them need to be rebuilt. This takes a few minutes, which is too long and sit and watch, so inevitably you'll start working on something else. And actually getting feedback on the change you made might take 30 minutes, or an hour, or 10+ hours if everything has to be rebuilt! By the time you get back to it, it's easy to forget what you were even doing.

It's tempting to maintain mental flow by keeping a tight watch on your builds, but doing so is a huge waste of time. It's better to always write down everything you're thinking and doing as you go, so it's never too costly to set the community build aside while you wait for results. Get something else done, and once your results are in, if you have good notes on where you were, it won't be that hard to reconstruct your mental state and figure out what your next move is.

It's basically the same advice as "leave bread crumbs", but you're leaving the crumbs for yourself in 10 minutes or 1 hour or 1 day, not just some other sad sack months or years later.

Test multiple changes at once?

Don't just tackle one open ticket, considering tackling 3 or 4 at once, at least if you're reasonably confident the changes are independent and won't interfere with each other. A test run can take a long time, so why not maximize the amount of information you get from each run?

Experts only! This is definitely bad advice at least some of the time.

Perhaps a project is more trouble than it's worth

Just because a project was in at one time doesn't mean we need to keep it forever. Keep cost/benefit in mind.

The same goes for questions of whether to run tests, what subprojects to include, and so forth.

Local Artifactory

Extracting dependencies from lots of projects, and resolving and retrieving build-time dependencies such as sbt plugins from lots of projects, is inherently slow. If you do a lot of community build work, you'll want to run a local Artifactory to speed it up substantially, especially if your internet connection is slow.

See Local Artifactory for instructions.

Clone this wiki locally