feat: Code Coverage Reporting via CI #167

courtneypacheco · 2024-12-17T20:32:36Z

This document provides design suggestions for CI code coverage reporting. Initially, the high-level goal is to provide a report in each pull request to highlight:

Current code coverage of main for each InstructLab repo
The code coverage on the new lines in the PR
What the code coverage will become if said PR is merged (e.g., 53% coverage -> 62% coverage)

We can ultimately decide if we want to enforce a minimum coverage % in PRs, but for now, I just want to report the coverage for informational purposes

This document provides design suggestions for CI code coverage reporting, as well as describe high-level goals. Also, I've added unrecognized words to the spell checking dictionary Signed-off-by: Courtney Pacheco <[email protected]>

RobotSail · 2024-12-17T21:29:25Z

docs/ci/ci-code-coverage-reporting.md

+
+* Code coverage should be reported in PR builds as a GitHub _bot_ comment
+* New code:
+  * Should show % coverage on new lines of code.


Should we require this explicitly for all repos? There are some repos like instructlab/training which are challenging to test meaningfully for certain aspects due to how we expect them to run (distributed, multi-processed).

Maybe it would make more sense to uphold a certain % of coverage maintained? So that if new changes increase the lines, we are not falling below a certain threshold.

This doesn't seem to be suggesting a requirement, just a communication of how much of the new code is being covered by testing

RobotSail · 2024-12-17T21:30:29Z

docs/ci/ci-code-coverage-reporting.md

+  * Free to use in open source projects.
+  * Codecov reports all the data on GitHub in a comment. [Example](https://docs.codecov.com/docs/pull-request-comments).
+* Cons / Callouts:
+  * Not applicable at this time, but if we want to use it in >1 private repo, we must purchase a customer plan.


We don't have any private repos (or plan to), so this shouldn't affect us.

RobotSail · 2024-12-17T21:31:42Z

docs/ci/ci-code-coverage-reporting.md

+* Pros:
+  * Free to use via [MIT license here](https://github.com/coverallsapp/github-action/blob/main/LICENSE.md).
+  * Appears to be free for open source repos, but like with Codecov, we should validate that the ToS doesn’t have fine print, etc.
+  * Can report all the data in a GitHub comment. [Example](https://github.com/coverallsapp/coveralls-node-demo/pull/19#issuecomment-1035344985).


This is a really nice addition.

RobotSail · 2024-12-17T21:32:19Z

docs/ci/ci-code-coverage-reporting.md

+  * Appears to be free for open source repos, but like with Codecov, we should validate that the ToS doesn’t have fine print, etc.
+  * Can report all the data in a GitHub comment. [Example](https://github.com/coverallsapp/coveralls-node-demo/pull/19#issuecomment-1035344985).
+* Cons / Callouts:
+  * Code coverage visuals, etc. are all viewed through the Coveralls front end (coveralls.io), so if that service were to go down, then we’re technically at their mercy.


If it's FOSS/MIT licensed, could we not stand up our own instance?

True, but I'd prefer to avoid the overhead (is possible)

RobotSail

Thanks for this dev-doc, this will be a very positive change to have in our repos. I think ultimately all of the coverage tools will be good to have, so it seems like it's more about preference than necessarily picking the wrong one.

bjhargrave · 2024-12-17T21:37:01Z

I am not sure the utility of this. Unless it is someone's job to increase the coverage amount, the coverage amount is just a metric no one cares about and is more emails from GitHub for each PR run. I don't think I have ever worked on an open source project where coverage reporting has ever been really helpful.

A lighter weight idea would be to simply upload the coverage data/reports as a build artifact which someone can go get if they really care. For example (the example is test reports, but same basic idea):

https://github.com/bndtools/bnd/blob/ce60cc705bea2ef75126cb17967b34a50ab3b4a6/.github/workflows/cibuild.yml#L103-L111

courtneypacheco · 2024-12-17T22:19:15Z

I am not sure the utility of this. Unless it is someone's job to increase the coverage amount, the coverage amount is just a metric no one cares about and is more emails from GitHub for each PR run. I don't think I have ever worked on an open source project where coverage reporting has ever been really helpful.

A lighter weight idea would be to simply upload the coverage data/reports as a build artifact which someone can go get if they really care. For example (the example is test reports, but same basic idea):

https://github.com/bndtools/bnd/blob/ce60cc705bea2ef75126cb17967b34a50ab3b4a6/.github/workflows/cibuild.yml#L103-L111

Yep, I know what you mean about metrics.

I don't think the number itself tells us much about code quality either because in theory, a repo can have "100% code coverage," yet still suffer quality issues. After all, if we write code wrong and then write our unit tests wrong as well, it is possible that the unit tests pass when they shouldn't.

However, if the code coverage is sitting at 53% and a contributor creates a large PR that reduces the overall code coverage to 30% (as a gross exaggeration), then we might want to look at it with more critical eyes. Same if someone creates a PR that adds 200 lines with only 5% code coverage.

Ultimately, my goal is not to bombard anyone with tons of emails, especially from a code coverage bot! But I do want to make it easy for contributors and reviewers to see if a certain file or critical piece of code was accidentally left untested. Yes, contributors can look at the actual code coverage report file instead of us having a bot commenting about untested code lines, but having coverage information summarized right in the PR would likely be helpful for PR reviewers.

What do you think?

bjhargrave · 2024-12-17T22:33:41Z

having coverage information summarized right in the PR would likely be helpful for PR reviewers.

Yes, but adding the comments sends emails. :-(

In any case, we need to be careful in the GitHub workflows to avoid security issues which can lead to supply chain attacks.

courtneypacheco · 2024-12-17T22:56:44Z

having coverage information summarized right in the PR would likely be helpful for PR reviewers.

Yes, but adding the comments sends emails. :-(

In any case, we need to be careful in the GitHub workflows to avoid security issues which can lead to supply chain attacks.

Yep, agreed.

A couple of options off the top of my head to mitigate the number notifications:

We could disable the automatic code coverage bot reporting when PRs are marked as drafts, or
We could disable the bot comments by default, but allow any user to call it manually, such as when the PR author or reviewer is concerned about code quality — e.g., https://docs.github.com/en/actions/writing-workflows/choosing-when-your-workflow-runs/events-that-trigger-workflows#workflow_dispatch, or
Set up filters to block certain bot emails — e.g., https://docs.github.com/en/account-and-profile/managing-subscriptions-and-notifications-on-github/setting-up-notifications/configuring-notifications

For security concerns regarding workflows: Ansible, Santa and other popular open source repos do actively use Codecov or Coveralls to assess code coverage. I can definitely look into how these repos have mitigated security risks via their workflow logic and work to identify any concerns we may have if we were to introduce such workflows in any of our repos.

nathan-weinberg · 2024-12-18T19:20:42Z

docs/ci/ci-code-coverage-reporting.md

+
+* Code coverage should be reported in PR builds as a GitHub _bot_ comment
+* New code:
+  * Should show % coverage on new lines of code.


This doesn't seem to be suggesting a requirement, just a communication of how much of the new code is being covered by testing

nathan-weinberg · 2024-12-18T19:21:05Z

docs/ci/ci-code-coverage-reporting.md

+  * Free to use in open source projects.
+  * Codecov reports all the data on GitHub in a comment. [Example](https://docs.codecov.com/docs/pull-request-comments).
+* Cons / Callouts:
+  * Not applicable at this time, but if we want to use it in >1 private repo, we must purchase a customer plan.


nathan-weinberg · 2024-12-18T19:21:35Z

docs/ci/ci-code-coverage-reporting.md

+  * Appears to be free for open source repos, but like with Codecov, we should validate that the ToS doesn’t have fine print, etc.
+  * Can report all the data in a GitHub comment. [Example](https://github.com/coverallsapp/coveralls-node-demo/pull/19#issuecomment-1035344985).
+* Cons / Callouts:
+  * Code coverage visuals, etc. are all viewed through the Coveralls front end (coveralls.io), so if that service were to go down, then we’re technically at their mercy.


True, but I'd prefer to avoid the overhead (is possible)

nathan-weinberg · 2024-12-18T19:24:19Z

docs/ci/ci-code-coverage-reporting.md

+  * Explains what isn’t covered and gives detailed suggestions on how to increase code coverage. (Seems better than Codecov, but… it’s also a more refined product?).
+  * Keeps a running history of your code coverage reporting, if desired.
+* Cons / Callouts:
+  * Appears to be "free to use," but Sonar wants you to store coverage data in their cloud and storage costs will apply unless you agree to host your own Sonar service -- in which case, we'll need to pay for a license. (Not ideal.)


I say we avoid this one

nathan-weinberg · 2024-12-18T19:24:29Z

docs/ci/ci-code-coverage-reporting.md

+
+## Code Coverage Reporting Bot: Options
+
+### [Codecov](https://github.com/marketplace/codecov)


This would be my preference

cdoern

this is very thorough, I like all of the ideas here. My preference for code coverage is CodeCov

nathan-weinberg requested review from cdoern, nathan-weinberg, danmcp, alimaredia and bjhargrave December 17, 2024 20:34

courtneypacheco force-pushed the dev-docs-ci-code-coverage branch 3 times, most recently from 5dc709c to e818651 Compare December 17, 2024 20:53

courtneypacheco force-pushed the dev-docs-ci-code-coverage branch from e818651 to 06fd0a3 Compare December 17, 2024 21:06

RobotSail reviewed Dec 17, 2024

View reviewed changes

RobotSail approved these changes Dec 17, 2024

View reviewed changes

nathan-weinberg reviewed Dec 18, 2024

View reviewed changes

cdoern approved these changes Dec 18, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: Code Coverage Reporting via CI #167

feat: Code Coverage Reporting via CI #167

courtneypacheco commented Dec 17, 2024

RobotSail Dec 17, 2024

nathan-weinberg Dec 18, 2024

RobotSail Dec 17, 2024

nathan-weinberg Dec 18, 2024

cdoern Dec 18, 2024

RobotSail Dec 17, 2024

RobotSail Dec 17, 2024

nathan-weinberg Dec 18, 2024

RobotSail left a comment

bjhargrave commented Dec 17, 2024

courtneypacheco commented Dec 17, 2024

bjhargrave commented Dec 17, 2024

courtneypacheco commented Dec 17, 2024

nathan-weinberg Dec 18, 2024

nathan-weinberg Dec 18, 2024

nathan-weinberg Dec 18, 2024

nathan-weinberg Dec 18, 2024

nathan-weinberg Dec 18, 2024

cdoern Dec 18, 2024

cdoern left a comment


		## Code Coverage Reporting Bot: Options

		### [Codecov](https://github.com/marketplace/codecov)

feat: Code Coverage Reporting via CI #167

Are you sure you want to change the base?

feat: Code Coverage Reporting via CI #167

Conversation

courtneypacheco commented Dec 17, 2024

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

RobotSail left a comment

Choose a reason for hiding this comment

bjhargrave commented Dec 17, 2024

courtneypacheco commented Dec 17, 2024

bjhargrave commented Dec 17, 2024

courtneypacheco commented Dec 17, 2024

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

cdoern left a comment

Choose a reason for hiding this comment