[CI, enhancement] Add ability to build with `gcov`, adding C++ code coverage for the `onedal` folder in codecov #2249

icfaust · 2025-01-09T08:50:48Z

Description

Coverage statistics have been recently added to sklearnex via codecov. This was focused on the Python code, but C++ is the other important aspect for integrating oneDAL (representing a fifth of the code in the repo). This PR does the following:

Adds ability to build with gcov for GNU C++ and Intel DPC++ compilers in cmake set by the SKLEARNEX_GCOV environment variable
Integrates C++ code coverage statistics in the Windows and Linux DPC++ Github Actions (currently Python 3.9 and Python3.11)
Switches coverage.py data format from json format to lcov (with branching enabled) to match C++ addition

It does this by using the gcovr package, and by a bash script (generate_coverage_files.sh). Note, to use gcov, the build Numpy must be installed, otherwise it will fail (usually changed in the requirements-test.txt installation). Therefore, the bash script uses a NUMPY_BUILD environment variable in order to store this information. This does not impact the daal4py build, as it is considered legacy code.

Consequences:

Adds C++ coverage metrics
Greatly reduces coverage percentages (by using branch rather than line coverage which is stricter)
Adds ~3 minute total via the longer builds and report generation, increasing the CI runtime by 10%
lcov files are now to be considered the standard code coverage format for sklearnex

Note: windows visual studio does not easily maintain a code coverage tool, and is therefore neglected. The majority of code is operated with the DPC build, with only a small segment being missed.

No performance benchmarks necessary.

PR should start as a draft, then move to ready for review state after CI is passed and all applicable checkboxes are closed.
This approach ensures that reviewers don't spend extra time asking for regular requirements.

You can remove a checkbox as not applicable only if it doesn't relate to this PR in any way.
For example, PR with docs update doesn't require checkboxes for performance while PR with any change in actual code should have checkboxes and justify how this code change is expected to affect performance (or justification should be self-evident).

Checklist to comply with before moving PR from draft:

PR completeness and readability

I have reviewed my changes thoroughly before submitting this pull request.
I have commented my code, particularly in hard-to-understand areas.
I have updated the documentation to reflect the changes or created a separate PR with update and provided its number in the description, if necessary.
Git commit message contains an appropriate signed-off-by string (see CONTRIBUTING.md for details).
I have added a respective label(s) to PR if I have a permission for that.
I have resolved any merge conflicts that might occur with the base branch.

Testing

I have run it locally and tested the changes extensively.
All CI jobs are green or I have provided justification why they aren't.
I have extended testing suite if new functionality was introduced in this PR.

Performance

I have measured performance for affected algorithms using scikit-learn_bench and provided at least summary table with measured data, if performance change is expected.
I have provided justification why performance has changed or why changes are not expected.
I have provided justification why quality metrics have changed or why changes are not expected.
I have extended benchmarking suite and provided corresponding scikit-learn_bench PR if new measurable functionality was introduced in this PR.

codecov · 2025-01-09T09:36:29Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Flag	Coverage Δ
azure	`76.78% <ø> (-7.12%)`	⬇️
github	`70.19% <ø> (-13.00%)`	⬇️

Flags with carried forward coverage won't be shown. Click here to find out more.

see 98 files with indirect coverage changes

icfaust · 2025-01-13T02:03:49Z

/intelci: run

scripts/CMakeLists.txt

david-cortes-intel · 2025-01-13T11:44:57Z

@icfaust What would be the right way to generate a report locally? I haven't been able to get an interactive html report from this PR.

icfaust · 2025-01-13T12:03:07Z

@icfaust What would be the right way to generate a report locally? I haven't been able to get an interactive html report from this PR.

You should be able to use generate_coverage_reports.sh or most of the commands in there. The big thing is the gcov is touchy with cmake, and the build directory is the root for gcovr. If you are separating the build install and the test requirements installs, make sure to be careful with the numpy version. Are you using g++ or icpx?

david-cortes-intel · 2025-01-13T12:10:06Z

@icfaust What would be the right way to generate a report locally? I haven't been able to get an interactive html report from this PR.

You should be able to use generate_coverage_reports.sh or most of the commands in there. The big thing is the gcov is touchy with cmake, and the build directory is the root for gcovr. If you are separating the build install and the test requirements installs, make sure to be careful with the numpy version. Are you using g++ or icpx?

Meaning: it's not generated as part of the run.sh files for coverage? If that's the case, would be ideal to start documenting these things in the .md files.

david-cortes-intel · 2025-01-13T12:13:21Z

@icfaust Can you attach here an .html report of the C++ code coverage that would be generated?

icfaust · 2025-01-13T12:15:37Z

@icfaust Can you attach here an .html report of the C++ code coverage that would be generated?

I recommend looking at codecov: #2249 (comment)

It has a file explorer and unifies all the builds together: https://app.codecov.io/gh/uxlfoundation/scikit-learn-intelex/pull/2249/tree

.github/scripts/generate_coverage_reports.sh

Co-authored-by: david-cortes-intel <david.cortes@intel.com>

icfaust added 2 commits January 9, 2025 09:50

Update ci.yml

2899fb3

Update ci.yml

1b2302b

icfaust and others added 27 commits January 9, 2025 13:42

Update ci.yml

ec3953e

Update ci.yml

3563133

Update ci.yml

ad933e2

force use of coverage to start

6f0ed12

try again

8915d24

attempt again

cd4f95e

add -Xarch_host

765fa20

further checking

e5ca42d

make verbose

ca44fb7

attempts at integration

adfa455

add echos for testing

228f6e3

move installation

9a6a704

ignore error induced by numpy switch

b22c8a9

Update generate_coverage_reports.sh

f1a2299

Update generate_coverage_reports.sh

7db1eb5

Update generate_coverage_reports.sh

6bbba03

Update generate_coverage_reports.sh

65c4cf2

Update ci.yml

3878f8b

Update ci.yml

84bd99d

Update ci.yml

b301863

Update generate_coverage_reports.sh

df19a3f

Update ci.yml

de9d701

Update generate_coverage_reports.sh

c2a6f12

Update generate_coverage_reports.sh

5b5caa0

Update ci.yml

10c1ad8

Update run_test.bat

613a6bf

Update run_test.sh

c75446a

icfaust added 2 commits January 13, 2025 02:21

Update CMakeLists.txt

728b83e

Update ci.yml

d1e55b1

icfaust changed the title ~~[WIP] verify azp status DO NOT MERGE~~ [CI, enhancement] Add ability to build for gcov C++ code coverage Jan 13, 2025

icfaust changed the title ~~[CI, enhancement] Add ability to build for gcov C++ code coverage~~ [CI, enhancement] Add ability to build for gcov C++ code coverage for the onedal folder Jan 13, 2025

icfaust changed the title ~~[CI, enhancement] Add ability to build for gcov C++ code coverage for the onedal folder~~ [CI, enhancement] Add ability to build for gcov C++ code, adding C++ code coverage for the onedal folder in codecov Jan 13, 2025

icfaust changed the title ~~[CI, enhancement] Add ability to build for gcov C++ code, adding C++ code coverage for the onedal folder in codecov~~ [CI, enhancement] Add ability to build with gcov, adding C++ code coverage for the onedal folder in codecov Jan 13, 2025

icfaust marked this pull request as ready for review January 13, 2025 02:03

icfaust requested review from Alexsandruss, samir-nasibli, napetrov, homksei, ahuber21 and ethanglaser as code owners January 13, 2025 02:03

david-cortes-intel reviewed Jan 13, 2025

View reviewed changes

scripts/CMakeLists.txt Outdated Show resolved Hide resolved

icfaust added 2 commits January 13, 2025 10:08

Update CMakeLists.txt

0290055

Update CMakeLists.txt

e5ecf2a

icfaust requested a review from david-cortes-intel January 13, 2025 10:51

icfaust and others added 2 commits January 13, 2025 13:32

Merge branch 'uxlfoundation:main' into dev/c_plus_plus_coverage

ea702e4

switch azure to lcov

9b73ea6

david-cortes-intel reviewed Jan 13, 2025

View reviewed changes

.github/scripts/generate_coverage_reports.sh Outdated Show resolved Hide resolved

Update .github/scripts/generate_coverage_reports.sh

d085abe

Co-authored-by: david-cortes-intel <david.cortes@intel.com>

david-cortes-intel approved these changes Jan 13, 2025

View reviewed changes

icfaust merged commit aad2a9c into uxlfoundation:main Jan 14, 2025
27 checks passed

icfaust deleted the dev/c_plus_plus_coverage branch January 14, 2025 10:38

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[CI, enhancement] Add ability to build with `gcov`, adding C++ code coverage for the `onedal` folder in codecov #2249

[CI, enhancement] Add ability to build with `gcov`, adding C++ code coverage for the `onedal` folder in codecov #2249

icfaust commented Jan 9, 2025 •

edited

Loading

codecov bot commented Jan 9, 2025 •

edited

Loading

icfaust commented Jan 13, 2025

david-cortes-intel commented Jan 13, 2025

icfaust commented Jan 13, 2025 •

edited

Loading

david-cortes-intel commented Jan 13, 2025

david-cortes-intel commented Jan 13, 2025

icfaust commented Jan 13, 2025 •

edited

Loading

[CI, enhancement] Add ability to build with gcov, adding C++ code coverage for the onedal folder in codecov #2249

[CI, enhancement] Add ability to build with gcov, adding C++ code coverage for the onedal folder in codecov #2249

Conversation

icfaust commented Jan 9, 2025 • edited Loading

Description

codecov bot commented Jan 9, 2025 • edited Loading

Codecov Report

icfaust commented Jan 13, 2025

david-cortes-intel commented Jan 13, 2025

icfaust commented Jan 13, 2025 • edited Loading

david-cortes-intel commented Jan 13, 2025

david-cortes-intel commented Jan 13, 2025

icfaust commented Jan 13, 2025 • edited Loading

[CI, enhancement] Add ability to build with `gcov`, adding C++ code coverage for the `onedal` folder in codecov #2249

[CI, enhancement] Add ability to build with `gcov`, adding C++ code coverage for the `onedal` folder in codecov #2249

icfaust commented Jan 9, 2025 •

edited

Loading

codecov bot commented Jan 9, 2025 •

edited

Loading

icfaust commented Jan 13, 2025 •

edited

Loading

icfaust commented Jan 13, 2025 •

edited

Loading