allow for encoded labels in Report
#189
Conversation
Updates `shared` to the version where `Report` has a `_labels_index` and makes use of it in processing.
context: codecov/engineering-team #768. Creates a rollout for label compression. This is going to help us test and safely release the feature in the wild. Notice that currently the label compression does nothing; there are comments similar to `TODO: needs shared update`. The update in question is codecov/shared#79. So these changes mostly prep the terrain for future changes that will actually do something, and add the guardrails to avoid issues when deploying. In particular it adds some helper methods to the `ArchiveService`, creates a kinda-stubbed `LabelsIndexService`, and passes more data to the `_adjust_sessions` portion of `raw_upload_processor`, where most changes will occur. If you're curious what the end result will probably look like, see #180.
These changes extend the `Report` to use the `_labels_index`. Everything is controlled via a rollout, so in practice there's a single repo that will use the compressed labels, my sentry fork (giovanni-guidini/sentry), for testing purposes. There are 2 important facts here:
1. Given worker / shared pairs we can only have (old, old) or (new, new), where "old" denotes the code that doesn't do label encoding and "new" the code that does.
2. The (new, new) pair can handle the cases the (old, old) pair does plus the new case of label encoding, but the (old, old) pair will generate corrupted results if it tries to process an encoded report.

So it's important to roll out the (new, new) pair into prod _before_ turning the feature on for everyone. Which is what we are doing, so we're fine (probably). At first I was very much in the "let's tear everything down and build again" camp, but because the (new, new) pair needs to handle the unencoded versions as well I decided to make smaller changes so the code can co-exist in harmony.
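For readers new to the feature, a minimal sketch of the idea (the label strings and dict layout here are illustrative; only the `labels_index` name comes from this PR): instead of repeating full label strings on every datapoint, the report keeps one shared index and encoded datapoints reference labels by integer id.

```python
# Illustrative sketch only -- not the actual shared.reports implementation.
from typing import Dict, List

# One shared mapping per report: label id -> human-readable label.
labels_index: Dict[int, str] = {
    0: "codecov_all_labels_placeholder",  # illustrative placeholder label
    1: "tests/test_report.py::test_carryforward",
    2: "tests/test_report.py::test_merge",
}

# Unencoded datapoint: repeats the full label strings on every covered line.
unencoded_labels: List[str] = [
    "tests/test_report.py::test_carryforward",
    "tests/test_report.py::test_merge",
]

# Encoded datapoint: stores only label ids, resolved through labels_index.
encoded_labels: List[int] = [1, 2]


def decode(label_ids: List[int], index: Dict[int, str]) -> List[str]:
    """Resolve encoded label ids back to the label strings."""
    return [index[label_id] for label_id in label_ids]


assert decode(encoded_labels, labels_index) == unencoded_labels
```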
Codecov Report

@@            Coverage Diff             @@
##             main     #189      +/-   ##
==========================================
- Coverage   98.35%   98.35%   -0.01%
==========================================
  Files         353      357       +4
  Lines       27844    28576     +732
==========================================
+ Hits        27386    28105     +719
- Misses        458      471      +13

Flags with carried forward coverage won't be shown.
Codecov Report

Changes have been made to critical files, which contain lines commonly executed in production.

@@            Coverage Diff             @@
##             main     #189      +/-   ##
==========================================
- Coverage   98.31%   98.31%   -0.01%
==========================================
  Files         382      386       +4
  Lines       28489    29222     +733
==========================================
+ Hits        28010    28730     +720
- Misses        479      492      +13

Flags with carried forward coverage won't be shown.
Force-pushed from 1e257c1 to bb537db.
When generating the report the `labels_index` was being generated with the `SpecialLabel` in the index. That's a problem because: 1. It's different from the default index, where you have the corresponding label, so you end up having 2 indexes for the same label. 2. The enum value is not JSON serializable. By already using the corresponding label we solve both problems at once.
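A small sketch of the fix described in that commit message (the enum body and `add_label_to_index` below are illustrative stand-ins; `corresponding_label` is the attribute referenced elsewhere in this PR):

```python
import json
from enum import Enum


class SpecialLabelsEnum(Enum):
    # Stand-in for the enum in shared; the real placeholder string differs.
    CODECOV_ALL_LABELS_PLACEHOLDER = "codecov_all_labels_placeholder"

    @property
    def corresponding_label(self) -> str:
        return self.value


def add_label_to_index(label, labels_index: dict) -> int:
    # Store the plain string, never the enum member, so that:
    # 1. the placeholder shares one index entry with its corresponding label
    # 2. the index stays JSON serializable
    if isinstance(label, SpecialLabelsEnum):
        label = label.corresponding_label
    for idx, existing in labels_index.items():
        if existing == label:
            return idx
    new_idx = max(labels_index, default=-1) + 1
    labels_index[new_idx] = label
    return new_idx


index = {}
a = add_label_to_index(SpecialLabelsEnum.CODECOV_ALL_LABELS_PLACEHOLDER, index)
b = add_label_to_index("codecov_all_labels_placeholder", index)
assert a == b          # a single entry, not two indexes for the same label
json.dumps(index)      # serializes cleanly, no enum member in the values
```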
While processing reports they get merged into a temporary report (because there might be multiple reports in a single upload). The temporary report is what ultimately gets merged with the original one, so the labels need to be passed to it.
When carrying forward the report we also need to carry forward the label index, if the parent commit has one.
Force-pushed from a4febe6 to e004bd6.
Dang, this gets complicated. I almost wonder if it'd be easier to make sure the next label request (after this is deployed) just returns ALL labels and then we can just carry on from there using label IDs only. Not entirely sure how to make that happen and maybe there's additional nuance that makes such an approach infeasible. This is just a lot of logic to handle the 2 potential label encodings.
# 1. It's necessary for labels flags to be carryforward, so it's ok to carryforward the entire index
# 2. As tests are renamed the index might start to be filled with stale labels. This is not good.
#    but I'm unsure if we should try to clean it up at this point. Cleaning it up requires going through
#    all lines of the report. It might be better suited for an offline job.
I agree - could this happen in the upload finisher perhaps after triggering the notification task?
I'm unsure... certainly after the notification task. But I'm not sure about the upload finisher because it is scheduled for every upload task. It does have a lock for it, but maybe I'll have to implement a multi-reader / single-writer lock for the labels file?
My biggest concern with putting it in the upload finisher is that there might still be uploads being actively processed at the same time. I think the cleanup should happen a few hours after the last upload came in, to be extra sure.
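To make the cost of that cleanup concrete, a rough sketch of what an offline job could do (all names below are hypothetical): it has to visit every line's datapoints before any entry can safely be dropped from the index, which is why running it on every upload would be expensive.

```python
from typing import Dict, Iterable, List, Set


def collect_used_label_ids(datapoint_label_ids: Iterable[List[int]]) -> Set[int]:
    """Walk every datapoint in the report and collect the label ids still referenced."""
    used: Set[int] = set()
    for label_ids in datapoint_label_ids:
        used.update(label_ids)
    return used


def drop_stale_labels(labels_index: Dict[int, str], used_ids: Set[int]) -> Dict[int, str]:
    """Keep only the index entries that some line still points to."""
    return {idx: label for idx, label in labels_index.items() if idx in used_ids}


# Example: label 2 ("old_test_name") is no longer referenced by any line.
index = {0: "placeholder", 1: "tests/test_a.py::test_a", 2: "old_test_name"}
report_lines = [[1], [0, 1]]
cleaned = drop_stale_labels(index, collect_used_label_ids(report_lines))
assert cleaned == {0: "placeholder", 1: "tests/test_a.py::test_a"}
```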
def __init__(self, commit_report: CommitReport) -> None:
    self.commit_report = commit_report
Curious about this change - what's wrong with passing in the commit report?
There's nothing wrong with doing that per se, but it doesn't match all usages. In the carry-forward bit the `commit_report` doesn't exist yet - or I don't have access to it (for the parent commit) - and yet I create `LabelsIndexService`s for the commits. So I needed to be able to create them without the `commit_report`.
I have since taken Matt's suggestion around that bit of code though, so I can revert the code if preferred.
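For reference, a sketch of the constructor shape being discussed (the class bodies and signatures below are illustrative, not the actual diff): built from a commit directly, the service can be created for a parent commit even when no `CommitReport` object is at hand.

```python
from dataclasses import dataclass


@dataclass
class Commit:
    # Illustrative stand-in for the real Commit model.
    repoid: int
    commitid: str


class LabelsIndexService:
    # Hypothetical constructor shape: no CommitReport required, so the service
    # can be created for a parent commit during carryforward.
    def __init__(self, commit: Commit, report_code: str = "") -> None:
        self.commit = commit
        self.report_code = report_code


parent = Commit(repoid=1, commitid="a4febe6")
service = LabelsIndexService(parent)  # no CommitReport needed
```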
@@ -22,23 +22,54 @@ def matches_content(self, content, first_line, name) -> bool:
             and "files" in content
         )

-    def _convert_testname_to_label(self, testname, labels_table):
+    def _convert_testname_to_label(self, testname) -> str:
I find the `testname` terminology a little confusing since I wasn't sure what the difference between a label and a test name is. Maybe `_normalize_label` would describe this behavior?
I think that's a valid change. I'll do that.
Honestly I don't know what the exact difference between "label" and "test name" was meant to be (by Thiago).
My working mental model is that "label" is a more flexible term than "test name", because "test name" obviously denotes a specific test name, while a "label" might denote a group of tests and might not be generated by the testing tool at all.
@@ -173,7 +191,9 @@ def create_coverage_line(
         coverage,
         *,
         coverage_type: CoverageType,
-        labels_list_of_lists: typing.List[typing.Union[str, SpecialLabelsEnum]] = None,
+        labels_list_of_lists: Union[
+            List[Union[str, SpecialLabelsEnum]], List[int]
🤯
@@ -29,6 +31,8 @@
     SpecialLabelsEnum.CODECOV_ALL_LABELS_PLACEHOLDER.corresponding_label
 )

+GLOBAL_LEVEL_LABEL_IDX = 0
Seems like this should go deeper in the report service or `shared` or something. I saw `0` explicitly referenced elsewhere; that should maybe use this constant instead.
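A tiny sketch of the convention implied here (illustrative, assuming index 0 is reserved for the global "all labels" placeholder):

```python
# Illustrative sketch: index 0 is the reserved slot for the global
# "covers all labels" placeholder, referenced via the named constant
# rather than a literal 0.
GLOBAL_LEVEL_LABEL_IDX = 0


def init_labels_index(placeholder_label: str) -> dict:
    # Seed the reserved slot first so every report agrees on what id 0 means.
    return {GLOBAL_LEVEL_LABEL_IDX: placeholder_label}


labels_index = init_labels_index("codecov_all_labels_placeholder")
assert labels_index[GLOBAL_LEVEL_LABEL_IDX] == "codecov_all_labels_placeholder"
```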
@@ -40,6 +44,24 @@ class LinesRelevantToChange(TypedDict):
     files: Dict[str, Optional[LinesRelevantToChangeInFile]]


+class ExistingLabelSetsEncoded(NamedTuple):
What's the advantage of `NamedTuple` over using `@dataclass`?
No idea, really. But originally this was a tuple, so I thought I'd keep it similar to a tuple.
Apparently there are considerations to be made: https://stackoverflow.com/questions/51671699/data-classes-vs-typing-namedtuple-primary-use-cases
But the usage is quite small for this case in particular.
What I want out of this is typing info, to improve code readability.
I'm OK changing to dataclass if preferred.
i don't really understand what's happening, i should maybe pause and ramp up and then try reviewing again
if report_builder_session.should_use_label_index:
    label_list_of_lists = [
        self._get_list_of_label_ids(
            report_builder_session.label_index,
            file_coverage.get("contexts", {}).get(str(ln), []),
        )
    ]
else:
    label_list_of_lists = [
        [self._convert_testname_to_label(testname)]
        for testname in file_coverage.get("contexts", {}).get(
            str(ln), []
        )
    ]
top branch creates an outer list containing a single large inner list, bottom branch creates an outer list containing many single-entry inner lists. is it okay that they don't match?
That's a very good catch. There is a difference indeed.
This goes to `ReportBuilderSession.create_coverage_line` (here). On the top it creates a single datapoint with all labels. In the bottom it creates many datapoints, one label per datapoint.
This difference is crucial when updating the report with new uploads, in particular when we delete datapoints. It seems that the bottom one is more precise and keeps data available longer. Sadly it also uses more space. But I'll keep the same way it was before.
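A worked example of the difference being discussed (the values are made up; the two shapes mirror the top and bottom branches of the snippet above):

```python
# Illustrative values; the branch shapes mirror the snippet above.
contexts = ["tests/test_a.py::test_x", "tests/test_a.py::test_y"]
label_index = {1: "tests/test_a.py::test_x", 2: "tests/test_a.py::test_y"}

# Top branch (encoded path): one inner list containing every label id
# -> a single datapoint carrying both labels.
top_shape = [[1, 2]]

# Bottom branch (unencoded path): one single-entry inner list per label
# -> one datapoint per label.
bottom_shape = [[label] for label in contexts]
assert bottom_shape == [["tests/test_a.py::test_x"], ["tests/test_a.py::test_y"]]

# Why the shape matters: deleting one label removes every datapoint that
# contains it. With the single big list, removing test_x's datapoint also
# loses the fact that test_y covered the line; with per-label datapoints
# the remaining label survives.
```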
# It's OK to update this here because it's inside the
# UploadProcessing lock, so it's exclusive access
original_report._labels_index[new_index] = label_or_id
`_labels_index` is just an instance var, right? what would the race be?
is there an alternative to this that would not make parallel upload processing harder?
The race would be different instances of the same report (that all pull the `labels_index` from the same file) writing different things to it and then trying to save the report to the same destination. But because all changes happen only in the merge portion of processing it's fine. The merge needs to be in a lock anyway.
However I should be more careful that the report might be used for processing and non-processing purposes at the same time... and only save it into storage if it was modified.
Anyway, it doesn't really matter for parallel processing, because the uploads will be processed into temporary reports with their own indexes. You can even merge them into a single report to be merged with the original in parallel, just take care of the index (with the aux functions in this file). But ultimately you will have to put a lock on the original report when merging into it, right?
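For illustration, a sketch of the kind of remapping that happens when a temporary report's index meets the original one (the function and variable names are hypothetical, not the actual helpers in this file): new labels extend the original index, and that write is the part that has to stay inside the upload-processing lock.

```python
from typing import Dict


def merge_label_ids(original_index: Dict[int, str], temp_index: Dict[int, str]) -> Dict[int, int]:
    """Map each temp-report label id to the original report's id, extending
    the original index in place when a label hasn't been seen before."""
    label_to_id = {label: idx for idx, label in original_index.items()}
    next_id = max(original_index, default=-1) + 1
    remap: Dict[int, int] = {}
    for temp_id, label in temp_index.items():
        if label in label_to_id:
            remap[temp_id] = label_to_id[label]
        else:
            original_index[next_id] = label  # the write that must stay inside the lock
            label_to_id[label] = next_id
            remap[temp_id] = next_id
            next_id += 1
    return remap


original = {0: "placeholder", 1: "tests/test_a.py::test_x"}
temporary = {0: "placeholder", 1: "tests/test_b.py::test_new"}
assert merge_label_ids(original, temporary) == {0: 0, 1: 2}
assert original[2] == "tests/test_b.py::test_new"
```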
As pointed out by @matt-codecov, there was a difference: the old behavior creates an "outer list containing many single-entry inner lists" whereas the new behavior was creating an "outer list containing a single large inner list". This difference is subtle but it matters. Each of the inner lists becomes a datapoint with all the labels in the list. Later on, when [deleting labels](https://github.com/codecov/shared/blob/main/shared/reports/editable.py#L52) we delete a datapoint if _any_ of the labels to delete is in it. This means that the old behavior made the datapoints stick around for longer. I should probably investigate if we ever merge datapoints... not sure. Anyway, matching old behavior.
Addressing various review comments around renaming functions, updating comments and small refactors. Also updates shared again, where `_labels_index` was renamed to `labels_index`, given that it's accessed all the time. The biggest refactor here is around carrying forward the labels index. As suggested we are skipping the `LabelsIndexService` and directly using the `ArchiveService` to copy the file from the parent commit to the child commit.
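A sketch of that carryforward path (the archive class, method names, and file layout below are illustrative stand-ins, not the real `ArchiveService` API): the child commit simply gets a byte-for-byte copy of the parent's index file, with no `LabelsIndexService` involved.

```python
class InMemoryArchive:
    # Illustrative stand-in for the archive service: a blob store keyed by path.
    def __init__(self) -> None:
        self._storage = {}

    def read_file(self, path: str) -> bytes:
        return self._storage[path]

    def write_file(self, path: str, data: bytes) -> None:
        self._storage[path] = data


def labels_index_path(commit_sha: str) -> str:
    # Hypothetical layout for where a commit's labels index file lives.
    return f"labels_index/{commit_sha}.json"


def carryforward_labels_index(archive: InMemoryArchive, parent_sha: str, child_sha: str) -> None:
    # Copy the parent's labels index file to the child commit as-is.
    data = archive.read_file(labels_index_path(parent_sha))
    archive.write_file(labels_index_path(child_sha), data)


archive = InMemoryArchive()
archive.write_file(labels_index_path("a4febe6"), b'{"0": "placeholder"}')
carryforward_labels_index(archive, "a4febe6", "e004bd6")
assert archive.read_file(labels_index_path("e004bd6")) == b'{"0": "placeholder"}'
```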
`labels_index` is used in the following scenarios: 1. merging reports during processing 2. label_analysis. In these cases we load it, use it, unload it. Unloading includes saving it to storage again, and that is how changes are persisted. ONLY scenario 1 actually makes changes to it. Now imagine that for some reason there's a label-analysis request that takes a long time with the index open (unlikely), and in that period an upload is processed (unlikely). Then we would lose the updated index when label-analysis re-writes it to storage. To prevent that without a proper locking mechanism we can just let label_analysis NOT save it back. And because the merging scenario already runs inside a lock we are fine :)
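A sketch of that load / use / unload pattern with the "label analysis does not save" rule made explicit (the classes, method names, and paths below are hypothetical, not the current `LabelsIndexService`):

```python
import json


class InMemoryArchive:
    # Illustrative blob store standing in for the real archive storage.
    def __init__(self) -> None:
        self._storage = {}

    def read_file(self, path: str) -> bytes:
        return self._storage[path]

    def write_file(self, path: str, data: bytes) -> None:
        self._storage[path] = data


class LabelsIndexSession:
    """Hypothetical load / use / unload wrapper. Only the caller that may
    mutate the index (report merging, inside its lock) writes it back on
    unload; label analysis loads it read-only and never saves."""

    def __init__(self, archive, path: str, save_on_unload: bool) -> None:
        self.archive = archive
        self.path = path
        self.save_on_unload = save_on_unload
        self.index = None

    def load(self) -> dict:
        self.index = json.loads(self.archive.read_file(self.path))
        return self.index

    def unload(self) -> None:
        # Scenario 1 (merging, under the upload lock) persists changes;
        # scenario 2 (label analysis) skips the write so a slow read-only
        # session can never clobber an index updated in the meantime.
        if self.save_on_unload:
            self.archive.write_file(self.path, json.dumps(self.index).encode())
        self.index = None


archive = InMemoryArchive()
archive.write_file("labels_index/abc.json", b'{"0": "placeholder"}')
analysis = LabelsIndexSession(archive, "labels_index/abc.json", save_on_unload=False)
analysis.load()
analysis.unload()  # nothing written back
```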
This PR will be replaced by another one.