Use sp_return_logs for billing reports, not monthly table by zachmargolis · Pull Request #6935 · 18F/identity-idp

zachmargolis · 2022-09-09T20:29:22Z

Accuracy

We caught a few discrepancies in our reports

the monthly table is off in some some cases, I couldn't get it to match sp_return_logs, so since sp_return_logs is the raw table, it seemed more accurate and easier to check if both reports used the same thing
when we use BETWEEN '2021-01-01' AND '2021-01-31' we get truncated results compared to the fuller timestamp version BETWEEN '2021-01-01 00:00:00.00000' AND '2021-01-31 23:59:59.999999' so switches this to use that

Performance

sp_return_logs is a bigger table, so these queries will likely be slower. I am running them now to make sure they return eventually, but unfortunately I think we need to be making this performance/accuracy tradeoff right now.

Next Steps

Apparently billing only uses these two reports, so maybe we should remove all the rest
Maybe we drop the monthly tables since we have trouble reconciling them, and we still retain the raw data

* Also update month ranges to use timestamps instead of dates to make sure we're at the correct beginning/end of day

zachmargolis · 2022-09-09T20:30:05Z

app/jobs/reports/month_helper.rb

    #   ]
    # @param [Range<Date>] date_range
-    # @return [Array<Range<Date>>]
+    # @return [Array<Range<Time>>]


technically these are ActiveSupport::TimeWithZone ... but that was a lot more to write, and that's a subclass, so I felt like the gist was correct here

app/jobs/reports/month_helper.rb

**Why**: Sometimes these long-running queries get serialization errors, so let's retry them individually rather than throw away the whole report

changelog: Internal, Reporting, Update billing reports to be more accurate

zachmargolis · 2022-09-09T21:29:59Z

app/services/db/monthly_sp_auth_count/total_monthly_auth_counts_within_iaa_window.rb

+          temp_copy = ial_to_year_month_to_users.deep_dup
+
+          with_retries(
+            max_tries: 3,
+            rescue: PG::TRSerializationFailure,
+            handler: proc { ial_to_year_month_to_users = temp_copy },
+          ) do


see commit notes (70ae9c6) for a longer comment, but short version is we get these errors occasionally and I figured it was worth a shot seeing if we could quickly retry + recover rather than abort the entire job

and in case it's not obvious what it's doing, we have a ruby nested hash/multiset thing object where we add results incrementally and this is creating a copy before streaming the results, and then restoring the copy to the last known good result every time the query fails

we could rewrite this as begin/rescue syntax but we'd need add some sleep code and need a retry counter ourself, so this seemed clear enough? the alternative would be:

temp_copy = ial_to_year_month_to_users.deep_dup attempt_count = 0 begin stream_query(query) do |row| # ... end rescue PG::TRSerializationFailure => e attempt_count += 1 if attempt_count <3 ial_to_year_month_to_users = temp_copy retry else raise e end end

zachmargolis · 2022-09-09T22:35:19Z

spec/services/db/monthly_sp_auth_count/unique_monthly_auth_counts_by_iaa_spec.rb

            iaa_start_date: iaa_range.begin.to_s,
            iaa_end_date: iaa_range.end.to_s,
-            total_auth_count: 300,
+            total_auth_count: 21,


now that we're using the direct table, this was just to create fewer rows and make a faster test

zachmargolis · 2022-09-09T22:35:55Z

spec/jobs/reports/agency_invoice_iaa_supplement_report_spec.rb

+            {
+              iaa: iaa2_key,
+              ial1_total_auth_count: 0,
+              ial2_total_auth_count: 1,
+              ial1_unique_users: 0,
+              ial2_unique_users: 1,
+              ial1_new_unique_users: 0,
+              ial2_new_unique_users: 1,
+              year_month: inside_iaa2.strftime('%Y%m'),
+              iaa_start_date: iaa2_range.begin.to_s,
+              iaa_end_date: iaa2_range.end.to_s,
+            },


I think this is correct now that this has another section... I think it may have been grouped inaccurately before

zachmargolis · 2022-09-09T23:02:59Z

app/services/db/monthly_sp_auth_count/total_monthly_auth_counts_within_iaa_window.rb

-                  sp_return_logs.requested_at::date BETWEEN %{range_start} AND %{range_end}
+                  sp_return_logs.requested_at BETWEEN %{range_start} AND %{range_end}


so I think that this change may mean these are essentially unindexed queries now 😬 , this is our partial index:

identity-idp/db/schema.rb

Line 588 in c62f569

t.index "((requested_at)::date), issuer", name: "index_sp_return_logs_on_requested_at_date_issuer", where: "(returned_at IS NOT NULL)"

cc @mitchellhenke @stevegsa @jmhooper if you have any thoughts on if I should try to add a "plain" index on (requested_at, issuer)? (and it's a huge table so I know it would fail during a normal deploy)

Yes we need requested_at and issuer does nothing it can be dropped.

for the combined billing reports, we do break things out by issuer by month, so I think it does help for those?

update, thanks to some investigation help from @mitchellhenke, the ::date truncation does work like we expect so I undid this timestamp change part in c5b005e

stevegsa

looks great. just need the index and i think we are good to go. not sure why date isn't getting that equivalent full timestamp though. if it was this pr wouldn't be needed correct?

…-logs

work like we expect

zachmargolis added 2 commits September 9, 2022 12:17

Rewrite total-monthly-auths-report to use sp_return_logs

7d39180

Update combined-invoice-supplement report to use sp_return_logs

135d707

* Also update month ranges to use timestamps instead of dates to make sure we're at the correct beginning/end of day

zachmargolis requested a review from a team September 9, 2022 20:29

zachmargolis commented Sep 9, 2022

View reviewed changes

zachmargolis added 4 commits September 9, 2022 13:56

Update MonthHelper to not subtract one, most contracts don't "overlap"

9cda538

Fix a few specs to use the right tables

6a3b737

Add manual retry + rollback

70ae9c6

**Why**: Sometimes these long-running queries get serialization errors, so let's retry them individually rather than throw away the whole report

Add changelog

fbc1675

changelog: Internal, Reporting, Update billing reports to be more accurate

zachmargolis commented Sep 9, 2022

View reviewed changes

zachmargolis marked this pull request as ready for review September 9, 2022 21:46

Fix a few spec stragglers

3f12f17

zachmargolis commented Sep 9, 2022

View reviewed changes

zachmargolis mentioned this pull request Sep 13, 2022

Update total-monthly-auths report to pull from the raw table #6952

Merged

stevegsa approved these changes Sep 13, 2022

View reviewed changes

zachmargolis added 2 commits September 13, 2022 11:58

Merge remote-tracking branch 'origin/main' into margolis-plain-return…

25d6e2d

…-logs

Undo switch to timestamps, turns out ::date truncation does in fact

c5b005e

work like we expect

zachmargolis merged commit 636c497 into main Sep 15, 2022

zachmargolis deleted the margolis-plain-return-logs branch September 15, 2022 20:15

mitchellhenke mentioned this pull request Sep 19, 2022

Deploy RC 210 to Production #6984

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use sp_return_logs for billing reports, not monthly table#6935

Use sp_return_logs for billing reports, not monthly table#6935
zachmargolis merged 9 commits intomainfrom
margolis-plain-return-logs

zachmargolis commented Sep 9, 2022 •

edited

Loading

Uh oh!

zachmargolis Sep 9, 2022

Uh oh!

Uh oh!

zachmargolis Sep 9, 2022

Uh oh!

zachmargolis Sep 9, 2022 •

edited

Loading

Uh oh!

zachmargolis Sep 9, 2022

Uh oh!

zachmargolis Sep 9, 2022

Uh oh!

zachmargolis Sep 9, 2022 •

edited

Loading

Uh oh!

stevegsa Sep 13, 2022

Uh oh!

zachmargolis Sep 13, 2022

Uh oh!

zachmargolis Sep 15, 2022

Uh oh!

stevegsa left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

		sp_return_logs.requested_at::date BETWEEN %{range_start} AND %{range_end}
		sp_return_logs.requested_at BETWEEN %{range_start} AND %{range_end}

Conversation

zachmargolis commented Sep 9, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Accuracy

Performance

Next Steps

Uh oh!

zachmargolis Sep 9, 2022

Choose a reason for hiding this comment

Uh oh!

Uh oh!

zachmargolis Sep 9, 2022

Choose a reason for hiding this comment

Uh oh!

zachmargolis Sep 9, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

zachmargolis Sep 9, 2022

Choose a reason for hiding this comment

Uh oh!

zachmargolis Sep 9, 2022

Choose a reason for hiding this comment

Uh oh!

zachmargolis Sep 9, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

stevegsa Sep 13, 2022

Choose a reason for hiding this comment

Uh oh!

zachmargolis Sep 13, 2022

Choose a reason for hiding this comment

Uh oh!

zachmargolis Sep 15, 2022

Choose a reason for hiding this comment

Uh oh!

stevegsa left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

zachmargolis commented Sep 9, 2022 •

edited

Loading

zachmargolis Sep 9, 2022 •

edited

Loading

zachmargolis Sep 9, 2022 •

edited

Loading