LG-8440: Ingest in-person enrollment status updates from SQS by NavaTim · Pull Request #8403 · 18F/identity-idp

NavaTim · 2023-05-16T18:39:50Z

🎫 Ticket

🛠 Summary of changes

Create InPerson::EnrollmentsReadyForStatusCheckJob job to ingest in-person enrollment status updates from USPS.

The following pseudocode approximates the job's behavior, minus error handling.

Report job start to analytics
Loop: Pull SQS messages into received batch
  Initialize empty deletion batch
  Loop: For each message in received batch
    Unwrap SQS payload
    Unwrap SNS payload
    Unwrap SES payload
    Parse raw email body
    Extract enrollment code
    Update enrollment record in DB
    Add SQS message to deletion batch
  End Loop
  Delete messages in deletion batch
End Loop
Report job completion to analytics

This adds the aws-sdk-sqs which was not previously present; this is the official AWS package. The mail gem is (and previously was) a direct dependency, so it has been added to the Gemfile; this does not constitute a substantial change, but reduces the risk of regression.

📜 Testing Plan

Enable and configure feature with valid SQS queue accessible from the IDP worker
Create in-person enrollment record
Send payload to SQS queue containing email payload corresponding to the enrollment record
Wait for or trigger cron job
Check that the ready_for_status_check flag is set on the enrollment record

Logs from local testing

The following logs cover a local test where 7 emails were sent. The batch size was adjusted to 5 in order to ensure the job fetched multiple times.

Expected success and deletion:

1 email was the exact expected format
1 email had whitespace around the enrollment code
1 email contained text without an enrollment code

Expected failure and deletion:

1 email contained a code that did not match an existing enrollment record
3 blank emails

Another error occurred prior to the test where I was able to confirm that the job will log the completion (as expected) even when an error is thrown.

==> ./log/workers.log <==
{"duration_ms":0.019999980926513672,"timestamp":"2023-05-19T22:32:06.176Z","name":"perform_start.active_job","job_class":"InPerson::EnrollmentsReadyForStatusCheckJob","trace_id":null,"queue_name":"GoodJob(low)","job_id":"b1462a37-47aa-4cea-8c30-10abec1e6e2f","enqueued_at":null,"queued_duration_ms":null}
Performing InPerson::EnrollmentsReadyForStatusCheckJob (Job ID: b1462a37-47aa-4cea-8c30-10abec1e6e2f) from GoodJob(low) enqueued at 

==> ./log/events.log <==
{"name":"InPersonEnrollmentsReadyForStatusCheckJob: Job started","properties":{"event_properties":{},"new_event":true,"path":null,"session_duration":null,"user_id":"anonymous-uuid","locale":"en"},"time":"2023-05-19T22:32:06.187Z","id":"80d9ce3a-1abe-4ffa-902a-04d4a2b035dc","visitor_id":"5b81ee2c-e4c4-4404-a5ca-be6a90392293","visit_id":"c8f723d8-09c4-46c9-bacb-ed7c756c6280"}

==> ./log/production.log <==
Aws::SQS::Client 200 1.512384 0 receive_message [ ]

==> ./log/events.log <==
{"name":"InPersonEnrollmentsReadyForStatusCheckJob: Ingestion error","properties":{"event_properties":{"exception_class":"RuntimeError","exception_message":"InPerson::EnrollmentsReadyForStatusCheck::EnrollmentPipeline: Failure occurred when attempting to get email body","sqs_message_id":"5c5e05d9-00f7-435c-b7e2-5f2eb81cc068","sns_message_id":"edccd4fc-1871-56b8-a38d-881de00640d2","ses_aws_message_id":"g8a5ai89mi3om5vbhgclqprpt04rq16m0d0d7r81","ses_mail_timestamp":"2023-05-19T22:20:38.534Z","ses_mail_source":"timothy.bradley@gsa.gov","ses_rfc_origination_date":"2023-05-19T15:20:26-07:00","ses_rfc_message_id":"\u003cCANYMRCHnW5rLsWDetv1RjRc3oMnKxsOZMHM2tSk1H4sv0ALgaA@mail.gmail.com\u003e"},"new_event":true,"path":null,"session_duration":null,"user_id":"anonymous-uuid","locale":"en"},"time":"2023-05-19T22:32:07.809Z","id":"558511d2-74a3-4e0b-b66b-df607c4a43dd","visitor_id":"b4da149d-3a99-4d53-a463-f660c3b2c99a","visit_id":"04153042-24c2-475d-961a-499f7b9e0668"}

==> ./log/development.log <==
  InPersonEnrollment Pluck (0.8ms)  SELECT "in_person_enrollments"."id", "in_person_enrollments"."user_id", "in_person_enrollments"."ready_for_status_check" FROM "in_person_enrollments" WHERE "in_person_enrollments"."enrollment_code" = $1 AND "in_person_enrollments"."status" = $2 ORDER BY "in_person_enrollments"."created_at" DESC LIMIT $3  [["enrollment_code", "2048702198809999"], ["status", 1], ["LIMIT", 1]]

==> ./log/events.log <==
{"name":"InPersonEnrollmentsReadyForStatusCheckJob: Ingestion error","properties":{"event_properties":{"exception_class":"RuntimeError","exception_message":"InPerson::EnrollmentsReadyForStatusCheck::EnrollmentPipeline: Received code for enrollment that does not exist in the database","sqs_message_id":"3e09d8b6-1bce-4870-a8a2-c468498c28a4","sns_message_id":"3fc8db28-4e38-5ae8-98c0-87c9b6c03a85","ses_aws_message_id":"1conen8qaha1cm76pq9qigv52nq41u99gkf8rco1","ses_mail_timestamp":"2023-05-19T22:22:47.518Z","ses_mail_source":"timothy.bradley@gsa.gov","ses_rfc_origination_date":"2023-05-19T15:22:35-07:00","ses_rfc_message_id":"\u003cCANYMRCERPgo78+AwJFbKL2DwmR-2a76-pMeYG8phycuwL12KEw@mail.gmail.com\u003e","enrollment_code":"2048702198809999"},"new_event":false,"path":null,"session_duration":null,"user_id":"anonymous-uuid","locale":"en"},"time":"2023-05-19T22:32:07.856Z","id":"83939068-53a9-468c-90fb-d355d17306c2","visitor_id":"b4da149d-3a99-4d53-a463-f660c3b2c99a","visit_id":"04153042-24c2-475d-961a-499f7b9e0668"}

==> ./log/production.log <==
Aws::SQS::Client 200 0.16097 0 delete_message_batch [ ]
Aws::SQS::Client 200 0.398405 0 receive_message [ ]

==> ./log/development.log <==
  InPersonEnrollment Pluck (0.7ms)  SELECT "in_person_enrollments"."id", "in_person_enrollments"."user_id", "in_person_enrollments"."ready_for_status_check" FROM "in_person_enrollments" WHERE "in_person_enrollments"."enrollment_code" = $1 AND "in_person_enrollments"."status" = $2 ORDER BY "in_person_enrollments"."created_at" DESC LIMIT $3  [["enrollment_code", "2048702198801234"], ["status", 1], ["LIMIT", 1]]
  InPersonEnrollment Load (0.4ms)  SELECT "in_person_enrollments"."id", "in_person_enrollments"."user_id", "in_person_enrollments"."profile_id", "in_person_enrollments"."enrollment_code", "in_person_enrollments"."status_check_attempted_at", "in_person_enrollments"."status_updated_at", "in_person_enrollments"."status", "in_person_enrollments"."created_at", "in_person_enrollments"."updated_at", "in_person_enrollments"."current_address_matches_id", "in_person_enrollments"."selected_location_details", "in_person_enrollments"."unique_id", "in_person_enrollments"."enrollment_established_at", "in_person_enrollments"."issuer", "in_person_enrollments"."follow_up_survey_sent", "in_person_enrollments"."early_reminder_sent", "in_person_enrollments"."late_reminder_sent", "in_person_enrollments"."deadline_passed_sent", "in_person_enrollments"."proofed_at", "in_person_enrollments"."capture_secondary_id_enabled", "in_person_enrollments"."status_check_completed_at", "in_person_enrollments"."ready_for_status_check" FROM "in_person_enrollments" WHERE "in_person_enrollments"."id" = $1 LIMIT $2  [["id", 51], ["LIMIT", 1]]
  TRANSACTION (0.2ms)  BEGIN
  Profile Load (1.0ms)  SELECT "profiles"."id", "profiles"."user_id", "profiles"."active", "profiles"."verified_at", "profiles"."activated_at", "profiles"."created_at", "profiles"."updated_at", "profiles"."encrypted_pii", "profiles"."ssn_signature", "profiles"."encrypted_pii_recovery", "profiles"."deactivation_reason", "profiles"."proofing_components", "profiles"."name_zip_birth_year_signature", "profiles"."reproof_at", "profiles"."initiating_service_provider_issuer", "profiles"."fraud_review_pending_at", "profiles"."fraud_rejection_at", "profiles"."gpo_verification_pending_at" FROM "profiles" WHERE "profiles"."id" = $1 LIMIT $2  [["id", 1], ["LIMIT", 1]]
  User Load (0.5ms)  SELECT "users"."id", "users"."reset_password_token", "users"."reset_password_sent_at", "users"."remember_created_at", "users"."created_at", "users"."updated_at", "users"."confirmed_at", "users"."second_factor_attempts_count", "users"."uuid", "users"."second_factor_locked_at", "users"."phone_confirmed_at", "users"."direct_otp", "users"."direct_otp_sent_at", "users"."unique_session_id", "users"."otp_delivery_preference", "users"."encrypted_password_digest", "users"."encrypted_recovery_code_digest", "users"."remember_device_revoked_at", "users"."email_language", "users"."accepted_terms_at", "users"."encrypted_recovery_code_digest_generated_at" FROM "users" WHERE "users"."id" = $1 LIMIT $2  [["id", 2], ["LIMIT", 1]]
  User Load (0.6ms)  SELECT "users"."id", "users"."reset_password_token", "users"."reset_password_sent_at", "users"."remember_created_at", "users"."created_at", "users"."updated_at", "users"."confirmed_at", "users"."second_factor_attempts_count", "users"."uuid", "users"."second_factor_locked_at", "users"."phone_confirmed_at", "users"."direct_otp", "users"."direct_otp_sent_at", "users"."unique_session_id", "users"."otp_delivery_preference", "users"."encrypted_password_digest", "users"."encrypted_recovery_code_digest", "users"."remember_device_revoked_at", "users"."email_language", "users"."accepted_terms_at", "users"."encrypted_recovery_code_digest_generated_at" FROM "users" WHERE "users"."id" = $1 LIMIT $2  [["id", 2], ["LIMIT", 1]]
  InPersonEnrollment Update (0.6ms)  UPDATE "in_person_enrollments" SET "updated_at" = $1, "ready_for_status_check" = $2 WHERE "in_person_enrollments"."id" = $3  [["updated_at", "2023-05-19 22:32:08.481168"], ["ready_for_status_check", true], ["id", 51]]
  TRANSACTION (2.3ms)  COMMIT
  InPersonEnrollment Pluck (0.6ms)  SELECT "in_person_enrollments"."id", "in_person_enrollments"."user_id", "in_person_enrollments"."ready_for_status_check" FROM "in_person_enrollments" WHERE "in_person_enrollments"."enrollment_code" = $1 AND "in_person_enrollments"."status" = $2 ORDER BY "in_person_enrollments"."created_at" DESC LIMIT $3  [["enrollment_code", "2048702198804353"], ["status", 1], ["LIMIT", 1]]
  InPersonEnrollment Load (0.7ms)  SELECT "in_person_enrollments"."id", "in_person_enrollments"."user_id", "in_person_enrollments"."profile_id", "in_person_enrollments"."enrollment_code", "in_person_enrollments"."status_check_attempted_at", "in_person_enrollments"."status_updated_at", "in_person_enrollments"."status", "in_person_enrollments"."created_at", "in_person_enrollments"."updated_at", "in_person_enrollments"."current_address_matches_id", "in_person_enrollments"."selected_location_details", "in_person_enrollments"."unique_id", "in_person_enrollments"."enrollment_established_at", "in_person_enrollments"."issuer", "in_person_enrollments"."follow_up_survey_sent", "in_person_enrollments"."early_reminder_sent", "in_person_enrollments"."late_reminder_sent", "in_person_enrollments"."deadline_passed_sent", "in_person_enrollments"."proofed_at", "in_person_enrollments"."capture_secondary_id_enabled", "in_person_enrollments"."status_check_completed_at", "in_person_enrollments"."ready_for_status_check" FROM "in_person_enrollments" WHERE "in_person_enrollments"."id" = $1 LIMIT $2  [["id", 44], ["LIMIT", 1]]
  TRANSACTION (0.2ms)  BEGIN
  Profile Load (0.7ms)  SELECT "profiles"."id", "profiles"."user_id", "profiles"."active", "profiles"."verified_at", "profiles"."activated_at", "profiles"."created_at", "profiles"."updated_at", "profiles"."encrypted_pii", "profiles"."ssn_signature", "profiles"."encrypted_pii_recovery", "profiles"."deactivation_reason", "profiles"."proofing_components", "profiles"."name_zip_birth_year_signature", "profiles"."reproof_at", "profiles"."initiating_service_provider_issuer", "profiles"."fraud_review_pending_at", "profiles"."fraud_rejection_at", "profiles"."gpo_verification_pending_at" FROM "profiles" WHERE "profiles"."id" = $1 LIMIT $2  [["id", 8], ["LIMIT", 1]]
  User Load (0.3ms)  SELECT "users"."id", "users"."reset_password_token", "users"."reset_password_sent_at", "users"."remember_created_at", "users"."created_at", "users"."updated_at", "users"."confirmed_at", "users"."second_factor_attempts_count", "users"."uuid", "users"."second_factor_locked_at", "users"."phone_confirmed_at", "users"."direct_otp", "users"."direct_otp_sent_at", "users"."unique_session_id", "users"."otp_delivery_preference", "users"."encrypted_password_digest", "users"."encrypted_recovery_code_digest", "users"."remember_device_revoked_at", "users"."email_language", "users"."accepted_terms_at", "users"."encrypted_recovery_code_digest_generated_at" FROM "users" WHERE "users"."id" = $1 LIMIT $2  [["id", 6], ["LIMIT", 1]]
  User Load (0.3ms)  SELECT "users"."id", "users"."reset_password_token", "users"."reset_password_sent_at", "users"."remember_created_at", "users"."created_at", "users"."updated_at", "users"."confirmed_at", "users"."second_factor_attempts_count", "users"."uuid", "users"."second_factor_locked_at", "users"."phone_confirmed_at", "users"."direct_otp", "users"."direct_otp_sent_at", "users"."unique_session_id", "users"."otp_delivery_preference", "users"."encrypted_password_digest", "users"."encrypted_recovery_code_digest", "users"."remember_device_revoked_at", "users"."email_language", "users"."accepted_terms_at", "users"."encrypted_recovery_code_digest_generated_at" FROM "users" WHERE "users"."id" = $1 LIMIT $2  [["id", 6], ["LIMIT", 1]]
  InPersonEnrollment Update (0.4ms)  UPDATE "in_person_enrollments" SET "updated_at" = $1, "ready_for_status_check" = $2 WHERE "in_person_enrollments"."id" = $3  [["updated_at", "2023-05-19 22:32:08.498820"], ["ready_for_status_check", true], ["id", 44]]
  TRANSACTION (0.4ms)  COMMIT

==> ./log/events.log <==
{"name":"InPersonEnrollmentsReadyForStatusCheckJob: Ingestion error","properties":{"event_properties":{"exception_class":"RuntimeError","exception_message":"InPerson::EnrollmentsReadyForStatusCheck::EnrollmentPipeline: Failure occurred when attempting to get email body","sqs_message_id":"9e08c28e-6a9d-4ed9-85bd-28860f886bcd","sns_message_id":"673c75f0-d6f0-58bd-a425-29112fbb5353","ses_aws_message_id":"vb9vbqtihrm3e1jqaajgvmbvpcl9s9vhse2hca01","ses_mail_timestamp":"2023-05-19T22:24:10.787Z","ses_mail_source":"timothy.bradley@gsa.gov","ses_rfc_origination_date":"2023-05-19T15:23:58-07:00","ses_rfc_message_id":"\u003cCANYMRCHcOBOtLN0nURQL0PTB0g7zOQZVr5c=nJySRisYmbAeJw@mail.gmail.com\u003e"},"new_event":false,"path":null,"session_duration":null,"user_id":"anonymous-uuid","locale":"en"},"time":"2023-05-19T22:32:08.503Z","id":"61a0f8b6-c5f1-4216-b2f4-aeea199d631d","visitor_id":"b4da149d-3a99-4d53-a463-f660c3b2c99a","visit_id":"04153042-24c2-475d-961a-499f7b9e0668"}

==> ./log/production.log <==
Aws::SQS::Client 200 0.156043 0 delete_message_batch [ ]
Aws::SQS::Client 200 0.243185 0 receive_message [ ]

==> ./log/events.log <==
{"name":"InPersonEnrollmentsReadyForStatusCheckJob: Ingestion error","properties":{"event_properties":{"exception_class":"RuntimeError","exception_message":"InPerson::EnrollmentsReadyForStatusCheck::EnrollmentPipeline: Failure occurred when attempting to get email body","sqs_message_id":"643969ae-7915-4ecf-920c-29c9d991f0a1","sns_message_id":"57847377-fbbd-5433-bc71-2f9603f4f0ff","ses_aws_message_id":"okm73d4p9c7ko4dg1b9ahrikr4s8odf7sgd8jdo1","ses_mail_timestamp":"2023-05-19T22:19:53.147Z","ses_mail_source":"timothy.bradley@gsa.gov","ses_rfc_origination_date":"2023-05-19T15:19:40-07:00","ses_rfc_message_id":"\u003cCANYMRCH4=9X-FkB-76VFnQ=0Vb5hMomhYU-9KGM31cg1k3hwDQ@mail.gmail.com\u003e"},"new_event":false,"path":null,"session_duration":null,"user_id":"anonymous-uuid","locale":"en"},"time":"2023-05-19T22:32:08.906Z","id":"07f5c35c-42fe-4940-bef4-65ca9ad2d9cc","visitor_id":"b4da149d-3a99-4d53-a463-f660c3b2c99a","visit_id":"04153042-24c2-475d-961a-499f7b9e0668"}

==> ./log/production.log <==
Aws::SQS::Client 200 0.152797 0 delete_message_batch [ ]
Aws::SQS::Client 200 0.20169 0 receive_message [ ]

==> ./log/events.log <==
{"name":"InPersonEnrollmentsReadyForStatusCheckJob: Ingestion error","properties":{"event_properties":{"exception_class":"RuntimeError","exception_message":"InPerson::EnrollmentsReadyForStatusCheck::EnrollmentPipeline: Failed to extract enrollment code using regex, check email body format and regex","sqs_message_id":"08153bde-f8cf-40d4-b32f-4db8f66e159e","sns_message_id":"495227d2-cab5-5d63-b2ad-7b333279b0eb","ses_aws_message_id":"qgrmdo4teol3jiqbkl9ajn3kaep0dfrfdiuis401","ses_mail_timestamp":"2023-05-19T22:22:04.180Z","ses_mail_source":"timothy.bradley@gsa.gov","ses_rfc_origination_date":"2023-05-19T15:21:52-07:00","ses_rfc_message_id":"\u003cCANYMRCEo_Lv6QN_zd7KzEj4Na=EvcSxjmciJsBQaPB8YjVGtSQ@mail.gmail.com\u003e"},"new_event":false,"path":null,"session_duration":null,"user_id":"anonymous-uuid","locale":"en"},"time":"2023-05-19T22:32:09.265Z","id":"8dbdb93d-aa9e-4264-9527-f089f2304bd4","visitor_id":"b4da149d-3a99-4d53-a463-f660c3b2c99a","visit_id":"04153042-24c2-475d-961a-499f7b9e0668"}

==> ./log/production.log <==
Aws::SQS::Client 200 0.145369 0 delete_message_batch [ ]
Aws::SQS::Client 200 20.191978 0 receive_message [ ]

==> ./log/events.log <==
{"name":"InPersonEnrollmentsReadyForStatusCheckJob: Job completed","properties":{"event_properties":{"fetched_items":7,"processed_items":7,"deleted_items":7,"valid_items":2,"invalid_items":5,"incomplete_items":0,"deletion_failed_items":0},"new_event":true,"path":null,"session_duration":null,"user_id":"anonymous-uuid","locale":"en"},"time":"2023-05-19T22:32:29.604Z","id":"4e662e4f-02b9-48a5-8f84-4b8f2723b5d5","visitor_id":"37bd0788-7732-4a02-81c4-dbaae075398d","visit_id":"10a81cc7-9997-4e8c-afc9-c5b58c0eeaa8"}

==> ./log/workers.log <==
{"duration_ms":23429.647000074387,"timestamp":"2023-05-19T22:32:29.605Z","name":"perform.active_job","job_class":"InPerson::EnrollmentsReadyForStatusCheckJob","trace_id":null,"queue_name":"GoodJob(low)","job_id":"b1462a37-47aa-4cea-8c30-10abec1e6e2f","enqueued_at":null}
Performed InPerson::EnrollmentsReadyForStatusCheckJob (Job ID: b1462a37-47aa-4cea-8c30-10abec1e6e2f) from GoodJob(low) in 23430.02ms

zachmargolis · 2023-05-16T19:13:50Z

app/jobs/in_person/enrollments_ready_for_status_check/uses_analytics.rb

Is "uses" an abbreviation, or is it like the verb like "this uses that"?

Uses is a verb here.

ok! I feel like that's not a common naming convention we have in this repo? Is there a particular reason to start that now?

The use of this convention is pretty isolated, so I don't think it's a big concern. If you want to have the job refactored to use delegation instead of mixins though then I'll rename accordingly.

zachmargolis · 2023-05-16T19:15:25Z

app/jobs/in_person/enrollments_ready_for_status_check/uses_sqs_client.rb

If I'm reading this right, it looks like we are only including this mixin in one place. Do we have plans for future SQS topics? (my understanding is no, we do not) so I think that it would make sense to just inline this for now

This mixin is used by the job (included by both the job directly and the batch processor), but I'm intentionally separating this out to reduce the complexity of working on this section of the code. I did consider renaming this to SqsBatchWrapper.

What about using delegation instead of inheritance via mixin? To have an SQS client/consumer class that we can instantiate instead of mixing in? delegation is much easier to follow than included methods from mixins when tracing code

I was considering that exact refactor before posting this PR. If you think it's worth making the change, then I'll do it.

yes definitely, let's go for it

@zachmargolis I refactored to use DI instead of mixins. There's some lack of clarity about where to put the factories, so for now they live in the job and have their own tests.

zachmargolis · 2023-05-16T19:16:54Z

app/jobs/in_person/enrollments_ready_for_status_check/batch_processor.rb

rescue inside of an ensure is very surprising to me. Could we move the body of this ensure into its own method? that way it can have its own rescue and stuff

The rescue here is very intentional, but I think it's reasonable to move this into another method.

Moved into the new private method process_deletions.

zachmargolis · 2023-05-16T19:18:50Z

app/jobs/in_person/enrollments_ready_for_status_check/enrollment_pipeline.rb

I believe unhandled errors in background jobs already go to NewRelic, do we need to notify directly like this?

We do need additional direct error logging here in order to better capture the context of the error; this is most likely to occur after we have captured the enrollment code. However that part could be isolated to CloudWatch if you think that's particularly important.

dawei-nava · 2023-05-16T19:23:42Z

app/jobs/in_person/enrollments_ready_for_status_check/enrollment_pipeline.rb

here we filter on status, later we update ready_for_status_check, are they the same thing?

No, they currently track different things. status tracks the overall progress of the enrollment creation, checking, and success/failure. ready_for_status_check only covers whether we received a message saying that the enrollment is ready to have its status checked (which is not tracked by status).

The check with USPS may result in a status update if USPS indicates that the enrollment is not present, has expired, or that the user has visited the post office already.

More context here - we are currently checking enrollments that have a pending status. When this and related stories are complete, we will be able to deprioritize (but still eventually check) enrollments that have ready_for_status_check remain as false.

JackRyan1989 · 2023-05-17T14:39:28Z

app/jobs/in_person/enrollments_ready_for_status_check/batch_processor.rb

I'm not getting why the append is happening outside of the above if statement? Based on the comment it feels like the sqs_message should only be appended if process_message returns true?

If process_message returns false, then the enrollment cannot be processed (e.g. invalid JSON, wrong fields, etc.). Keeping it in the queue would unnecessarily slow down the ingestion of other email notifications.

Related to Jack’s question-how does a message meet the condition described in the comment on line 38?

@svalexander The message will meet the condition if an uncaught error occurs while processing the message. One example would be if the database becomes unavailable before we can update the enrollment status.

JackRyan1989 · 2023-05-17T14:40:32Z

app/jobs/in_person/enrollments_ready_for_status_check_job.rb

Is this inclusion necessary, as BatchProcessor already includes this module?

Strictly speaking no, but I consider it important since this class directly uses the poll method.

app/services/analytics_events.rb

svalexander · 2023-05-18T16:43:30Z

app/jobs/in_person/enrollments_ready_for_status_check/batch_processor.rb

Thank you for including these comments, very useful!

svalexander · 2023-05-18T16:49:00Z

app/jobs/in_person/enrollments_ready_for_status_check/batch_processor.rb

Are these items that encountered errors when we attempted to process them?

We're deleting items in the following cases:

Processing succeeds

The item is malformed (e.g. invalid JSON or has wrong fields)

The item does not correspond to a pending enrollment record

If we don't delete the last two, then workers will continue attempting (and failing) to process them.

dawei-nava · 2023-05-18T17:02:48Z

app/jobs/in_person/enrollments_ready_for_status_check/user_analytics_factory.rb

Personally I like factory object. Matt showed me once we have a a lot of analytics events piling together.

svalexander · 2023-05-18T18:55:12Z

config/application.yml.default

Just wondering how we decided on the time limit for this and the next line?

It's normal with SQS to set wait_time_seconds high in order to reduce the number of calls and increase the likelihood that we'll receive items from the SQS on each call. Behind the scenes, SQS is also a distributed queue, which means order is only an approximate and not a strict guarantee (unless we use FIFO) & AWS could potentially be fetching the items from multiple machines.

I'm leaving visibility_timeout_seconds at the default, but it's a parameter that may be worth adjusting if the processing time is extended at some point. This controls when items will become visible to other workers (i.e. essentially for the case of an unreported processing failure).

zachmargolis · 2023-05-18T23:36:08Z

app/jobs/in_person/enrollments_ready_for_status_check/enrollment_pipeline.rb

tap returns the original value, not the block, I think we want a different approach here

1.tap { |x| x + 1 } => 1

Good catch, updated to use then instead. I also updated the test so that it will catch this issue.

JackRyan1989 · 2023-05-23T15:04:08Z

spec/jobs/in_person/enrollments_ready_for_status_check/batch_processor_spec.rb

Aren't the deleted_items and processed_items values in the splatted analytics_hash? Why explicitly list them here? To be explicit?

The default analytics_stats hash has all of the items as zero. The overrides here are primarily intended to represent changes in the items. I have set some of the zero redundantly because it matters for understanding the individual test's behavior.

JackRyan1989 · 2023-05-23T15:12:15Z

spec/jobs/in_person/enrollments_ready_for_status_check/enrollment_pipeline_spec.rb

Do we want to grab this from the config so if it changes there the test will be aware?

This is a unit test; testing the specific configuration is out-of-scope. This is meant to test that the module functions correctly as a unit.

JackRyan1989

LGMT!

Minor non-blocking question: We set a limit of ten messages to be pulled from SQS. The batch processor will handle any length of messages returned from SQS, but I'm wondering if it's worth thinking about setting a limit?

NavaTim · 2023-05-23T16:37:39Z

LGMT!

Minor non-blocking question: We set a limit of ten messages to be pulled from SQS. The batch processor will handle any length of messages returned from SQS, but I'm wondering if it's worth thinking about setting a limit?

@JackRyan1989 I think that the control on this type of behavior should be on the job runtime, if possible, rather than the number of items. This may be a good place to implement or check for existing monitoring on this type of problem.

zachmargolis · 2023-05-23T16:36:08Z

app/jobs/in_person/enrollments_ready_for_status_check/error_reporter.rb

My vote is to remove this direct NewRelic notification, we tend to react to errors in NR as if they are unhandled 500s so seeing errors like this that were caught might make something seem more urgent than it is

This kind of error signals that USPS is sending us garbage data or that AWS has become severely misconfigured. At the current scale it may not be consequential, but it could be extremely consequential at the scales we were preparing for earlier this year.

IOW - if we are getting this at all, then it is very likely to be a problem that needs addressed soon. It doesn't mean we have to stop processing the other queue items, though.

I'm fine keeping the NR notification, but we should reconsider if it turns out not to be useful

app/jobs/in_person/enrollments_ready_for_status_check/sqs_batch_wrapper.rb

zachmargolis · 2023-05-23T16:38:32Z

app/jobs/in_person/enrollments_ready_for_status_check_job.rb

This is just style, I really dislike unless, especially with a compound statement, I think its always clearer as an if

Suggested change

return true unless IdentityConfig.store.in_person_proofing_enabled &&

IdentityConfig.store.in_person_enrollments_ready_job_enabled

if !IdentityConfig.store.in_person_proofing_enabled ||

!IdentityConfig.store.in_person_enrollments_ready_job_enabled

return true

end

I disagree; unless provides a larger visual indication of the conditional behavior than the oft-missed exclamation mark.

maybe the conditions could be pulled into well-named methods, ie:

def in_person_proofing_not_enabled !IdentityConfig.store.in_person_proofing_enabled end

then it could look like this, which imo is a little easier to parse/reason about

if in_person_proofing_not_enabled || in_person_enrollements_ready_job_not_enabled return true end

blank? will return true if a variable is nil or false

return true if IdentityConfig.store.in_person_proofing_enabled.blank? || IdentityConfig.store.in_person_enrollments_ready_job_enabled.blank?

@tomas-nava That sounds like a reasonable compromise.

@Sgtpluck I think locality is preferable here.

zachmargolis · 2023-05-23T16:41:15Z

app/jobs/in_person/enrollments_ready_for_status_check_job.rb

I like using dependency injection, but I think that it seems redundant to pass in the class_name explicitly like this, could the BatchProcessor class have a default implementation that passes its own self.class.name in to the error reporter? Since analytics here is basically just a default/empty instance with no user, I feel like we don't need to worry about passing in the same one necessarily

Ditto for below with EnrollmentPipeline

Default implementations for DI have been a colossal headache for me in the past, so I really prefer to avoid default implementations.

zachmargolis · 2023-05-23T16:45:28Z

spec/jobs/in_person/enrollments_ready_for_status_check/enrollment_pipeline_spec.rb

Since sqs_message is doubling for a plain struct, could we just use the real value type instead?

Suggested change

before(:each) do

allow(sqs_message).to receive(:message_id).

and_return(sqs_message_id)

end

let(:sqs_message) { Aws::SQS::Types::Message.new(message_id: sqs_message_id) }

This builds into the test some assumptions about how the struct is created, so I would prefer to avoid this change.

Is the worry that the class constructor changes and breaks the test, or something else?

@mitchellhenke Yes, roughly speaking. i.e. constructor or factory doing some kind of interpretation on the data passed in rather than setting it to exactly match the original parameters.

There is an additional concern about coupling and misrepresenting parts of the payload that we don't actually use.

zachmargolis · 2023-05-23T16:46:16Z

spec/jobs/in_person/enrollments_ready_for_status_check/enrollment_pipeline_spec.rb

ditto, I think we could simplify the tests by using the actual value type here for sqs_message

Same as above:

This builds into the test some assumptions about how the struct is created, so I would prefer to avoid this change.

spec/jobs/in_person/enrollments_ready_for_status_check/enrollment_pipeline_spec.rb

… (wip)

… into modules

…status updates from SQS

…; update job

…ent_pipeline_spec.rb Co-authored-by: Zach Margolis <zachmargolis@users.noreply.github.com>

…h_wrapper.rb Co-authored-by: Zach Margolis <zachmargolis@users.noreply.github.com>

NavaTim requested review from a team and racingspider May 16, 2023 18:40

zachmargolis reviewed May 16, 2023

View reviewed changes

dawei-nava reviewed May 16, 2023

View reviewed changes

JackRyan1989 reviewed May 17, 2023

View reviewed changes

app/services/analytics_events.rb Outdated Show resolved Hide resolved

NavaTim requested review from JackRyan1989 and zachmargolis May 17, 2023 21:57

svalexander reviewed May 18, 2023

View reviewed changes

dawei-nava reviewed May 18, 2023

View reviewed changes

svalexander reviewed May 18, 2023

View reviewed changes

NavaTim force-pushed the tbradley/lg-8440-in-person-enrollment-sqs-status branch from 23cb016 to 35d6b26 Compare May 18, 2023 21:54

zachmargolis reviewed May 18, 2023

View reviewed changes

NavaTim marked this pull request as ready for review May 19, 2023 22:54

JackRyan1989 reviewed May 23, 2023

View reviewed changes

JackRyan1989 approved these changes May 23, 2023

View reviewed changes

zachmargolis reviewed May 23, 2023

View reviewed changes

NavaTim added 6 commits May 24, 2023 11:21

LG-8440: Install aws-sdk-ruby; create migration for status check field

4386f41

LG-8440: Start writing job to check enrollment status update messages…

e2d7229

… (wip)

LG-8440: Install mail gem; update job to read email sent via SNS/SQS

55e8672

LG-8440: Rename job; improve handling for errors and message deletion

4c79a7d

LG-8440: Continue refining logic around error logging; break up logic…

8b0a44f

… into modules

LG-8440: Split out analytics methods; start writing specs for job

f3da7ef

NavaTim added 19 commits May 24, 2023 11:21

LG-8440: Test and refine error reporting module

10ea163

LG-8440: Continue writing tests; improve docs and error logic

2cec8e4

LG-8440: Continue writing tests

fb3158e

LG-8440: Write test for enrollment pipeline

a40e2db

LG-8440: Fix module imports and default config

18fc6b1

LG-8440: Consolidate SQS client calls; update tests

2cd9be2

LG-8440: Write spec for SQS batch processor

b69a7ba

LG-8440: Format spec file; add method documentation to process_batch

da238a7

LG-8440: Add test for job; add new config value

3026842

LG-8440: Add config keys; rename key; lint fixes

e5ccac3

LG-8440: Update analytics params and documentation

d7a829a

changelog: Internal, In-Person Proofing, Ingest in-person enrollment …

142053a

…status updates from SQS

LG-8440: Extract batch deletion into separate method

19d6c56

LG-8440: Refactor modules to support dependency injection (WIP)

400d935

LG-8440: Move factory methods to job; write tests for factory methods…

c22ad65

…; update job

LG-8440: Fix date logging and test

b2e2fd7

LG-8440: Increase the AWS HTTP read timeout for SQS client

b09dc3e

LG-8440: Update error reporter to use analytics directly

d573b9b

LG-8440: Update to remove analytics factory

853013c

NavaTim force-pushed the tbradley/lg-8440-in-person-enrollment-sqs-status branch from 75255e1 to 853013c Compare May 24, 2023 18:21

NavaTim and others added 4 commits May 24, 2023 11:53

Update spec/jobs/in_person/enrollments_ready_for_status_check/enrollm…

1c7cc98

…ent_pipeline_spec.rb Co-authored-by: Zach Margolis <zachmargolis@users.noreply.github.com>

Update app/jobs/in_person/enrollments_ready_for_status_check/sqs_batc…

f160c10

…h_wrapper.rb Co-authored-by: Zach Margolis <zachmargolis@users.noreply.github.com>

LG-9905: Use blank? instead of unless for flag check

fbe34ee

LG-9905: Fix conditional and update test to catch issue

5697fcf

NavaTim merged commit 3f1177a into main May 25, 2023

NavaTim deleted the tbradley/lg-8440-in-person-enrollment-sqs-status branch May 25, 2023 18:58

JackRyan1989 mentioned this pull request May 25, 2023

LG 8441 Prioritize ready enrollments #8488

Merged

2 tasks

mitchellhenke mentioned this pull request May 30, 2023

Deploy RC 284 to Production #8503

Merged

-      return true unless IdentityConfig.store.in_person_proofing_enabled &&
-                         IdentityConfig.store.in_person_enrollments_ready_job_enabled
+      if !IdentityConfig.store.in_person_proofing_enabled ||
+        !IdentityConfig.store.in_person_enrollments_ready_job_enabled
+        return true
+      end

-    before(:each) do
-      allow(sqs_message).to receive(:message_id).
-        and_return(sqs_message_id)
-    end
+    let(:sqs_message) { Aws::SQS::Types::Message.new(message_id: sqs_message_id) }

Conversation

NavaTim commented May 16, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🎫 Ticket

🛠 Summary of changes

📜 Testing Plan

Uh oh!

zachmargolis May 16, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

NavaTim May 18, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

svalexander May 18, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

NavaTim commented May 16, 2023 •

edited

Loading

zachmargolis May 16, 2023 •

edited

Loading

NavaTim May 18, 2023 •

edited

Loading

svalexander May 18, 2023 •

edited

Loading