LG-7470 | Very crude start of background job [WIP] #7085
Conversation
```ruby
key = key(timestamp)
redis_pool.with do |client|
  # see client.hscan which refs https://redis.io/commands/scan/
  # but it's... a lil' bit weird.
```
Arguably, this change would be the big win, in allowing us to read and write in batches rather than fetching everything into memory. Will need to play around with this.
I think a good task for tomorrow is to write something quick to bulk-generate events. We're going to want something like that for generating a large sample file for the IRS as well.
My gut feeling is that hgetall won't be a great choice with a huge number of events, and they'll all be loaded in memory. I like the idea of being able to hscan and write them to the file as we fetch them.
It occurred to me tonight that we've talked about doing this "background processing" to fetch all the events, put them in a flat file, and then store that in Redis rather than S3. Is that actually saving us anything over what we have now? All the more reason to generate a ton of events and bang on this.
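To make the hscan idea concrete, here is a rough sketch of streaming events to a file batch-by-batch as the cursor advances, rather than loading the whole hash with hgetall. `FakeRedis` and `dump_events` are hypothetical names used only for illustration; the fake client just mimics HSCAN's cursor contract so the loop shape is visible without a live Redis.

```ruby
require 'tempfile'

# FakeRedis is a hypothetical in-memory stand-in, used here only to
# illustrate the HSCAN cursor loop; a real client's hscan has the same
# shape: pass the cursor back in until it returns "0".
class FakeRedis
  def initialize(hash)
    @hash = hash
  end

  # Returns [next_cursor, [[field, value], ...]], mirroring HSCAN.
  def hscan(_key, cursor, count: 2)
    entries = @hash.to_a
    batch = entries[cursor.to_i, count] || []
    next_cursor = cursor.to_i + count
    next_cursor = 0 if next_cursor >= entries.size
    [next_cursor.to_s, batch]
  end
end

# Stream events to a temp file one batch at a time instead of pulling
# the entire hash into memory with hgetall.
def dump_events(client, key)
  file = Tempfile.new('events')
  cursor = '0'
  loop do
    cursor, batch = client.hscan(key, cursor)
    batch.each { |_field, event| file.puts(event) }
    break if cursor == '0'
  end
  file.close
  file.path
end
```

With a real client the write-as-you-fetch behavior keeps memory flat no matter how many events are in the hour's hash, which is the win over hgetall.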
```ruby
# Get this to run at the early part of the hour

def perform(subject_timestamp)
  puts 'Howdy, partner'
```
Obviously this should not be merged; just an easy way to see when this runs.
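On the "run at the early part of the hour" note: assuming a sidekiq-cron-style YAML schedule (the scheduler, file path, and job class name here are all hypothetical; this repo's setup may differ), the standard cron syntax covers it:

```yaml
# config/schedule.yml (hypothetical path)
# "5 * * * *" fires at five minutes past every hour, leaving the
# previous hour's events settled before the job reads them.
event_file_job:
  cron: "5 * * * *"
  class: "EventFileJob"  # hypothetical class name
```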
```ruby
    file.write event
  end
  file.close
  file.path
```
Commenting in case someone else ends up picking this up.
Right now, this reads all the events for a given hour out of Redis, writes them to a temp file, and then returns the file path (but nothing is looking at the return value, except me in the console). This is not very useful.
We want to figure out where to store this. We've discussed either S3 or Redis. With multiple servers behind a load balancer, and instances periodically recycled, we can't rely on saving the file locally.
We also want to apply encryption and gzip to this file. See the fetch_events rake task, at least for the encryption bit.
The other bit of work on this is, once all that's working, change the endpoint to return that file, wherever it's stored, rather than generating it on the fly.
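For the gzip half of that, a minimal stdlib sketch (the encryption step and the eventual S3/Redis upload are left out, and `gzip_events` is our name, not anything in the codebase):

```ruby
require 'tempfile'
require 'zlib'

# Write events into a gzipped temp file. The encryption wrapper (see the
# fetch_events rake task) would apply on top of this, and the resulting
# file would be pushed to S3 or Redis rather than kept on local disk,
# since instances behind the load balancer get recycled.
def gzip_events(events)
  file = Tempfile.new(['events', '.gz'])
  Zlib::GzipWriter.open(file.path) do |gz|
    events.each { |event| gz.puts(event) }
  end
  file.path
end
```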
Can/should we close this in favor of #7259?
I think so. Matt is aware that I took this on.
🎫 Ticket
LG-7470 TK
🛠 Summary of changes
This is currently so barebones as to be useless, but I wanted to get something up.
📜 Testing Plan
Provide a checklist of steps to confirm the changes.
👀 Screenshots
If relevant, include a screenshot or screen capture of the changes.
Before:
After:
🚀 Notes for Deployment
Include any special instructions for deployment.