fix: retrying jobs are not saved to db when pool is closing #31
Conversation
DNK90 commented Jan 16, 2023 • edited
- Add a waitgroup to wait until all retrying jobs running on goroutines finish storing the job to the db (see the sketch below)
- Force the controller to finish processing all pending jobs before fetching new logs
…berOfRetryingJob` in `PrepareRetryableJob` is enough
…ing to exit and channels are closed
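A minimal sketch of the waitgroup approach described above, using assumed names (`retryWg`, `scheduleRetry`, `saveJobToDB` are hypothetical); the actual pool implementation may differ:

```go
package pool

import "sync"

// JobHandler matches the interface used in pool.go.
type JobHandler interface {
	GetNextTry() int64
}

// Pool is reduced here to the one field relevant to the fix.
type Pool struct {
	retryWg sync.WaitGroup // hypothetical field name
}

// scheduleRetry registers with the waitgroup before the goroutine
// starts, so Close can wait for the job to be stored to the db.
func (p *Pool) scheduleRetry(job JobHandler) {
	p.retryWg.Add(1)
	go func() {
		defer p.retryWg.Done()
		// ... sleep until job.GetNextTry(), then retry or persist ...
		p.saveJobToDB(job) // hypothetical persistence helper
	}()
}

// Close waits until every retrying goroutine has finished storing
// its job before the pool exits and channels are closed.
func (p *Pool) Close() {
	p.retryWg.Wait()
}

func (p *Pool) saveJobToDB(job JobHandler) { /* hypothetical */ }
```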
pool.go
Outdated
dur := time.Until(time.Unix(job.GetNextTry(), 0))
if dur <= 0 {
	return
}
atomic.AddInt32(&p.numberOfRetryingJob, 1)
This `numberOfRetryingJob` seems unused.
`numberOfRetryingJob` can be used for stats.
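For example, a hypothetical accessor (not part of this PR) could expose the counter for stats; it pairs an `atomic.LoadInt32` with the `atomic.AddInt32` in the diff above (assuming `sync/atomic` is imported):

```go
// RetryingJobCount returns how many jobs are currently waiting to
// retry. Hypothetical accessor; since the counter is written with
// atomic.AddInt32, reads must go through atomic.LoadInt32 as well.
func (p *Pool) RetryingJobCount() int32 {
	return atomic.LoadInt32(&p.numberOfRetryingJob)
}
```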
func (p *Pool) updateRetryingJob(job JobHandler) {
	if job == nil {
		return
	}

	p.lock.Lock()
I think this lock is redundant. AFAIK, two writes to the same row in PostgreSQL are serialized, aren't they?
we use this to lock the pool, not the db, so that the pool cannot exit without finishing this function
For the retry job, I think the waitgroup is enough; I'm not sure about the failed-job flow. However, I cannot see how this lock can prevent the pool from exiting.
There can be some goroutines still calling `Enqueue` or `RetryJob`, and a "send on closed channel" panic will happen on both `JobChan` and `RetryJobChan`. At that point the pool may already be closed and the waitgroup for on-the-fly retryable jobs may be all done, so the pool exits without waiting for these events. Taking the lock makes sure the pool cannot close until these events are stored to the db.
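A minimal sketch of this argument, with assumed names (`isClosed`, `JobChan`, `RetryJobChan`, `saveJobToDB` are hypothetical); the real pool code may differ:

```go
// Enqueue takes the pool lock so it cannot race with Close: either
// the send happens before the channels are closed, or the pool is
// already marked closed and the job is persisted directly.
func (p *Pool) Enqueue(job JobHandler) {
	p.lock.Lock()
	defer p.lock.Unlock()
	if p.isClosed { // hypothetical shutdown flag
		p.saveJobToDB(job) // store instead of sending
		return
	}
	p.JobChan <- job // Close cannot run while the lock is held
}

// Close flips the flag and closes the channels under the same lock,
// so no goroutine can hit "send on closed channel" afterwards.
func (p *Pool) Close() {
	p.lock.Lock()
	p.isClosed = true
	close(p.JobChan)
	close(p.RetryJobChan)
	p.lock.Unlock()
}
```

One caveat with this pattern: holding the mutex across the channel send assumes the channel is buffered with spare capacity (or has a ready receiver); otherwise `Enqueue` could block `Close` indefinitely.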