Extend ledger request timeout #1390

Merged
tsachiherman merged 11 commits into algorand:master from tsachiherman:tsachi/extend_ledger_request_write_timeout on Aug 13, 2020

Conversation

@tsachiherman
Contributor

tsachiherman commented on Aug 12, 2020

Summary

The recent increase in ledger size requires the server-side handler to allow longer timeouts.
This turned out to be a bigger issue than I was originally hoping to deal with: the relevant timeout configuration is the "WriteTimeout", which is defined once, when you start the HTTP server.

That, naturally, doesn't give us the resolution needed to associate a specific request handler with a given timeout. To achieve that, I extended the request tracker so that it keeps track of all incoming connections, even across HTTP requests (i.e., requests might use HTTP/1.1 and re-use the connection).

From that point it was quite straightforward: expose the entry point from the GossipNode interface and update the timeout in the handler.

Test Plan

Tested manually.

Comment thread: rpcs/ledgerService.go

```go
		return
	}
	if conn := ls.net.GetHTTPRequestConnection(request); conn != nil {
		conn.SetWriteDeadline(time.Now().Add(maxCatchpointFileWritingDuration))
```
Contributor Author


Ideally, we would figure out what the file size is and have a custom timeout per file. For now, I'm leaving it at a higher max value, as I don't want to go down the rabbit hole of retrieving the file size.

Contributor

@algonautshant left a comment


Review in progress....
Some comments.

Comment thread: network/wsNetwork.go (outdated)
Comment thread: rpcs/ledgerService.go (outdated)
Comment thread: network/requestTracker.go

```diff
 	remoteHost string
 	requests []*TrackerRequest // this is an ordered list, according to the requestsHistory.created
+	additionalHostRequests map[*TrackerRequest]struct{} // additional requests that aren't included in the "requests", and always assumed to be "alive".
```
Contributor


The name prefix "additional" is not informative. Can you please add more information, and preferably rename it?
Also, what will the map point to? What will the struct{} be?

Contributor Author


The purpose of the map is to track the count of entries for that tracker, while providing an easy (and efficient) way to delete them.
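A small sketch of the set-as-map idiom being discussed: a map whose value type is the empty `struct{}` stores no payload per entry, so it behaves as a set with O(1) insert, delete, and count. The `trackerRequest` type below is a simplified stand-in for the real one.

```go
package main

import "fmt"

// trackerRequest is an illustrative stand-in for network.TrackerRequest.
type trackerRequest struct{ remoteHost string }

func main() {
	// map[*trackerRequest]struct{} acts as a set of request pointers:
	// struct{} occupies no space, and delete() removes an entry in O(1).
	additional := make(map[*trackerRequest]struct{})

	a := &trackerRequest{remoteHost: "10.0.0.1"}
	b := &trackerRequest{remoteHost: "10.0.0.2"}

	additional[a] = struct{}{} // insert
	additional[b] = struct{}{}
	fmt.Println(len(additional)) // count of tracked entries: 2

	delete(additional, a) // O(1) removal by pointer
	fmt.Println(len(additional)) // 1
}
```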

@tsachizehub tsachizehub added this to the Sprint 7 milestone Aug 13, 2020
Comment thread: network/requestTracker.go

```diff
 func (ard *hostIncomingRequests) countConnections(rateLimitingWindowStartTime time.Time) (count uint) {
 	i := ard.findTimestampIndex(rateLimitingWindowStartTime)
-	return uint(len(ard.requests) - i)
+	return uint(len(ard.requests) - i + len(ard.additionalHostRequests))
```
Contributor


We are not checking if the connections in additionalHostRequests are after rateLimitingWindowStartTime.
What is the catch? Maybe the comment should change?

Contributor Author


All the connections in ard.additionalHostRequests are ones that have already reached the HTTP handler.
As such, they get deleted only when connection.Close() is called.
The last-30-seconds window tracking doesn't apply to them, since they might be long-lived connections.

The purpose of the 30-second window was to track incoming connections that never made it to the HTTP handler. Once they did, we don't really care about their "window".

@brianolson
Contributor

I'm rusty looking at network code and this is extra twisty because of the request tracker, but I didn't see anything wrong with it.

Comment thread: network/requestTracker.go

```diff
 	}

-	trackerRequest := makeTrackerRequest(conn.RemoteAddr().String(), "", "", time.Now())
+	trackerRequest := makeTrackerRequest(conn.RemoteAddr().String(), "", "", time.Now(), conn)
```
Contributor


conn is being passed here, but isn't this nil at this point?
It gets initialized further down:

```go
conn = &requestTrackedConnection{Conn: conn, tracker: rt}
```

Contributor


Never mind. I missed the initialization above.

Contributor

@algonautshant left a comment


Looks good.
Thanks for the explanations.

@tsachiherman tsachiherman merged commit d8e3983 into algorand:master Aug 13, 2020
@tsachiherman tsachiherman deleted the tsachi/extend_ledger_request_write_timeout branch August 13, 2020 21:29
tsachiherman added a commit to tsachiherman/go-algorand that referenced this pull request Jul 7, 2021
Increase write timeout on ledger requests.
cce added a commit to cce/go-algorand that referenced this pull request Jun 28, 2024