Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

WARC writer support HTTP/2 #30

Merged
merged 4 commits into from
Jul 27, 2024
Merged

WARC writer support HTTP/2 #30

merged 4 commits into from
Jul 27, 2024

Conversation

sebastian-nagel
Copy link

(implements #29, includes NUTCH-3062)

sebastian-nagel and others added 4 commits July 17, 2024 16:08
Adds property store.protocol.versions - if true, protocol-okhttp records
protocol versions and related in response metadata.
Activate HTTPHeadersInterceptor also if only store.protocol.versions is
true, but no other headers and connection information is intercepted.
In combination with NUTCH-3062, if store.protocol.versions is true:
adds two counters "HttpProtocolVersion" and "TlsProtocolVersion"
counting the values stored by protocol-okhttp in the protocol versions
field of content metadata. Counted values are the protocol versions.
- HTTP headers: replace HTTP/2 and alike by HTTP/1.1 to
  ensure backward-compatibility for WARC readers, see
   iipc/warc-specifications#15
- store protocol versions and cipher suites in WARC headers
  WARC-Protocol and WARC-Cipher-Suite, see
   iipc/warc-specifications#42
   iipc/warc-specifications#86
- allow multiple WARC headers of the same name (WARC-Protocol
  may occur twice to hold the HTTP and TLS version)
@sebastian-nagel sebastian-nagel merged commit 860f269 into cc Jul 27, 2024
@sebastian-nagel sebastian-nagel mentioned this pull request Jul 27, 2024
7 tasks
@sebastian-nagel sebastian-nagel deleted the cc-http2 branch July 27, 2024 15:06
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant