-
Notifications
You must be signed in to change notification settings - Fork 897
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
- Loading branch information
Showing
4 changed files
with
308 additions
and
0 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,27 @@ | ||
// Copyright 2015 The Chromium Authors. All rights reserved. | ||
// | ||
// Redistribution and use in source and binary forms, with or without | ||
// modification, are permitted provided that the following conditions are | ||
// met: | ||
// | ||
// * Redistributions of source code must retain the above copyright | ||
// notice, this list of conditions and the following disclaimer. | ||
// * Redistributions in binary form must reproduce the above | ||
// copyright notice, this list of conditions and the following disclaimer | ||
// in the documentation and/or other materials provided with the | ||
// distribution. | ||
// * Neither the name of Google Inc. nor the names of its | ||
// contributors may be used to endorse or promote products derived from | ||
// this software without specific prior written permission. | ||
// | ||
// THIS SOFTWARE IS PROVIDED BY THE COPYRIGHT HOLDERS AND CONTRIBUTORS | ||
// "AS IS" AND ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT | ||
// LIMITED TO, THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR | ||
// A PARTICULAR PURPOSE ARE DISCLAIMED. IN NO EVENT SHALL THE COPYRIGHT | ||
// OWNER OR CONTRIBUTORS BE LIABLE FOR ANY DIRECT, INDIRECT, INCIDENTAL, | ||
// SPECIAL, EXEMPLARY, OR CONSEQUENTIAL DAMAGES (INCLUDING, BUT NOT | ||
// LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES; LOSS OF USE, | ||
// DATA, OR PROFITS; OR BUSINESS INTERRUPTION) HOWEVER CAUSED AND ON ANY | ||
// THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY, OR TORT | ||
// (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE | ||
// OF THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE. |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,164 @@ | ||
# NaïveProxy ![build workflow](https://github.com/klzgrad/naiveproxy/actions/workflows/build.yml/badge.svg) | ||
|
||
NaïveProxy uses Chromium's network stack to camouflage traffic with strong censorship resistence and low detectablility. Reusing Chrome's stack also ensures best practices in performance and security. | ||
|
||
The following traffic attacks are mitigated by using Chromium's network stack: | ||
|
||
* Website fingerprinting / traffic classification: [mitigated](https://arxiv.org/abs/1707.00641) by traffic multiplexing in HTTP/2. | ||
* [TLS parameter fingerprinting](https://arxiv.org/abs/1607.01639): defeated by reusing [Chrome's network stack](https://www.chromium.org/developers/design-documents/network-stack). | ||
* [Active probing](https://ensa.fi/active-probing/): defeated by *application fronting*, i.e. hiding proxy servers behind a commonly used frontend server with application-layer routing. | ||
* Length-based traffic analysis: mitigated by length padding. | ||
|
||
## Architecture | ||
|
||
[Browser → Naïve client] ⟶ Censor ⟶ [Frontend → Naïve server] ⟶ Internet | ||
|
||
NaïveProxy uses Chromium's network stack to parrot traffic between regular Chrome browsers and standard frontend servers. | ||
|
||
The frontend server can be any well-known reverse proxy that is able to route HTTP/2 traffic based on HTTP authorization headers, preventing active probing of proxy existence. Known ones include Caddy with its forwardproxy plugin and HAProxy. | ||
|
||
The Naïve server here works as a forward proxy and a packet length padding layer. Caddy forwardproxy is also a forward proxy but it lacks a padding layer. A [fork](https://github.com/klzgrad/forwardproxy) adds the NaïveProxy padding layer to forwardproxy, combining both in one. | ||
|
||
## Download NaïveProxy | ||
|
||
Download [here](https://github.com/klzgrad/naiveproxy/releases/latest). Supported platforms include: Windows, Android (with [Exclave](https://github.com/dyhkwong/Exclave), [NekoBox](https://github.com/MatsuriDayo/NekoBoxForAndroid)), Linux, Mac OS, and OpenWrt ([support status](https://github.com/klzgrad/naiveproxy/wiki/OpenWrt-Support)). | ||
|
||
Users should always use the latest version to keep signatures identical to Chrome. | ||
|
||
Build from source: Please see [.github/workflows/build.yml](https://github.com/klzgrad/naiveproxy/blob/master/.github/workflows/build.yml). | ||
|
||
## Server setup | ||
|
||
The following describes the naïve fork of Caddy forwardproxy setup. | ||
|
||
Download [here](https://github.com/klzgrad/forwardproxy/releases/latest) or build from source: | ||
```sh | ||
go install github.com/caddyserver/xcaddy/cmd/xcaddy@latest | ||
~/go/bin/xcaddy build --with github.com/caddyserver/forwardproxy=github.com/klzgrad/forwardproxy@naive | ||
``` | ||
|
||
Example Caddyfile (replace `user` and `pass` accordingly): | ||
``` | ||
{ | ||
order forward_proxy before file_server | ||
} | ||
:443, example.com { | ||
tls [email protected] | ||
forward_proxy { | ||
basic_auth user pass | ||
hide_ip | ||
hide_via | ||
probe_resistance | ||
} | ||
file_server { | ||
root /var/www/html | ||
} | ||
} | ||
``` | ||
`:443` must appear first for this Caddyfile to work. See Caddyfile [docs](https://caddyserver.com/docs/caddyfile/directives/tls) for customizing TLS certificates. For more advanced usage consider using [JSON for Caddy 2's config](https://caddyserver.com/docs/json/). | ||
|
||
Run with the Caddyfile: | ||
``` | ||
sudo setcap cap_net_bind_service=+ep ./caddy | ||
./caddy start | ||
``` | ||
|
||
See also [Systemd unit example](https://github.com/klzgrad/naiveproxy/wiki/Run-Caddy-as-a-daemon) and [HAProxy setup](https://github.com/klzgrad/naiveproxy/wiki/HAProxy-Setup). | ||
|
||
## Client setup | ||
|
||
Run `./naive` with the following `config.json` to get a SOCKS5 proxy at local port 1080. | ||
```json | ||
{ | ||
"listen": "socks://127.0.0.1:1080", | ||
"proxy": "https://user:[email protected]" | ||
} | ||
``` | ||
|
||
Or `quic://user:[email protected]`, if it works better. See also [parameter usage](https://github.com/klzgrad/naiveproxy/blob/master/USAGE.txt) and [performance tuning](https://github.com/klzgrad/naiveproxy/wiki/Performance-Tuning). | ||
|
||
## Third-party integration | ||
|
||
* [v2rayN](https://github.com/2dust/v2rayN), GUI client, Windows | ||
* [NekoBox for Android](https://github.com/MatsuriDayo/NekoBoxForAndroid), Proxy toolchain, Android | ||
* [NekoRay / NekoBox For PC](https://github.com/MatsuriDayo/nekoray), Qt based GUI, Windows, Linux | ||
* [Yet Another Shadow Socket](https://github.com/Chilledheart/yass), NaïveProxy-compatible forward proxy, Android, iOS, Windows, macOS, Linux, FreeBSD | ||
|
||
## Notes for downstream | ||
|
||
Do not use the master branch to track updates, as it rebases from a new root commit for every new Chrome release. Use stable releases and the associated tags to track new versions, where short release notes are also provided. | ||
|
||
## Padding protocol, an informal specification | ||
|
||
The design of this padding protocol opts for low overhead and easier implementation, in the belief that proliferation of expendable, improvised circumvention protocol designs is a better logistical impediment to censorship research than sophisicated designs. | ||
|
||
### Proxy payload padding | ||
|
||
NaïveProxy proxies bidirectional streams through HTTP/2 (or HTTP/3) CONNECT tunnels. The bidirectional streams operate in a sequence of reads and writes of data. The first `kFirstPaddings` (8) reads and writes in a bidirectional stream after the stream is established are padded in this format: | ||
```c | ||
struct PaddedData { | ||
uint8_t original_data_size_high; // original_data_size / 256 | ||
uint8_t original_data_size_low; // original_data_size % 256 | ||
uint8_t padding_size; | ||
uint8_t original_data[original_data_size]; | ||
uint8_t zeros[padding_size]; | ||
}; | ||
``` | ||
`padding_size` is a random integer uniformally distributed in [0, `kMaxPaddingSize`] (`kMaxPaddingSize`: 255). `original_data_size` cannot be greater than 65535, or it has to be split into several reads or writes. | ||
|
||
`kFirstPaddings` is chosen to be 8 to flatten the packet length distribution spikes formed from common initial handshakes: | ||
- Common client initial sequence: 1. TLS ClientHello; 2. TLS ChangeCipherSpec, Finished; 3. H2 Magic, SETTINGS, WINDOW_UPDATE; 4. H2 HEADERS GET; 5. H2 SETTINGS ACK. | ||
- Common server initial sequence: 1. TLS ServerHello, ChangeCipherSpec, ...; 2. TLS Certificate, ...; 3. H2 SETTINGS; 4. H2 WINDOW_UPDATE; 5. H2 SETTINGS ACK; 6. H2 HEADERS 200 OK. | ||
|
||
Further reads and writes after `kFirstPaddings` are unpadded to avoid performance overhead. Also later packet lengths are usually considered less informative. | ||
|
||
### H2 RST_STREAM frame padding | ||
|
||
In experiments, NaïveProxy tends to send too many RST_STREAM frames per session, an uncommon behavior from regular browsers. To solve this, an END_STREAM DATA frame padded with total length distributed in [48, 72] is prepended to the RST_STREAM frame so it looks like a HEADERS frame. The server often replies to this with a WINDOW_UPDATE because padding is accounted in flow control. Whether this results in a new uncommon behavior is still unclear. | ||
|
||
### H2 HEADERS frame padding | ||
|
||
The CONNECT request and response frames are too short and too uncommon. To make its length similar to realistic HEADERS frames, a `padding` header is filled with a sequence of symbols that are not Huffman coded and are pseudo-random enough to avoid being indexed. The length of the padding sequence is randomly distributed in [16, 32] (request) or [30, 62] (response). | ||
|
||
### Opt-in of padding protocol | ||
|
||
NaïveProxy clients should interoperate with any regular HTTP/2 proxies unaware of this padding protocol. NaïveProxy servers (i.e. any proxy server capable of the this padding protocol) should interoperate with any regular HTTP/2 clients (e.g. regular browsers) unaware of this padding protocol. | ||
|
||
NaïveProxy servers and clients determines whether the counterpart is capable of this padding protocol by the presence of the `padding` header in the CONNECT request and response respectively. The padding procotol is enabled only if the `padding` header exists. | ||
|
||
The first CONNECT request to a server cannot use "Fast Open" to send payload before response, because the server's padding capability has not been determined from the first response and it's unknown whether to send padded or unpadded payload for Fast Open. | ||
|
||
## Changes from Chromium upstream | ||
|
||
- Minimize source code and build size (1% of the original) | ||
- Disable exceptions and RTTI, except on Mac and Android. | ||
- Support OpenWrt builds | ||
- (Android, Linux) Use the builtin verifier instead of the system verifier (drop dependency of NSS on Linux) and read the system trust store from (following Go's behavior in crypto/x509/root_unix.go and crypto/x509/root_linux.go): | ||
- The file in environment variable SSL_CERT_FILE | ||
- The first available file of | ||
- /etc/ssl/certs/ca-certificates.crt (Debian/Ubuntu/Gentoo etc.) | ||
- /etc/pki/tls/certs/ca-bundle.crt (Fedora/RHEL 6) | ||
- /etc/ssl/ca-bundle.pem (OpenSUSE) | ||
- /etc/pki/tls/cacert.pem (OpenELEC) | ||
- /etc/pki/ca-trust/extracted/pem/tls-ca-bundle.pem (CentOS/RHEL 7) | ||
- /etc/ssl/cert.pem (Alpine Linux) | ||
- Files in the directory of environment variable SSL_CERT_DIR | ||
- Files in the first available directory of | ||
- /etc/ssl/certs (SLES10/SLES11, https://golang.org/issue/12139) | ||
- /etc/pki/tls/certs (Fedora/RHEL) | ||
- /system/etc/security/cacerts (Android) | ||
- Handle AIA response in PKCS#7 format | ||
- Allow higher socket limits for proxies | ||
- Force tunneling for all sockets | ||
- Support HTTP/2 and HTTP/3 CONNECT tunnel Fast Open using the `fastopen` header | ||
- Pad RST_STREAM frames | ||
|
||
## Known weaknesses | ||
|
||
* HTTP CONNECT Fast Open creates back to back h2 packets consistently, which should not appear so often. This could be fixed with a little bit of corking but it would require surgical change deep in Chromium h2 stack, not very easy to do. | ||
* TLS over TLS requires more handshake round trips than needed by common h2 requests, that is, no h2 requests need these many back and forth handshakes. There is no simple way to avoid this besides doing MITM proxying, breaking E2E encryption. | ||
* TLS over TLS overhead causes visible packet length enlargement and lack of small packets. Removing this overhead also requires MITM proxying. | ||
* TLS over TLS overhead also causes packets to consistently exceed MTU limits, which should not happen for an originating user agent. Fixing this requires re-segmentation and it is not easy to do. | ||
* Packet length obfuscation partly relies on h2 multiplexing, which does not work if there is only one connection, a scenario not uncommon. It is not clear how to create covering co-connections organically (i.e. not hard coded). | ||
* Multiplexing requires use of a few long-lived tunnel connections. It is not clearly how long is appropriate for parroting and how to convincingly rotate the connections if there is an age limit or how to detect and recover stuck tunnel connections convincingly. | ||
|
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,112 @@ | ||
Usage: naive --listen=... --proxy=... | ||
naive [/path/to/config.json] | ||
|
||
Description: | ||
|
||
naive is a proxy that transports traffic in Chromium's pattern. | ||
It works as both a proxy client and a proxy server or together. | ||
|
||
Options in the form of `naive --listen=... --proxy=...` can also be | ||
specified using a JSON file: | ||
|
||
{ | ||
"listen": "...", | ||
"proxy": "..." | ||
} | ||
|
||
Specifying a flag multiple times on the command line is equivalent to | ||
having an array of multiple strings in the JSON file. | ||
|
||
Uses "config.json" by default if run without arguments. | ||
|
||
Options: | ||
|
||
-h, --help | ||
|
||
Shows help message. | ||
|
||
--version | ||
|
||
Prints version. | ||
|
||
--listen=LISTEN-URI | ||
|
||
LISTEN-URI = <LISTEN-PROTO>"://"[<USER>":"<PASS>"@"][<ADDR>][":"<PORT>] | ||
LISTEN-PROTO = "socks" | "http" | "redir" | ||
|
||
Listens at addr:port with protocol <LISTEN-PROTO>. | ||
Can be specified multiple times to listen on multiple ports. | ||
Default proto, addr, port: socks, 0.0.0.0, 1080. | ||
|
||
Note: redir requires specific iptables rules and uses no authentication. | ||
|
||
(Redirecting locally originated traffic) | ||
iptables -t nat -A OUTPUT -d $proxy_server_ip -j RETURN | ||
iptables -t nat -A OUTPUT -p tcp -j REDIRECT --to-ports 1080 | ||
|
||
(Redirecting forwarded traffic on a router) | ||
iptables -t nat -A PREROUTING -p tcp -j REDIRECT --to-ports 1080 | ||
|
||
Also activates a DNS resolver on the same UDP port. Similar iptables | ||
rules can redirect DNS queries to this resolver. The resolver returns | ||
artificial addresses that are translated back to the original domain | ||
names in proxy requests and then resolved remotely. | ||
|
||
The artificial results are not saved for privacy, so restarting the | ||
resolver may cause downstream to cache stale results. | ||
|
||
--proxy=PROXY | ||
|
||
PROXY = PROXY-CHAIN | SOCKS-PROXY | ||
PROXY-CHAIN = <PROXY-URI>[","<PROXY-CHAIN>] | ||
PROXY-URI = <PROXY-PROTO>"://"[<USER>":"<PASS>"@"]<HOSTNAME>[":"<PORT>] | ||
PROXY-PROTO = "http" | "https" | "quic" | ||
SOCKS-PROXY = "socks://"<HOSTNAME>[":"<PORT>] | ||
|
||
Routes traffic via the proxy chain. | ||
The default proxy is directly connection without proxying. | ||
The last PROXY-URI is negotiated automatically for Naive padding. | ||
Limitations: | ||
* QUIC proxies cannot follow TCP-based proxies in a proxy chain. | ||
* The user needs to ensure there is no loop in the proxy chain. | ||
* SOCKS proxies do not support chaining, authentication, or Naive padding. | ||
|
||
--insecure-concurrency=<N> | ||
|
||
Use N concurrent tunnel connections to be more robust under bad network | ||
conditions. More connections make the tunneling easier to detect and less | ||
secure. This project strives for the strongest security against traffic | ||
analysis. Using it in an insecure way defeats its purpose. | ||
|
||
If you must use this, try N=2 first to see if it solves your issues. | ||
Strongly recommend against using more than 4 connections here. | ||
|
||
--extra-headers=... | ||
|
||
Appends extra headers in requests to the proxy server. | ||
Multiple headers are separated by CRLF. | ||
|
||
--host-resolver-rules="MAP proxy.example.com 1.2.3.4" | ||
|
||
Statically resolves a domain name to an IP address. | ||
|
||
--resolver-range=CIDR | ||
|
||
Uses this range in the builtin resolver. Default: 100.64.0.0/10. | ||
|
||
--log=[<path>] | ||
|
||
Saves log to the file at <path>. If path is empty, prints to | ||
console. No log is saved or printed by default for privacy. | ||
|
||
--log-net-log=<path> | ||
|
||
Saves NetLog. View at https://netlog-viewer.appspot.com/. | ||
|
||
--ssl-key-log-file=<path> | ||
|
||
Saves SSL keys for Wireshark inspection. | ||
|
||
--no-post-quantum | ||
|
||
Overrides the default and disables post-quantum key agreement. |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,5 @@ | ||
{ | ||
"listen": ["socks://127.0.0.1:1080", "http://127.0.0.1:8080"], | ||
"proxy": "https://user:[email protected]", | ||
"log": "" | ||
} |