Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

ToDo: followsups / extras / ideas #3 #169

Open
28 of 52 tasks
Thorin-Oakenpants opened this issue May 21, 2022 · 8 comments
Open
28 of 52 tasks

ToDo: followsups / extras / ideas #3 #169

Thorin-Oakenpants opened this issue May 21, 2022 · 8 comments

Comments

@Thorin-Oakenpants
Copy link
Contributor

Thorin-Oakenpants commented May 21, 2022

🔹 Pages I use to keep tabs on a few things


🟥 ToDo

🟩 perf

🟩 isVars

  • harden isFF with items from isEngine, and thus isEngine for gecko/goanna
  • replace isEngine math (for non-gecko/goanna)
  • harden isOS
  • harden navKeys with knownGood, knownPoison from isEngine

🟩 general: fingerprints, sections, lies

  • general: add overlay popup to show results for each section and overall
  • general: add a user-initiated i.e [click here]'s global FP
    • reset each gRun
    • update or add each metric when a metric is run
  • handle passing objects as well as arrays
  • don't use .join()
    • don't use .join() on arrays for all sDetail data (where we can now use objects or arrays)
    • don't use .join() anywhere else unless we have to (or it is safe)
  • preserve typeof where possible
    * e.g. screen metric width/height is a joined string but we already test for NaN
  • general: ongoing
    • knownLies (from mathematical proofs etc)
    • apply prototypeLies to metrics
    • logic -> check knownLies -> if no lies then check prototypeLie -> into the section hash data -> into overall hash data
  • general: ongoing
    • robust error handling, detection of script blockage: e.g. screen.js needs work

🟦 screen

  • screen: vw + wh units, visualviewport etc see expand screen methods #29
  • screen: android 73+: see 1514429:
  • screen: android: investigate a history of changes as soon as possible
    • e.g global.js is where I grab initial viewport and window sizes

🟦 ua

  • ua: refactor
    • always return 11 results
    • relax oscpu for specific linux distro strings
    • harden oscpu: win7 vs win8/10 via tiny font check
    • harden userAgent from the other parts: i.e userAgent is constructed using a set method
    • bypass more parts of the userAgent than just version where possible
    • only bypass the six "hardcoded" parts - we always detect without proxyLies
    • don't try to bypass the variable FF strings because that may indeed be the string, e.g forks, special user builds, useragent-override pref(s) ... only detect untrustworthy via proxyLies

🟦 feature detection

  • feature detection: new math test for android if isVer is FF68+
    • moved to misc, split into RFP vs other, extended tests to cover android entropy

🟦 language

🟦 storage

  • storage: auto-output estimate as it doesn't need permission
  • storage: finish the tests

🟦 devices

🟦 svg

  • svg: such as
    • ExtentOfChar
    • SVGTextContentElement.prototype.getComputedTextLength
    • SVGPathElement.prototype.getTotalLength
    • SVGGeometryElement.getPointAtLength()

🟦 canvas

🟦 webgl

  • webgl: add properties/parameters: split into 1, 2, experimental groups and subgroups e.g. affected by RFP/prefs
  • webgl: add image tests
    • make sure to expand/test on error entropy for readPixels
    • e.g. in VM's which usually block the graphics driver and the error messages can differ
    • make sure error entropy is handled for TB slider and VMs

🟦 audio

🟦 fonts

  • fonts: add domrect methods (bounding, bounding range, client, client range)
    • and use this (if trustworthy) for sizes (more precision)
  • fonts: use transform:scale ? also see font followups #167
  • fonts: add more enumeration methods (maybe via [click here])
    • font face
    • textmetrics
    • svg
  • update win fonts - https://docs.microsoft.com/en-us/typography/fonts/windows_11_font_list
  • update mac fonts
    • now 10.12 is the minimum in ESR91+ we can clean out some old additional fonts
    • add additional big sur (which is rather large)
  • system-ui 1226042
  • unicode support + FP based on code points assigned per script/version
  • emoji support + FP
  • textMetrics: randomness
  • fonts: graphite: use base64

🟦 media

🟦 css

  • css: prefers-contrast notation - tests, article
  • css: detect changes to expected values (e.g. dark readers)
    • e.g. window.getComputedStyle(document.getElementsByTagName("body")[0]).backgroundColor

🟦 elements

🟦 misc


🟫 BACKBURNER

  • canvas: add iframe tests (linked to CB's test page instead for now)
    • we can deal with iframes way down the track, along with workers
  • devices: use precision timing bypasses to estimate cores
  • fonts: fontfallback: investigate further
    • when document fonts are blocked, the first run misses fonts it picks up on subsequent tests: currently mitigated by doing a 2-font "priming" run: but this is not an elegant solution
  • fonts: can we do anything with zalgo
  • general: make worker fingerprint global
    • waste of time until we start adding lots of worker tests
    • currently there is only the ua and language sections, results are not used anywhere
    • instead all worker data should be a separate global FP which we use after the fact to report lies
      • i.e we still output a sub-section worker hash (and diffs to doc), but report lies/mismatches in global
  • general: make iframes fingerprint global (same deal as workers above)
    • waste of time until we start adding lots of iframe tests
  • language: injecting resource:// css files for app lang leaks
  • other: WebRTC
  • other: isAdBlocking [also: hash of results]
  • other: sanitizing / zombie storage - prototype
  • other: RFP timing tests
  • screen: android: improve keyboard PoC to use setInterval to detect a static change
  • screen: android: exploit font inflation
  • screen: android: exploit position (top/bottom) of toolbar in Fenix [yes, I can detect this]
  • screen: mod/add border-snapping subPixel devicePixelRatio PoC to use clientrect

...

@Thorin-Oakenpants
Copy link
Contributor Author

Thorin-Oakenpants commented Aug 25, 2022

interesting perf

Of course nothing really happens in TZP until we receive all the JS files, at which point we start timing, but makes you wonder if trying to get most JS files under 14kb speeds up the initial loading - but wait, that's compressed .. check your network console

meh

of course I expect creepy to simply wrap it's entire code into < 14kb, because, nerd!

@abrahamjuliot
Copy link
Collaborator

I'm currently at 140kb and CSS is 157kb 😭.

@Thorin-Oakenpants
Copy link
Contributor Author

@abrahamjuliot OMG - https://bugzilla.mozilla.org/show_bug.cgi?id=1519167#c117 .. anba had to stop at part 99

@Thorin-Oakenpants
Copy link
Contributor Author

well I got my 11.5 yr old PC down to ~210ms (including adding the webgl at around 15ms)

anyway, here's my new rig, current code

  • this is running as file:// and then hitting F5 to eliminate noise and get a stable and best case
    suckitbitches

@abrahamjuliot ... I think I can get down to 50ms . what do you think?

@abrahamjuliot
Copy link
Collaborator

Nice. Do we lose a lot of entropy with shorter font lists? Maybe that can be cut by half.

@Thorin-Oakenpants
Copy link
Contributor Author

So I just removed a lot of fonts in windows, will check mac later on. I have a win11 desktop rig, with 182/186 fonts tested (everything MS provides incl win11, including all supplemental, and downloads such as 'Cascadia Code','Cascadia Mono') - I do not have 'Arial Unicode MS','MS Reference Specialty','MS Outlook' (which are installed with office or other MS products, IDK about office365) .. not looked at the new office font .. and I don;t have 'AlternateGothic2 BT' which is in the RFP font vis level 1 for some reason (can't be assed looking up to re-read why jonathan added it, but it's not a MS font AFAICT, but it is in kBaseFonts). The 187th font is a poison pill

So .. I fixed some names up, and it's all peachy.

I had to remove anything that is a "variant" .. black, cond, light, demi, semi, whatever, ultra, etc - because they are not stable. I have an issue open for font variants. Remember when you said you had windows 10/11 sometimes detect Arial Black and sometimes now, or Arial Narrow .. whatever. Well, this seems to be due to some change in windows and/or Firefox. I believe it's due to the OS as FF is stable as fuck on my old machine.

unstable is like this

  • start new browser session, first load - it will detect some fonts as their namesake, e.g. it will detect Corbel Light because Corbel is used, and it will record the size as Corbel's, when Corbel Light is actually different. Or it will not detect a font.
  • do a refresh and then it becomes stable with all fonts detected and with the correct sizes, i.e actually using the font

So I surmise this has to do with font-weight, and I think all I need to do is, in a new test or wrapped into this one, is break the fonts lists into weights and loop weights, list, font (and record base the same)

Anyway, so fonts dropped from 240 to 190 in windows. But I added an extra char (a tofu) in the test string. The shorter the test string the faster the test (less glyphs to get per font). So perf didn't change much. The font string used in the test doesn't increase detection of fonts, but it increases the bumber of sizes collected

	if ("windows" === os) {
		// Mō = 124 +"á" = 125 +"Ω" = 127 (win7)
		// Mō - 141 +"á" = 142 +"Ω" = 144 | Mō - 141 +tofu = 154 | (win11: have 182/186 fonts
		let tofu = get_fntCodes("tofu")
		fntString = isTB ? "?-"+ tofu : "Mō"+ tofu

^ that's gave me 141 sizes for 182 fonts, but +tofu gave me 154 sizes for 182 fonts

now I have a mac, I'll be doing all that for mac. Linux and android font lists don't have any "variants"

I think fonts sizes may provide entropy (sort of an equivalency of system scaling, subpixels, cleartype, font version maybe), and I test them all because besides the core group they're all supplemental and you can't really leave any out - although I have, see code for why (assumptions, e.g if you have Segoe UI Variable Display then you're going to have Segoe UI Variable Small and Segoe UI Variable Text and they are the same size, etc

With variants, there are so many (I and I will not be testing styles, e.g. italic), I may have to be more selective. But overall, it's up to me to cover as much as possible, or all of it .. so that when we do collect some data, we can see where the entropy is coming from. At least in FF116+ it's limited to Win10+ and macOS 10.16+ (or whatever it was)

@Thorin-Oakenpants
Copy link
Contributor Author

perf

                   tofu: fonts:    3 ms |   50 ms | 5/21
         unicode glyphs: fonts:   21 ms |   71 ms
              fontsizes: fonts:   21 ms |   93 ms

Looks like it takes about 10 fonts per ms (there's a few ms in that 21 being used for base font sizes, and re-arranging the data into an object etc) .. so 18 or 19ms for 187 fonts. I think I'm going to be adding like 100 variants, so we're not going to save here, unless it's core expected fonts everyone has and data analysis says we don't lose entropy in measurements

@Thorin-Oakenpants
Copy link
Contributor Author

I do have some ideas on how to speed up unicode measurements, maybe - but like fonts, that's an area I'm going to expand

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Development

No branches or pull requests

2 participants