fix(perf): Remove FontSources.masks as they were consuming large amounts of memory even when no font sources were set #2519
Conversation
```diff
- let mut needed = self.masks[(start as usize) / CP_RANGE_SIZE].clone();
+ let needed: &BTreeSet<usize> = &self.masks[(start as usize) / CP_RANGE_SIZE];
```
@nyurik Why did you use masks in the first place here instead of something like
```diff
- let needed: &BTreeSet<usize> = &self.masks[(start as usize) / CP_RANGE_SIZE];
+ // Create a BitSet for the requested range (start..=end)
+ let mut needed = BitSet::with_capacity(end as usize);
+ for cp in start..=end {
+     needed.insert(cp as usize);
+ }
```
In the testing I have done, this works fairly well, but there must be a reason why this is done this way that I am not getting.
tbh, it has been a long time since i did it, so no clue :)
I can imagine BitSet would work well for a small range like 0-255 but if you support the whole unicode range you need a lot of memory per bitset.
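A quick back-of-envelope check of that concern (the constants are from the thread, the arithmetic is mine):

```rust
fn main() {
    // Full Unicode range: codepoints 0..=0x10FFFF, i.e. 0x110000 presence bits.
    let max_unicode_cp: usize = 0x10FFFF;
    let bits = max_unicode_cp + 1;
    assert_eq!(bits, 1_114_112);

    // One bit per codepoint: ~136 KiB per dense bitset.
    let bytes = bits / 8;
    assert_eq!(bytes, 139_264);
    assert_eq!(bytes / 1024, 136);

    // Versus a bitset limited to a single 256-codepoint range: just 32 bytes.
    assert_eq!(256 / 8, 32);
}
```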
```diff
  face_index: isize,
  /// Unicode codepoints this font supports.
- codepoints: BitSet,
+ codepoints: BTreeSet<usize>,
```
Unsure if there is a better way. Let's brainstorm a bit.

So we need to know which of the 256 bits (a `BitSet256`) is set. While we currently store this as one very big BitSet, that is likely not necessary, since this range is likely fairly compressible. No font designer would leave weird gaps in their font intentionally, I think.

- `Vec<BitSet256>` works and weighs in at ~1.1 megabytes (`MAX_UNICODE_CP` bits)
- `BTreeSet<u32>` works and is 32 B * number of font items -> likely lower than above
- `HashMap<u16, BitSet256>` ((start / 256) -> next 256 code points) should work and is a compromise between the two?
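A minimal sketch of the `HashMap<u16, BitSet256>` compromise from the list above (the type and method names are hypothetical, not from the codebase; the 256-bit page is modeled as `[u64; 4]`):

```rust
use std::collections::HashMap;

/// Hypothetical paged codepoint set: one 256-bit page per occupied
/// 256-codepoint block, keyed by `codepoint / 256`.
#[derive(Default)]
struct PagedCodepointSet {
    pages: HashMap<u16, [u64; 4]>, // 4 * 64 = 256 bits per page
}

impl PagedCodepointSet {
    fn insert(&mut self, cp: u32) {
        let page = (cp / 256) as u16;  // block index, 0..=0x10FF fits in u16
        let bit = (cp % 256) as usize; // bit within the block
        let words = self.pages.entry(page).or_insert([0u64; 4]);
        words[bit / 64] |= 1 << (bit % 64);
    }

    fn contains(&self, cp: u32) -> bool {
        let page = (cp / 256) as u16;
        let bit = (cp % 256) as usize;
        self.pages
            .get(&page)
            .map_or(false, |w| w[bit / 64] & (1 << (bit % 64)) != 0)
    }
}

fn main() {
    let mut set = PagedCodepointSet::default();
    set.insert(0x41);     // 'A'
    set.insert(0x10FFFF); // highest codepoint
    assert!(set.contains(0x41));
    assert!(set.contains(0x10FFFF));
    assert!(!set.contains(0x42));
    // Only two 32-byte pages allocated despite the huge codepoint spread.
    assert_eq!(set.pages.len(), 2);
}
```

Memory here scales with the number of occupied 256-codepoint blocks rather than with the highest codepoint, which is the compromise described above.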
Noticed that only u32 is needed for the index; that is another halving of that memory usage.
This also stacks with the improvement above.
Yeah there was a mix of usize and u32 around so definitely could standardise on u32 and avoid a bunch of casting
The problem with using a BitSet256 is you need to range it from 0-255 but the codepoints are 0-0x10FFFF so you either need to translate the requested codepoints back to 0-255 before masking (losing the performance of a bitset) or have one GIANT BitSet for each range. If you check the size of each BitSet before you'll see some are as big as 0x10FFFF (depending on the largest inserted value).
```diff
- let mut bs = BitSet::with_capacity(CP_RANGE_SIZE);
+ let mut set = BTreeSet::new();
  for v in 0..=MAX_UNICODE_CP {
      bs.insert(v);
```
I think the core issue is that although we allocate a BitSet::with_capacity(256) the value for v here might be 0x10FFFF causing a MASSIVE reallocation to fit values 0-0x10FFFF in the BitSet. From what I understand, the values must be contiguous without holes so each BitSet could be as large as (0x10FFFF / 32) (bits per u32) * 4 (bytes per u32) and then each of these is inserted for each range up to 4352.
my understanding is that each file would use ~136 KB of RAM (136*1024*8 = 1_114_112 = 0x11_0000 codepoint presence bits) - not that much memory. If we follow the optimization I proposed below, most fonts won't even get that high, but it will keep the complexity to a minimum.
The issue is less with each font source since it's only a single BitSet of 34816 u32 (139KB). The issue is with the pre-computed masks. Each range (of which there are 4352) carries a masking BitSet of up to 0x11_0000 (34816 u32 [4 bytes] in the backing BitVec). Assuming every range is this size (inaccurately) we get 600MB (34816*4*4352 bytes). More accurately we get masking BitSets with sizes growing from 8-34816 u32 and more likely around 300MB total. Prior to adjusting the max codepoint size we had smaller BitSets (8-2048 u32) and less of them 256 (0xFFFF/256). Resulting in at most ~2MB (2048*4*256 bytes) for masks but likely less due to not all BitSets being 2048 size.
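The arithmetic behind those estimates, written out as a worst-case sketch (the numbers come from the comment above, not from a measurement):

```rust
fn main() {
    // A full-range bitset backed by u32 words: 0x110000 bits / 32 bits per u32.
    let words_per_full_bitset: usize = 34_816;
    assert_eq!(0x110000 / 32, words_per_full_bitset);
    let bytes_per_full_bitset = words_per_full_bitset * 4;
    assert_eq!(bytes_per_full_bitset, 139_264); // ~139 KB per font source

    // Worst case for the masks: every one of the 4352 ranges grows to full size.
    let ranges = 0x110000 / 256;
    assert_eq!(ranges, 4352);
    let worst_case = bytes_per_full_bitset * ranges;
    assert_eq!(worst_case, 606_076_928); // ~600 MB

    // Before widening the max codepoint past 0xFFFF: 256 ranges of up to 2048 u32.
    let old_worst_case = 2048 * 4 * 256;
    assert_eq!(old_worst_case, 2_097_152); // ~2 MB
}
```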
one thing we could try without any other changes:

```diff
 fn get_available_codepoints(face: &mut Face) -> Option<GetGlyphInfo> {
-    let mut codepoints = BitSet::with_capacity(MAX_UNICODE_CP);
+    let mut codepoints = BitSet::new(); // dynamic memory growth
     ...
     for cp in 0..=MAX_UNICODE_CP {
         ...
+        if count >= face.num_glyphs() { break; } // we know we won't find any more glyphs in this font
     }
 }
```
P.S. Oh, silly me -- we no longer need to even do that -- I added character iteration to the freetype lib 2 years ago -- PistonDevelopers/freetype-rs#257 so should just be able to call …
So maybe no masks? I was wondering how expensive it would be to iterate vs difference the set in the first place
freetype-rs doesn't support iteration from arbitrary location (PRs welcome i guess), so i think it might still be useful to have a bitmask for perf reasons, esp since it is much simpler now. Since you ran all the memory usage tests, could you try this implementation?

```rust
fn get_available_codepoints(face: &mut Face) -> Option<GetGlyphInfo> {
    let mut codepoints = BitSet::new();
    let mut spans = Vec::new();
    let mut first: Option<usize> = None;
    let mut last = 0;
    for (cp, _) in face.chars() {
        codepoints.insert(cp);
        if let Some(start) = first {
            if cp != last + 1 {
                spans.push((start, last));
                first = Some(cp);
            }
        } else {
            first = Some(cp);
        }
        last = cp;
    }
    if let Some(first) = first {
        spans.push((first, last));
        let start = spans[0].0;
        let end = spans[spans.len() - 1].1;
        Some((codepoints, usize::try_from(face.num_glyphs()).unwrap(), spans, start, end))
    } else {
        None
    }
}
```
(didn't test, but had to update the above a few times)
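The span-building part of that sketch can be exercised without freetype by feeding it a plain sorted sequence of codepoints (the `spans_of` helper is mine, extracted for illustration; the real code iterates `face.chars()`):

```rust
use std::collections::BTreeSet;

/// Collapse a sorted codepoint iterator into contiguous (start, end) spans,
/// mirroring the span logic from the `get_available_codepoints` sketch.
fn spans_of(cps: impl IntoIterator<Item = usize>) -> Vec<(usize, usize)> {
    let mut spans = Vec::new();
    let mut first: Option<usize> = None;
    let mut last = 0;
    for cp in cps {
        if let Some(start) = first {
            if cp != last + 1 {
                // Gap found: close the current span and start a new one.
                spans.push((start, last));
                first = Some(cp);
            }
        } else {
            first = Some(cp);
        }
        last = cp;
    }
    if let Some(start) = first {
        spans.push((start, last)); // close the final (possibly single-cp) span
    }
    spans
}

fn main() {
    // ASCII letters plus one lone high codepoint form three spans.
    let cps: BTreeSet<usize> = (0x41..=0x5A)
        .chain(0x61..=0x7A)
        .chain([0x10FFFF])
        .collect();
    assert_eq!(
        spans_of(cps),
        vec![(0x41, 0x5A), (0x61, 0x7A), (0x10FFFF, 0x10FFFF)]
    );
    assert!(spans_of(std::iter::empty::<usize>()).is_empty());
}
```

This also makes it easy to check the edge case mentioned below (a lone codepoint at the end of the range still closes its span).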
interestingly enough, my original implementation seemed to have had a bug - if a font file had a codepoint at the …
(note to self - github editor is NOT that great for ... editing code)

@Auspicus have you had a chance to memory profile it?
```diff
  /// Maximum Unicode codepoint range ID.
  ///
- /// Each range is 256 codepoints long, so the highest range ID is 0xFFFF / 256 = 255.
+ /// Each range is 256 codepoints long, so the highest range ID is 0x10FFFF / 256 = 255.
```
Should fix this number which is now 0x10FFFF/256 = 4352 (rounded up)
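A quick check of that integer arithmetic: with integer division the highest range ID is 4351, and the number of ranges is one more:

```rust
fn main() {
    const MAX_UNICODE_CP: usize = 0x10FFFF;
    const CP_RANGE_SIZE: usize = 256;

    // Integer division gives the highest range ID...
    let max_range_id = MAX_UNICODE_CP / CP_RANGE_SIZE;
    assert_eq!(max_range_id, 4351); // 0x10FF

    // ...while the total number of 256-codepoint ranges is one more.
    assert_eq!(max_range_id + 1, 4352);
}
```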
I am just starting my workday now but I'll review again after work today. I will also post the dhat file in the viewer as it really highlights the BitSet allocations.
…an BTreeSet at full capacity), lower lambda timeout and memory usage back to explicit default
```diff
  /// Returns `None` if the font contains no usable glyphs.
  fn get_available_codepoints(face: &mut Face) -> Option<GetGlyphInfo> {
-     let mut codepoints = BitSet::with_capacity(MAX_UNICODE_CP);
+     let mut codepoints = BitSet::new();
```
Without explicit capacity there will be some required resizing, but it allows this BitSet to be as small as 8 u32 words (256 bits) when the font only provides codepoints within the range 0-255. If the font provides even a single codepoint at the end of the range this will need to be resized to 139KB. So we're trading off a small amount of start up time here for a lower memory footprint per font.
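That tradeoff can be sketched with a minimal growable bitset (a hypothetical stand-in for the crate-backed BitSet, not the actual implementation):

```rust
/// Minimal growable bitset over u32 words, grown on demand
/// (hypothetical stand-in for the crate-backed BitSet).
struct GrowBitSet {
    words: Vec<u32>,
}

impl GrowBitSet {
    fn new() -> Self {
        GrowBitSet { words: Vec::new() }
    }

    fn insert(&mut self, bit: usize) {
        let word = bit / 32;
        if word >= self.words.len() {
            self.words.resize(word + 1, 0); // reallocation on growth
        }
        self.words[word] |= 1 << (bit % 32);
    }

    fn backing_bytes(&self) -> usize {
        self.words.len() * 4
    }
}

fn main() {
    let mut set = GrowBitSet::new();
    set.insert(0xFF); // a font covering only 0-255 stays tiny
    assert_eq!(set.backing_bytes(), 32);
    set.insert(0x10FFFF); // one high codepoint forces growth to ~139 KB
    assert_eq!(set.backing_bytes(), 139_264);
}
```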
Before / After: (memory profile screenshots)
I'm happy with where this is at now, let me know if you think we need to change anything @CommanderStorm / @nyurik
oops, we should have added to the subject / changelog that this also fixes incorrect font behavior - now the font glyph count will be reported correctly |
## 🤖 New release

* `martin-tile-utils`: 0.6.8 -> 0.6.9 (✓ API compatible changes)
* `mbtiles`: 0.15.0 -> 0.15.1 (✓ API compatible changes)
* `martin-core`: 0.2.5 -> 0.2.6 (✓ API compatible changes)
* `martin`: 1.2.0 -> 1.3.0

<details><summary><i><b>Changelog</b></i></summary><p>

## `mbtiles`
<blockquote>

## [0.15.1](mbtiles-v0.15.0...mbtiles-v0.15.1) - 2026-01-27

### Added
- add MLT decoding support ([#2512](#2512))
- migrate our log library to tracing ([#2494](#2494))

### Other
- unignore `diff_and_patch_bsdiff` test with unique SQLite database names ([#2480](#2480))
- *(mbtiles)* remove the prefix-ism around how files are named for binary diff copy and simpify their naming ([#2478](#2478))
- *(mbtiles)* add assertion messages what we are checking to the copy tests ([#2477](#2477))
</blockquote>

## `martin-core`
<blockquote>

## [0.2.6](martin-core-v0.2.5...martin-core-v0.2.6) - 2026-01-27

### Added
- migrate our log library to tracing ([#2494](#2494))
- *(martin-core)* Allow glyph ranges more than 0xFFFF ([#2438](#2438))

### Fixed
- *(perf)* Remove FontSources.masks as they were consuming large amounts of memory even when no font sources were set ([#2519](#2519))
- improve error message if no SVG sprite files are present ([#2516](#2516))

### Other
- move our imports to tracing ([#2500](#2500))
- *(deps)* shear our dependencys ([#2497](#2497))
</blockquote>

## `martin`
<blockquote>

## [1.3.0](martin-v1.2.0...martin-v1.3.0) - 2026-01-27

### Added
- *(srv)* Add `route_prefix` configuration for native subpath support without the need of a reverse proxy override ([#2523](#2523))
- add MLT decoding support ([#2512](#2512))
- migrate our log library to tracing ([#2494](#2494))
- improve martin-cp progress output time estimate ([#2491](#2491))
- *(pg)* include ID column info for tables ([#2485](#2485))
- *(pg)* support PostgreSQL materialized views ([#2279](#2279))
- *(martin-core)* Allow glyph ranges more than 0xFFFF ([#2438](#2438))

### Fixed
- *(ui)* clipboard copy for http://0.0.0.0:3000 and unify implementations ([#2487](#2487))
- the `Copy` icon displaying nicely, next to the text and with enough padding ot all items ([#2483](#2483))
- update copy text to include icon for better visibility ([#2482](#2482))
- *(perf)* Remove FontSources.masks as they were consuming large amounts of memory even when no font sources were set ([#2519](#2519))
- improve error message if no SVG sprite files are present ([#2516](#2516))

### Other
- move our request logging to tracing ([#2508](#2508))
- move our imports to tracing ([#2500](#2500))
- *(deps)* shear our dependencys ([#2497](#2497))
- *(ui)* adjust margin for copy icon in URL component ([#2489](#2489))
- unignore `diff_and_patch_bsdiff` test with unique SQLite database names ([#2480](#2480))
- *(mbtiles)* remove the prefix-ism around how files are named for binary diff copy and simpify their naming ([#2478](#2478))
- *(mbtiles)* add assertion messages what we are checking to the copy tests ([#2477](#2477))
</blockquote>
</p></details>

---

This PR was generated with [release-plz](https://github.com/release-plz/release-plz/).

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>


fixes #2518
This drops the RSS value after the bitsets are computed. Unfortunately the bitset computation still spikes the memory usage quite high.
UPDATE: Swapping out BitSet for BTreeSet results in much lower peak memory usage after startup. I don't think BitSet is doing what we want here.
UPDATE: Replaced BTreeSet back with BitSet for the font codepoints for lower peak memory usage for a font that provides every available codepoint.