
Introduce DecodeFrom and EncodeReader #3

Merged: 1 commit merged into linvon:main on Oct 9, 2021

Conversation

@EladGabay (Contributor)

The buckets byte slice is the biggest part of the memory used by the filter,
and can be several GB.

A common usage pattern is an environment whose RAM is sized to the filter:
load the filter into memory on startup and dump it to disk on teardown.
Currently the Encode and Decode methods duplicate the byte slice,
which makes the memory usage at load and dump time (at least) twice the
filter size.

This commit introduces a new method for dumping the filter through a reader
over the internal byte slice, and a method for loading the filter from
already fetched encoded bytes (from disk, network) that uses them
internally instead of making a copy.

  • formatting.
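As a rough, self-contained sketch of the idea (not the library's actual code: the toyFilter type, method names, and encoding layout below are invented for illustration), dumping goes through an io.Reader over the internal slice, and loading reuses the already-fetched bytes:

```go
package main

import (
	"bytes"
	"fmt"
	"io"
)

// toyFilter stands in for the real filter; buckets can be several GB.
type toyFilter struct {
	buckets []byte
}

// encodeReader exposes the internal slice through a reader plus its size,
// so callers can stream it to disk or the network without duplicating it.
func (f *toyFilter) encodeReader() (io.Reader, uint) {
	return bytes.NewReader(f.buckets), uint(len(f.buckets))
}

// decodeFrom keeps a reference to b instead of copying it into a new slice.
func decodeFrom(b []byte) *toyFilter {
	return &toyFilter{buckets: b}
}

func main() {
	src := &toyFilter{buckets: []byte{1, 2, 3, 4}}

	r, size := src.encodeReader()
	dump := make([]byte, size) // exactly-sized buffer, allocated once
	if _, err := io.ReadFull(r, dump); err != nil {
		panic(err)
	}

	dst := decodeFrom(dump) // reuses dump; peak memory stays ~1x filter size
	fmt.Println(dst.buckets)
}
```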

@linvon (Owner) commented Oct 6, 2021

Nice idea by the way

@EladGabay (Contributor, Author)

Nice idea by the way

Would you like to merge it :)?

@linvon (Owner) commented Oct 7, 2021

Nice idea by the way

Would you like to merge it :)?

I think metaDataSize should be removed from SizeInBytes, can you fix it?

@EladGabay (Contributor, Author) commented Oct 7, 2021

SizeInBytes should reflect the size of the encoded filter (metadata + data); this way the user can prepare the required memory for encoding/decoding, and it is the actual number of bytes used by the filter.
In addition, the exact byte count is needed to create the byte slice in Encode before ReadFull; otherwise we would need to use ReadAll and pay for it with re-allocations and copies.

Added a new commit that aligns this in the filter object.
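To illustrate the exact-size argument (a hedged sketch; the function and parameter names here are mine, not the library's): knowing the encoded size up front lets Encode allocate the output once and fill it with io.ReadFull, whereas io.ReadAll has to grow its buffer through repeated re-allocations and copies.

```go
package encdemo

import "io"

// encodeWithKnownSize allocates the output slice once, using the exact
// encoded size, and fills it in a single pass with io.ReadFull.
func encodeWithKnownSize(r io.Reader, encodedSize uint) ([]byte, error) {
	buf := make([]byte, encodedSize)
	if _, err := io.ReadFull(r, buf); err != nil {
		return nil, err
	}
	return buf, nil
}

// encodeWithoutSize has to fall back to io.ReadAll, which grows the buffer
// as it reads and so re-allocates and copies along the way.
func encodeWithoutSize(r io.Reader) ([]byte, error) {
	return io.ReadAll(r)
}
```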

@linvon (Owner) commented Oct 8, 2021

I'd like to merge the Reader part; we can discuss the Size part in the future. Can you split this into two MRs?

@EladGabay (Contributor, Author)

I suggest keeping SizeInBytes without the metadata part and introducing an EncodedSizeInBytes method that returns the size including the metadata. Sounds good?
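In code, the proposed split might look roughly like this (a sketch under assumptions: the struct and the metadata constant are placeholders, and the real metadata size will differ):

```go
package sizedemo

// metaDataSize is a placeholder for the fixed-size header written before
// the bucket bytes; the real value depends on the filter's encoding.
const metaDataSize = 3

type filter struct {
	buckets []byte
}

// SizeInBytes reports only the in-memory data size (the bucket bytes).
func (f *filter) SizeInBytes() uint {
	return uint(len(f.buckets))
}

// EncodedSizeInBytes reports the full serialized size, metadata included,
// so callers can pre-allocate buffers for Encode/Decode.
func (f *filter) EncodedSizeInBytes() uint {
	return f.SizeInBytes() + metaDataSize
}
```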

@linvon (Owner) commented Oct 9, 2021

EncodedSizeInBytes

This is okay too.

@EladGabay (Contributor, Author)

Now the reader also returns the size, so we can prepare the memory in advance.

linvon merged commit 92f5275 into linvon:main on Oct 9, 2021
@EladGabay (Contributor, Author)

Would you like to create a new tag?

@linvon (Owner) commented Oct 10, 2021

Would you like to create a new tag?

Sure.
