lfs_bd_cmp() compares more bytes at one time #395

gmpy · 2020-03-13T07:39:08Z

Comparing one byte at a time is very slow. Here is the
performance I get from a 128M SPI NAND with NFTL under sequential writes.

| file size | buffer size | write speed |
| --- | --- | --- |
| 10 MB | 0 B | 3206.01 KB/s |
| 10 MB | 1 B | 2434.04 KB/s |
| 10 MB | 2 B | 2685.78 KB/s |
| 10 MB | 4 B | 2857.94 KB/s |
| 10 MB | 8 B | 3060.68 KB/s |
| 10 MB | 16 B | 3155.30 KB/s |
| 10 MB | 64 B | 3193.68 KB/s |
| 10 MB | 128 B | 3230.62 KB/s |
| 10 MB | 256 B | 3153.03 KB/s |
| 70 MB | 0 B | 2258.87 KB/s |
| 70 MB | 1 B | 1827.83 KB/s |
| 70 MB | 2 B | 1962.29 KB/s |
| 70 MB | 4 B | 2074.01 KB/s |
| 70 MB | 8 B | 2147.03 KB/s |
| 70 MB | 64 B | 2179.92 KB/s |
| 70 MB | 256 B | 2179.96 KB/s |

A buffer size of 0 bytes means no validation, and 1 byte is what
littlefs did before. Based on the table above, and to save memory,
comparing 8 bytes at a time is the best trade-off.

Signed-off-by: WeiXiong Liao [email protected]
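
To put the table in relative terms, each row can be expressed as a percentage of the no-validation (0 B) baseline for the same file size. The snippet below is only an illustrative sketch; `bench_row` and `percent_of_baseline` are hypothetical names that are not part of littlefs.

```c
#include <assert.h>
#include <stddef.h>

/* One row of the benchmark table (hypothetical helper type, not littlefs API). */
struct bench_row {
    unsigned buffer_size;  /* compare buffer size in bytes; 0 means no validation */
    double write_speed;    /* measured write speed in KB/s */
};

/* Return the write speed of `row` as a percentage of the
 * no-validation baseline measured for the same file size. */
static double percent_of_baseline(struct bench_row baseline,
                                  struct bench_row row) {
    assert(baseline.write_speed > 0.0);
    return 100.0 * row.write_speed / baseline.write_speed;
}
```

With the 10 MB rows, the 1 B buffer reaches roughly 76% of the unvalidated speed and the 8 B buffer roughly 95%. With the 70 MB rows, the figures are roughly 81% and 95%.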


gmpy · 2020-03-13T07:49:00Z

#381 @geky @pjsg

Following the code, I found that lfs always hits the cache during validation, so the root cause of
the slow speed is the efficiency of the comparison itself. Here is my new patch to improve write performance.
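
The idea can be sketched outside of littlefs as follows. This is a simplified illustration, not the actual patch: `read_fn`, `chunked_cmp`, `mem_dev`, and `mem_read` are hypothetical names, and the in-memory reader only stands in for a cached block-device read. Each iteration reads up to 8 bytes into a small stack buffer and compares them with a single `memcmp` call, instead of issuing one read and one comparison per byte.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* Comparison results (hypothetical, loosely modeled on an EQ/LT/GT scheme). */
enum cmp_result { CMP_EQ = 0, CMP_LT = 1, CMP_GT = 2 };

/* Hypothetical read callback: copy `size` bytes starting at `off` into
 * `out`. Returns 0 on success or a negative error code. */
typedef int (*read_fn)(void *ctx, size_t off, void *out, size_t size);

/* Compare `size` stored bytes at `off` against `buffer`, 8 bytes at a time.
 * Returns a cmp_result, or a negative error code from the read callback. */
static int chunked_cmp(read_fn read, void *ctx, size_t off,
                       const void *buffer, size_t size) {
    assert(read != NULL && (buffer != NULL || size == 0));
    const uint8_t *data = buffer;
    size_t diff = 0;

    for (size_t i = 0; i < size; i += diff) {
        uint8_t dat[8];  /* small on-stack chunk buffer */

        /* Last chunk may be shorter than the buffer. */
        diff = (size - i < sizeof(dat)) ? size - i : sizeof(dat);

        int err = read(ctx, off + i, dat, diff);
        if (err) {
            return err;
        }

        int res = memcmp(dat, data + i, diff);
        if (res) {
            return res < 0 ? CMP_LT : CMP_GT;
        }
    }

    return CMP_EQ;
}

/* In-memory stand-in for a storage device, used only for illustration. */
struct mem_dev {
    const uint8_t *data;
    size_t size;
};

static int mem_read(void *ctx, size_t off, void *out, size_t size) {
    const struct mem_dev *dev = ctx;
    if (off > dev->size || size > dev->size - off) {
        return -1;  /* out-of-range read */
    }
    memcpy(out, dev->data + off, size);
    return 0;
}
```

The chunk buffer lives on the stack, so its size trades stack usage against the number of read and compare calls per validated write.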

pjsg · 2020-03-13T12:43:40Z

This looks reasonable to me....

thrasher8390 · 2020-03-24T17:28:42Z

lfs.c

-        uint8_t dat;
-        int err = lfs_bd_read(lfs,
+    for (lfs_off_t i = 0; i < size; i += diff) {
+        uint8_t dat[8];


Why was 8 chosen?
Remember that this is being stored on the stack.
Any thoughts on making this configurable, or reasons why it shouldn't be configurable?

gmpy · 2020-03-26

Too many configuration options can be confusing.
I think 8 bytes is the best balance between performance and stack usage.

geky · 2020-11-16T23:23:31Z

Hi @gmpy, this is a great patch and great analysis!

Sorry this PR has been hanging for so long. I wasn't able to make a release for a while, but I will bring this in with the next minor release.

thrasher8390 approved these changes Mar 24, 2020

geky added the performance label Apr 9, 2020

geky added next minor v2.3 labels Nov 16, 2020

geky added this to the v2.3 milestone Nov 16, 2020

geky approved these changes Nov 16, 2020

geky changed the base branch from master to devel December 4, 2020 04:34

geky merged commit 6627206 into littlefs-project:devel Dec 4, 2020

This was referenced Dec 4, 2020

Minor release: v2.3 #495

Merged

Forward-looking erase-state CRCs #497

Merged

gtaska mentioned this pull request Feb 28, 2021

Poor write performance (testing with RAM-backed implementation) #535

Open
