base64 decoder could be 2x faster when decoding wrapped base64 #12114

jorangreef · 2017-03-29T08:50:05Z

Node's base64 decoder currently uses a fast decoder and a slow decoder.

The fast decoder decodes 32-bit words at a time. If it sees a line-break or whitespace or garbage, then it switches permanently to the slow decoder which decodes a byte at a time, with a conditional branch per byte, instead of per 32-bit word.

I did some rough benchmarking to compare decoding a 4mb random buffer encoded as base64, and decoding the same base64 but with CRLFs added every 76 chars as per MIME base64:

Decode Fast: 8ms
Decode Fast: 9ms
Decode Fast: 9ms
Decode Slow (wrapped every 76 chars): 30ms
Decode Slow (wrapped every 76 chars): 20ms
Decode Slow (wrapped every 76 chars): 22ms

As far as I can see, there's no reason to switch permanently to the slow decoder. If the fast decoder detects that the 32-bit word contains an invalid character, it could just decode the next few bytes byte-by-byte, and then switch back to fast mode as soon as it has consumed 4 valid base64 characters and outputted 3 bytes. This could all be a sub-branch after the 32-bit word check so it should not affect the performance of the fast decoder in any way.

For base64 decoding MIME data, this should nearly double the throughput since the slow case is triggered only every 76 bytes.

The text was updated successfully, but these errors were encountered:

sathvikl · 2017-03-31T01:27:18Z

can you please post the js code sample to test this.. I could probably take a look at it.

jorangreef · 2017-03-31T07:51:40Z

var crypto = require('crypto');
var data = crypto.randomBytes(32*1024*1024);
var base64 = '';
var now = 0;

var times = 3;
while (times--) {
  now = Date.now();
  base64 = data.toString('base64');
  console.log('Encode: ' + (Date.now() - now) + 'ms');
}

var times = 3;
while (times--) {
  now = Date.now();
  Buffer.from(base64, 'base64');
  console.log('Decode Fast: ' + (Date.now() - now) + 'ms');
}

function wrap(string) {
  var lines = [];
  var index = 0;
  var length = string.length;
  while (index < length) {
    lines.push(string.slice(index, index += 76));
  }
  return lines.join('\r\n');
}

base64 = wrap(base64);

var times = 3;
while (times--) {
  now = Date.now();
  Buffer.from(base64, 'base64');
  console.log('Decode Slow: ' + (Date.now() - now) + 'ms');
}

aqrln · 2017-03-31T14:29:07Z

@jorangreef tried to do that, but the performance increase is only 10–11% for me: #12146.

The fast base64 decoder used to switch to the slow one permanently when it saw a whitespace or other garbage character. Since the most common situation such characters may be encountered in is line-wrapped base64 data, a more profitable strategy is to decode a single 24-bit group with the slow decoder and then continue running the fast algorithm. Refs: nodejs#12114

jorangreef · 2017-04-03T11:28:41Z

@aqrln I just finished https://github.com/ronomon/base64 which is an alternative C++ buffer-to-buffer encoder/decoder (you can decode a buffer containing base64 without allocating an interim string). If you take a look at the C++ binding source, it handles the slow case without switching permanently to slow mode. I also included a simplistic fast-slow.js benchmark in the source for this exact issue:


 Decoding Base64:

    Node: Decode: 42ms
    Node: Decode: 41ms
    Node: Decode: 42ms

  Base64: Decode: 45ms
  Base64: Decode: 44ms
  Base64: Decode: 46ms

 Decoding Base64 (wrapped every 76 characters):

    Node: Decode: 117ms
    Node: Decode: 119ms
    Node: Decode: 119ms

  Base64: Decode: 53ms
  Base64: Decode: 54ms
  Base64: Decode: 54ms

The fast base64 decoder used to switch to the slow one permanently when it saw a whitespace or other garbage character. Since the most common situation such characters may be encountered in is line-wrapped base64 data, a more profitable strategy is to decode a single 24-bit group with the slow decoder and then continue running the fast algorithm. PR-URL: #12146 Ref: #12114 Reviewed-By: Anna Henningsen <[email protected]> Reviewed-By: Trevor Norris <[email protected]> Reviewed-By: James M Snell <[email protected]>

jorangreef · 2017-04-07T09:17:01Z

Thanks @aqrln, just wanted to check that you were able to increase the performance by close to 100% in the end?

aqrln · 2017-04-07T13:40:15Z

@jorangreef nope, I haven't worked on this since that PR. If you'd like to optimize it more to achieve the performance of your userland library, it would be really great. If not, then I could maybe take a look at it later.

jorangreef · 2017-04-07T13:48:27Z

Thanks @aqrln, I hope you can run with it and get it all the way there - your c++ is probably better than mine. You are welcome to copy the decoder verbatim from my source if not.

The fast base64 decoder used to switch to the slow one permanently when it saw a whitespace or other garbage character. Since the most common situation such characters may be encountered in is line-wrapped base64 data, a more profitable strategy is to decode a single 24-bit group with the slow decoder and then continue running the fast algorithm. PR-URL: nodejs#12146 Ref: nodejs#12114 Reviewed-By: Anna Henningsen <[email protected]> Reviewed-By: Trevor Norris <[email protected]> Reviewed-By: James M Snell <[email protected]>

jorangreef · 2017-07-25T08:54:34Z

Here's the latest on this with node v8.2.1.

Generated by https://github.com/ronomon/base64/blob/master/fast-slow.js:


 Decoding Base64:

    Node: Decode: 47ms
    Node: Decode: 49ms
    Node: Decode: 48ms

  Base64: Decode: 46ms
  Base64: Decode: 46ms
  Base64: Decode: 47ms

 Decoding Base64 (wrapped every 76 characters):

    Node: Decode: 110ms
    Node: Decode: 111ms
    Node: Decode: 112ms

  Base64: Decode: 56ms
  Base64: Decode: 57ms
  Base64: Decode: 53ms

The Base64 reference is at https://github.com/ronomon/base64.

bnoordhuis · 2018-05-28T19:11:02Z

Seeing there's been no real movement on this in over a year, I'll go ahead and close this out. Pull requests still welcome, of course.

mscdex added buffer Issues and PRs related to the buffer subsystem. c++ Issues and PRs that require attention from people who are familiar with C++. labels Mar 29, 2017

jorangreef changed the title ~~base64 decoder could be 2x faster when base64 data is wrapped~~ base64 decoder could be 2x faster when decoding wrapped base64 Mar 29, 2017

addaleax added the performance Issues and PRs related to the performance of Node.js. label Mar 29, 2017

aqrln mentioned this issue Mar 31, 2017

buffer: optimize decoding wrapped base64 data #12146

Closed

3 tasks

aqrln self-assigned this Apr 7, 2017

bnoordhuis closed this as completed May 28, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

base64 decoder could be 2x faster when decoding wrapped base64 #12114

base64 decoder could be 2x faster when decoding wrapped base64 #12114

jorangreef commented Mar 29, 2017

sathvikl commented Mar 31, 2017

jorangreef commented Mar 31, 2017 •

edited

Loading

aqrln commented Mar 31, 2017

jorangreef commented Apr 3, 2017

jorangreef commented Apr 7, 2017

aqrln commented Apr 7, 2017

jorangreef commented Apr 7, 2017

jorangreef commented Jul 25, 2017 •

edited

Loading

bnoordhuis commented May 28, 2018

base64 decoder could be 2x faster when decoding wrapped base64 #12114

base64 decoder could be 2x faster when decoding wrapped base64 #12114

Comments

jorangreef commented Mar 29, 2017

sathvikl commented Mar 31, 2017

jorangreef commented Mar 31, 2017 • edited Loading

aqrln commented Mar 31, 2017

jorangreef commented Apr 3, 2017

jorangreef commented Apr 7, 2017

aqrln commented Apr 7, 2017

jorangreef commented Apr 7, 2017

jorangreef commented Jul 25, 2017 • edited Loading

bnoordhuis commented May 28, 2018

jorangreef commented Mar 31, 2017 •

edited

Loading

jorangreef commented Jul 25, 2017 •

edited

Loading