Improvements to Double/Float conversion #121

zapov · 2019-03-30T21:16:14Z

Grisu3 works most of the time but it could be improved/replaced with a different faster algorithm.
Parsing doubles does not match Java algorithm in all cases (unless it's configured with Exact precision option; High precision probably gives the same numbers as Java - but not guaranteed).
Float uses double conversion which can lead to bit loss.

Look into suggested replacement for some of the problems: #120

plokhotnyuk · 2019-03-31T04:22:27Z

Currently during parsing of float primitives there is a small error ~1ULP. I think it should be documented properly until "exact" parsing option (with precision ~0.5ULP) is not available.

BTW, here is a post about the Rust project which tries to push the performance limits without losing in precision:
https://www.reddit.com/r/rust/comments/a6j5j1/making_rust_float_parsing_fast_and_correct/

zapov · 2019-03-31T06:33:12Z

I'm actually not aware of float examples which lead to wrong result. Do you have any?
If I knew of them I would probably already look into it, or added a configuration option for floats too.

I saw that article but didn't have time to look into the code ;(

Also, I find it interesting that Java does not behave as expected too: https://www.exploringbinary.com/java-doesnt-print-the-shortest-strings-that-round-trip/

plokhotnyuk · 2019-03-31T19:39:16Z

The rounding error can be easy reproduced when parsing string representation of some double values.

scala> "1.00000017881393432617187499".toFloat
res0: Float = 1.0000001

scala> "1.00000017881393432617187499".toDouble.toFloat
res1: Float = 1.0000002

The detailed explanation is in this comment

zapov · 2019-03-31T20:16:09Z

Sure, but that will not trigger rounding error in DSL-JSON for floats.
It's not that number is parsed into exact double equivalent first, rather after significant number of digits, the rest will be ignored. Thus this rounding error on double is not hit.

plokhotnyuk · 2019-04-01T06:27:06Z

The following code can print lot of such numbers which are affected by rounding during parsing with DSL-JSON:

val reader = new DslJson[Any](new DslJson.Settings[Any]()).newReader()
(1 to 100000).foreach { _ =>
  val n = ThreadLocalRandom.current().nextLong()
  val x = java.lang.Double.longBitsToDouble(n & ~0xFFFFFFFL)
  if (java.lang.Double.isFinite(x)) checkAndPrint(x.toString)
}

def checkAndPrint(input: String): Unit = {
  val bs = ("[" + input + "]").getBytes
  reader.process(bs, bs.length)
  reader.read()
  val actualOutput = NumberConverter.FLOAT_ARRAY_READER.read(reader)(0)
  val expectedOutput = input.toFloat
  if (actualOutput != expectedOutput) {
    println(s"input = $input, expectedOutput =$expectedOutput, actualOutput = $actualOutput")
  }
}

Below are samples from its output:

input = -269.91502380371094, expectedOutput =-269.91504, actualOutput = -269.915
input = -0.46754591166973114, expectedOutput =-0.4675459, actualOutput = -0.46754593
input = -7.665778767318443E-8, expectedOutput =-7.665779E-8, actualOutput = -7.6657784E-8

plokhotnyuk · 2020-06-07T12:12:45Z

@zapov you can peek solutions for parsing and serialization of floats and decimals immediately from the jsoniter-scala-coreJVM sub-project:

Feel free to translate all them into Java from mine or original code of authors of algorithms, as long as you adhere to the copyright notices for the writing and the code in authors' repositories and/or appropriate attribution is mentioned.

Below are screenshots from results of benchmarks that compares those approaches used in jsoniter-scala with different JSON parsers for Scala on different JVMs. Throughput (ops/sec) of parsing for serialization of arrays with 128 floats or doubles is measured here:

zapov · 2020-06-08T13:33:01Z

It would be nice to improve this, I "just" need to find some time to work on it :D

zapov added enhancement help wanted labels Mar 30, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improvements to Double/Float conversion #121

Improvements to Double/Float conversion #121

zapov commented Mar 30, 2019

plokhotnyuk commented Mar 31, 2019 •

edited

Loading

zapov commented Mar 31, 2019

plokhotnyuk commented Mar 31, 2019 •

edited

Loading

zapov commented Mar 31, 2019

plokhotnyuk commented Apr 1, 2019 •

edited

Loading

plokhotnyuk commented Jun 7, 2020 •

edited

Loading

zapov commented Jun 8, 2020

Improvements to Double/Float conversion #121

Improvements to Double/Float conversion #121

Comments

zapov commented Mar 30, 2019

plokhotnyuk commented Mar 31, 2019 • edited Loading

zapov commented Mar 31, 2019

plokhotnyuk commented Mar 31, 2019 • edited Loading

zapov commented Mar 31, 2019

plokhotnyuk commented Apr 1, 2019 • edited Loading

plokhotnyuk commented Jun 7, 2020 • edited Loading

zapov commented Jun 8, 2020

plokhotnyuk commented Mar 31, 2019 •

edited

Loading

plokhotnyuk commented Mar 31, 2019 •

edited

Loading

plokhotnyuk commented Apr 1, 2019 •

edited

Loading

plokhotnyuk commented Jun 7, 2020 •

edited

Loading