Allow use of faster floating-point number parsing (Schubfach) with `StreamReadFeature.USE_FAST_DOUBLE_PARSER` #577

cowtowncoder · 2019-11-06T00:03:07Z

The jsoniter project (https://github.com/plokhotnyuk/jsoniter-scala) has many impressive performance optimizations; they are linked, for example, from here:

https://www.reddit.com/r/java/comments/darehu/jackson_release_210/f1ysf1e/

Of the ones included, number parsing would be relevant for this repo.

EDIT: also see (from the comment below)

"Unrelated to jsoniter but this recent port of Lemire's Double parser:

https://github.com/wrandelshofer/FastDoubleParser

and the original paper https://arxiv.org/abs/2101.11408 also relevant"

cowtowncoder · 2019-11-14T23:28:25Z

And specifically, for BigDecimal case:

https://github.com/plokhotnyuk/jsoniter-scala/blob/b51f90b87923305b6fa177a4da909ff43da3c7b7/jsoniter-scala-core/src/main/scala/com/github/plokhotnyuk/jsoniter_scala/core/JsonReader.scala#L1477-L1591

anneloreegger · 2020-10-10T00:26:35Z

I'd like to work on this issue, is it still available?

cowtowncoder · 2020-10-10T03:53:44Z

@anneloreegger Yes, this is available! I'll assign it to you (no obligation to work, just a marker so others know you are considering it).

anneloreegger · 2020-10-10T10:39:12Z

Just to be sure I understand the issue correctly: should I mainly improve the parsing methods inside the NumberInput class (especially parseBigDecimal)?
And should this be done on 2.12 or master branch?
Thanks for the answers :)

cowtowncoder · 2020-10-11T21:23:24Z

@anneloreegger Yes, I think NumberInput has stubs that would allow for this. On 2.12 vs master -- ideally 2.12, if this seems doable in the short term with well-contained changes. But master if there is a need to change the API or internal interfaces.
Unfortunately, I have not investigated jsoniter's approach enough to know how involved this gets.

cowtowncoder · 2021-02-05T03:04:55Z

NOTE: BigDecimal use case covered by #677 (for 2.13), will edit title

LifeIsStrange · 2021-03-22T17:27:33Z

Unrelated to jsoniter, but this recent port of Lemire's double parser is 5.5x faster than the default double parser!!
(if the microbenchmark generalizes)
https://github.com/wrandelshofer/FastDoubleParser

see also the original paper https://arxiv.org/abs/2101.11408

cowtowncoder · 2021-03-27T18:47:11Z

@LifeIsStrange thank you for sharing this! I hope someone has time to investigate the possibility of a PR for improvements here.

abc12345678912345 · 2021-09-03T14:07:55Z

@cowtowncoder I'd like to work on this issue, could you assign it to me?

cowtowncoder · 2021-09-07T19:52:45Z

I think @anneloreegger is not working on this, so will change assignee to @abc12345678912345.

If anyone else wants to try to do it instead, please simply add a note here as a courtesy; assignments are about intentions, but with OSS things come and go.

pjfanning · 2022-03-31T10:32:21Z

@cowtowncoder if FastDoubleParser were to be used in jackson-core, would you accept it as a dependency, or would the code need to be inlined in jackson-core?

jackson-core is still Java 6 bound; is this going to change? FastDoubleParser 0.2.0 is Java 8 bound and 0.3.0 is Java 11 bound.

cowtowncoder · 2022-03-31T18:15:53Z

I would require inlining; using the Maven shade plug-in might be acceptable. But not an external dependency.

As to Java 6... tough call. Was hoping to leave it for Jackson 2.x, but may reconsider. Java 6 support is iffy anyway; most likely we have Java 7. And not sure if much usage of latest versions by anyone is pre-Java 8 (although Android may have some odd limits).

But Java 11 would be step too far for Jackson 2.x I think. Jackson 3.0, once I get back to doing it, may well choose different baseline -- Java 14? -- but right now it still only requires Java 8.

pjfanning · 2022-03-31T18:22:46Z

com.fasterxml.jackson.core.io.ContentReference uses java.util.Objects, and that class only arrived in Java 7.

My IDE won't compile jackson-core because of this. Only command line maven build seems to work.

IntelliJ seems more concerned with enforcing these props - https://github.com/FasterXML/jackson-core/blob/2.14/pom.xml#L40 - than maven itself is

cowtowncoder · 2022-03-31T18:27:40Z

Ok, that is a good example of accidental inclusion of Java 7 things that default tooling (compilation is already with JDK 8) cannot detect, and that we haven't added anything to guard.
Earlier when JDK 6 was available on CI platforms this was easier to guard against.

So I am not against proposing to raise Jackson 2.14 (for example) baseline officially to Java 8.

pjfanning · 2022-03-31T18:29:38Z

One extra question: FastDoubleParser has a copyright notice and is generally MIT licensed. How would you handle this if the classes were copied to jackson-core? A special notice file? Could I keep the license headers in the source files too?

example: https://github.com/wrandelshofer/FastDoubleParser/blob/java8/src/main/java/ch/randelshofer/fastdoubleparser/FastDoubleParser.java#L3

cowtowncoder · 2022-04-23T17:03:45Z

@pjfanning Looks good. Just one question: are references to "ints" in this context still double values (like, 1.0 and 25.0), or do they relate to actual Java integer numbers?

pjfanning · 2022-04-29T12:13:10Z

@cowtowncoder this issue is more about parsing and the FastDoubleParser can handle numbers with and without decimal points. When looking at serializing numbers, we can look whether the suggested alternatives like Schubfach output different values from Double.toString. Ultimately, I think any changes will be hidden behind config settings that are documented to warn users about potential diffs.
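
One way to check for such diffs on the parsing side is to compare a candidate parser against `Double.parseDouble` bit for bit. The following is only a minimal sketch using the JDK alone: the class name `ParserAgreementCheck`, the `agrees` helper, and the `BigDecimal`-based stand-in candidate are hypothetical names for illustration, not part of jackson-core or FastDoubleParser.

```java
import java.math.BigDecimal;
import java.util.function.ToDoubleFunction;

public class ParserAgreementCheck {
    /**
     * Returns true if the candidate parser produces bit-identical doubles
     * to Double.parseDouble for every input. (Hypothetical helper.)
     */
    static boolean agrees(ToDoubleFunction<String> candidate, String... inputs) {
        for (String text : inputs) {
            long expected = Double.doubleToLongBits(Double.parseDouble(text));
            long actual = Double.doubleToLongBits(candidate.applyAsDouble(text));
            if (expected != actual) {
                return false;
            }
        }
        return true;
    }

    public static void main(String[] args) {
        // Inputs with and without a decimal point.
        String[] samples = {"1", "25", "1.0", "25.0", "0.1", "1e-7", "123456789.125"};
        // Stand-in candidate (assumption): parse via BigDecimal, then narrow to double.
        ToDoubleFunction<String> viaBigDecimal = s -> new BigDecimal(s).doubleValue();
        System.out.println("candidate agrees with JDK: " + agrees(viaBigDecimal, samples));
    }
}
```

A real comparison would swap the stand-in for the parser under test and use a much larger, randomized input set.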

re-thc · 2022-06-07T08:42:03Z

Note that a similar but likely faster version has already been integrated into the JDK. See this PR for reference. A summary exists in this reddit post.

plokhotnyuk · 2022-06-07T08:56:34Z

@re-thc Also, Giulietti's "The Schubfach way to render doubles" was adopted for JDK 8 here and improved for JDK 11 here.

Below are benchmark results for serialization of doubles by different JSON parsers for Scala using JDK 19.

Before openjdk/jdk#3402

[info] Benchmark                                    (size)   Mode  Cnt       Score      Error  Units
[info] ArrayOfDoublesWriting.avSystemGenCodec          128  thrpt    5   49126.034 ±  752.732  ops/s
[info] ArrayOfDoublesWriting.borer                     128  thrpt    5   48017.213 ±  576.038  ops/s
[info] ArrayOfDoublesWriting.circe                     128  thrpt    5   53851.533 ±  727.258  ops/s
[info] ArrayOfDoublesWriting.circeJsoniter             128  thrpt    5  211738.017 ± 5150.114  ops/s
[info] ArrayOfDoublesWriting.dslJsonScala              128  thrpt    5   92222.619 ±  965.339  ops/s
[info] ArrayOfDoublesWriting.jacksonScala              128  thrpt    5   52508.342 ±  350.423  ops/s
[info] ArrayOfDoublesWriting.jsoniterScala             128  thrpt    5  284303.731 ± 3273.204  ops/s
[info] ArrayOfDoublesWriting.jsoniterScalaPrealloc     128  thrpt    5  298571.032 ± 3925.366  ops/s
[info] ArrayOfDoublesWriting.ninnyJson                 128  thrpt    5  212634.305 ± 5421.051  ops/s
[info] ArrayOfDoublesWriting.playJson                  128  thrpt    5   17558.393 ±   89.862  ops/s
[info] ArrayOfDoublesWriting.playJsonJsoniter          128  thrpt    5   35317.875 ± 1251.812  ops/s
[info] ArrayOfDoublesWriting.smithy4sJson              128  thrpt    5  236212.794 ± 4307.005  ops/s
[info] ArrayOfDoublesWriting.sprayJson                 128  thrpt    5   30913.349 ± 1243.390  ops/s
[info] ArrayOfDoublesWriting.uPickle                   128  thrpt    5   51697.516 ± 1157.040  ops/s
[info] ArrayOfDoublesWriting.weePickle                 128  thrpt    5   53931.179 ±  655.922  ops/s
[info] ArrayOfDoublesWriting.zioJson                   128  thrpt    5  126197.585 ±  863.407  ops/s

After openjdk/jdk#3402

[info] Benchmark                                    (size)   Mode  Cnt       Score       Error  Units
[info] ArrayOfDoublesWriting.avSystemGenCodec          128  thrpt    5  134274.744 ±  1186.555  ops/s
[info] ArrayOfDoublesWriting.borer                     128  thrpt    5  135609.524 ±  3682.101  ops/s
[info] ArrayOfDoublesWriting.circe                     128  thrpt    5  140578.748 ±   599.352  ops/s
[info] ArrayOfDoublesWriting.circeJsoniter             128  thrpt    5  218366.982 ±  2461.686  ops/s
[info] ArrayOfDoublesWriting.dslJsonScala              128  thrpt    5  101039.519 ±  2052.939  ops/s
[info] ArrayOfDoublesWriting.jacksonScala              128  thrpt    5  151821.728 ±  1679.314  ops/s
[info] ArrayOfDoublesWriting.jsoniterScala             128  thrpt    5  287645.795 ±  8382.984  ops/s
[info] ArrayOfDoublesWriting.jsoniterScalaPrealloc     128  thrpt    5  308345.371 ± 11930.089  ops/s
[info] ArrayOfDoublesWriting.ninnyJson                 128  thrpt    5  215322.691 ±  2858.767  ops/s
[info] ArrayOfDoublesWriting.playJson                  128  thrpt    5   23084.133 ±   499.170  ops/s
[info] ArrayOfDoublesWriting.playJsonJsoniter          128  thrpt    5   68504.541 ±  1246.185  ops/s
[info] ArrayOfDoublesWriting.smithy4sJson              128  thrpt    5  239066.484 ±  3021.682  ops/s
[info] ArrayOfDoublesWriting.sprayJson                 128  thrpt    5   55431.626 ±   999.962  ops/s
[info] ArrayOfDoublesWriting.uPickle                   128  thrpt    5  141348.749 ±  1193.024  ops/s
[info] ArrayOfDoublesWriting.weePickle                 128  thrpt    5  149880.063 ±  2034.722  ops/s
[info] ArrayOfDoublesWriting.zioJson                   128  thrpt    5  126290.331 ±  1874.128  ops/s

cowtowncoder · 2022-06-07T15:51:00Z

Thank you for sharing @plokhotnyuk. Good to know JDK is improving as well.

cowtowncoder · 2022-06-22T03:31:07Z

Ok. Thanks to @pjfanning we now have improved floating-point parsing functionality; the changes are scattered (alas) across a few merges, the purpose of which was to make it easier to merge to 3.0.

PRs include #747 and #766 for anyone interested in details.
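
For readers landing here, a minimal sketch of opting in to the feature named in the issue title, assuming jackson-core 2.14 or later on the classpath. The class name and the sample JSON are made up for illustration; the feature is off unless explicitly enabled.

```java
import com.fasterxml.jackson.core.JsonFactory;
import com.fasterxml.jackson.core.JsonParser;
import com.fasterxml.jackson.core.StreamReadFeature;

public class FastDoubleParserExample {
    public static void main(String[] args) throws Exception {
        // Opt in to the faster double parsing on the factory.
        JsonFactory factory = JsonFactory.builder()
                .enable(StreamReadFeature.USE_FAST_DOUBLE_PARSER)
                .build();

        // Sample input (assumption): numbers with and without a decimal point.
        try (JsonParser parser = factory.createParser("[1.5, 2e10, 3]")) {
            while (parser.nextToken() != null) {
                if (parser.currentToken().isNumeric()) {
                    System.out.println(parser.getDoubleValue());
                }
            }
        }
    }
}
```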

cowtowncoder mentioned this issue Nov 14, 2019

Deserializing BigDecimal using JsonNode loses precision FasterXML/jackson-databind#2087

Closed

vy mentioned this issue Dec 29, 2019

Add JsonGenerator#writeNumber(char[], int, int) method #587

Closed

cowtowncoder added the performance Issue related to performance problems or enhancements label Aug 23, 2020

cowtowncoder added good first issue Issue that seems easy to resolve and is likely a good candidate for contributors new to project hacktoberfest Issue related to Hactoberfest2020 activities, eligible for additional rewards labels Oct 9, 2020

cowtowncoder assigned anneloreegger Oct 10, 2020

ferenc-csaky mentioned this issue Feb 2, 2021

Introduce O(n^1.5) BigDecimal parser implementation #677

Merged

cowtowncoder changed the title ~~Consider number-decoding improvements from jsoniter (esp. for double/float, BigInteger, BigDecimal)~~ Consider number-decoding improvements from jsoniter (esp. for double/float, BigInteger) Feb 5, 2021

cowtowncoder removed the hacktoberfest Issue related to Hactoberfest2020 activities, eligible for additional rewards label Feb 11, 2021

LifeIsStrange mentioned this issue Mar 22, 2021

Integrating it as the default parser in openjdk wrandelshofer/FastDoubleParser#3

Closed

cowtowncoder changed the title ~~Consider number-decoding improvements from jsoniter (esp. for double/float, BigInteger)~~ Consider number-decoding improvements from jsoniter or Lemire's "fast double parser" (esp. for double/float, BigInteger) Mar 27, 2021

cowtowncoder assigned abc12345678912345 and unassigned anneloreegger Sep 7, 2021

victornoel mentioned this issue Sep 28, 2021

Avoid the unnecessary UPDATE for JsonNode entity mappings vladmihalcea/hypersistence-utils#348

Closed

pjfanning mentioned this issue Mar 31, 2022

Change minimum Java version to 8 #745

Merged

pjfanning mentioned this issue Apr 21, 2022

Add NumberInput.parseFloat() #753

Closed

cowtowncoder changed the title ~~Consider number-decoding improvements from jsoniter or Lemire's "fast double parser" (esp. for double/float, BigInteger)~~ Improve parsing of floating-point numbers Jun 22, 2022

cowtowncoder added 2.14 Issue planned (at earliest) for 2.14 and removed good first issue Issue that seems easy to resolve and is likely a good candidate for contributors new to project labels Jun 22, 2022

cowtowncoder added this to the 2.14.0 milestone Jun 22, 2022

cowtowncoder changed the title ~~Improve parsing of floating-point numbers~~ Improve performant of floating-point number parsing Jun 22, 2022

cowtowncoder changed the title ~~Improve performant of floating-point number parsing~~ Improve performance of floating-point number parsing Jun 22, 2022

cowtowncoder closed this as completed Jun 22, 2022

cowtowncoder changed the title ~~Improve performance of floating-point number parsing~~ Improve performance of floating-point number parsing (Schubfach) Jun 24, 2022

This was referenced Jun 24, 2022

FastDoubleParser doesn't support all input formats as the default OpenJDK Float/Double parsers #778

Closed

FastDoubleParser doesn't support all input formats as the default OpenJDK Float/Double parsers wrandelshofer/FastDoubleParser#19

Closed

cowtowncoder changed the title ~~Improve performance of floating-point number parsing (Schubfach)~~ Allow use of faster floating-point number parsing (Schubfach) with StreamReadFeature.USE_FAST_DOUBLE_PARSER Jul 28, 2022

cowtowncoder mentioned this issue Jul 30, 2022

Optimize TextBuffer.contentsAsDouble() #346

Closed

ChrisHegarty mentioned this issue Nov 7, 2022

Upgrade XContent to Jackson 2.14.0 and enable Fast Double Parser elastic/elasticsearch#90553

Merged

rudygt mentioned this issue Nov 8, 2022

use fastdoubleparser to improve double parsing speed aws/event-ruler#54

Merged

cowtowncoder mentioned this issue Apr 2, 2023

Measure performance improvement by "fast double parsing" #970

Closed

wrandelshofer mentioned this issue Apr 28, 2023

Please bundle LICENSE/NOTICE files in the produced jar files wrandelshofer/FastDoubleParser#38

Closed

vlsi mentioned this issue Apr 28, 2023

Clarify license implications for those who make custom jackson-core builds #1002

Closed

pjfanning added a commit to pjfanning/jackson-core that referenced this issue Apr 28, 2023

adjust license for jackson 2.14 - see FasterXML#577 (comment)

f37e0ca

pjfanning mentioned this issue Apr 28, 2023

FastDoubleParser license (v2.14 branch) #1004

Merged

reta mentioned this issue Jun 6, 2023

Enable Fast Double Parser in Jackson opensearch-project/OpenSearch#7909

Merged
