perf(NODE-5934): replace DataView uses with bit math #649
Conversation
Force-pushed from 05b9dec to 6e70471.
Running on: Node.js v20.11.0
OS: linux
CPUs: 8
Arch: x64
RAM: 33.105743872 GB

64-bit floats show improvements; tested with a single value and with 1000 values:

```
=================== testing with { _id: 120.384 } bytes 18
do_nothing x 852,087,149 ops/sec ±0.04% (396 runs sampled)
bson_deserialize_current_main x 3,509,120 ops/sec ±0.11% (395 runs sampled)
bson_deserialize_bits_pr x 6,246,320 ops/sec ±0.39% (391 runs sampled)
bson_serialize_current_main x 2,481,998 ops/sec ±0.39% (391 runs sampled)
bson_serialize_bits_pr x 2,691,455 ops/sec ±0.14% (393 runs sampled)

=================== testing with an object with 1000 keys all set to 120.384, bytes 12895
do_nothing x 852,260,397 ops/sec ±0.04% (396 runs sampled)
bson_deserialize_current_main x 15,682 ops/sec ±0.08% (395 runs sampled)
bson_deserialize_bits_pr x 15,014 ops/sec ±0.06% (395 runs sampled)
bson_serialize_current_main x 10,893 ops/sec ±0.22% (395 runs sampled)
bson_serialize_bits_pr x 16,187 ops/sec ±0.11% (395 runs sampled)
```

64-bit integers also improved:

```
=================== testing with { _id: -120n } bytes 18
do_nothing x 852,427,659 ops/sec ±0.04% (396 runs sampled)
bson_deserialize_current_main x 2,602,237 ops/sec ±0.09% (394 runs sampled)
bson_deserialize_bits_pr x 4,762,447 ops/sec ±0.17% (395 runs sampled)
bson_serialize_current_main x 2,382,100 ops/sec ±0.14% (393 runs sampled)
bson_serialize_bits_pr x 2,547,732 ops/sec ±0.12% (394 runs sampled)

=================== testing with an object with 1000 keys all set to -120n, bytes 12895
do_nothing x 851,099,356 ops/sec ±0.06% (396 runs sampled)
bson_deserialize_current_main x 3,764 ops/sec ±0.07% (395 runs sampled)
bson_deserialize_bits_pr x 9,259 ops/sec ±0.12% (394 runs sampled)
bson_serialize_current_main x 10,604 ops/sec ±0.10% (395 runs sampled)
bson_serialize_bits_pr x 16,260 ops/sec ±0.12% (395 runs sampled)
```
Force-pushed from 6e70471 to 4d69b12.
Looks freaking good! I have a crazy load of work right now, but I was thinking of another possible improvement: we use `ByteUtils.allocate` in multiple places where we could use the faster `allocateUnsafe` function, since we immediately set all the bytes. I don't know how much improvement it would make, but it's safe to use in cases like this:

```js
const bytes = ByteUtils.allocate(16);
// Copy the next 16 bytes into the bytes buffer
bytes.set(buffer.subarray(index, index + 16), 0);
```

Or:

```js
} else if (elementType === constants.BSON_DATA_OID) {
  const oid = ByteUtils.allocate(12);
  oid.set(buffer.subarray(index, index + 12));
  value = new ObjectId(oid);
  index = index + 12;
}
```
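For context, the difference this suggestion relies on can be sketched with Node's own `Buffer` API (an illustration, not the library's actual `ByteUtils` implementation): `allocUnsafe` skips zero-filling the new buffer, which is only safe when every byte is overwritten before the buffer is read, as it is here. The helper name is hypothetical.

```javascript
// Sketch: Buffer.allocUnsafe returns uninitialized memory, skipping the
// zero-fill that Buffer.alloc performs. That is safe only when the caller
// overwrites every byte immediately, as when copying an ObjectId's 12 bytes.
// Node.js only; there is no web-platform equivalent of allocUnsafe.
function copyObjectIdBytes(buffer, index) {
  const oid = Buffer.allocUnsafe(12); // uninitialized 12 bytes
  oid.set(buffer.subarray(index, index + 12)); // every byte overwritten here
  return oid;
}
```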
@billouboq Good idea, I'll file a ticket about it. I do wish there was a web equivalent, but we can incorporate it for Node at least.
Force-pushed from 8d99e9b to 46b5060.
LGTM
Description
What is changing?
We keep a Float64Array and a Uint8Array referencing the same ArrayBuffer. With eight lines of assignments we can copy the bytes from the input BSON into the Uint8Array, then use the Float64Array to interpret them as a little-endian double.
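The shared-view technique can be sketched as follows. This is an illustrative version, not the library's exact code, and it assumes a little-endian platform (the byte order BSON uses for doubles):

```javascript
// Both views alias the same 8 bytes: writing bytes through FLOAT_BYTES
// changes what FLOAT[0] reads, and vice versa. No DataView is needed.
const FLOAT = new Float64Array(1);
const FLOAT_BYTES = new Uint8Array(FLOAT.buffer, 0, 8);

// Deserialize direction: interpret 8 little-endian bytes as a double.
// Assumes the host is little-endian, matching BSON's on-the-wire order.
function bytesToDouble(buffer, offset) {
  FLOAT_BYTES[0] = buffer[offset];
  FLOAT_BYTES[1] = buffer[offset + 1];
  FLOAT_BYTES[2] = buffer[offset + 2];
  FLOAT_BYTES[3] = buffer[offset + 3];
  FLOAT_BYTES[4] = buffer[offset + 4];
  FLOAT_BYTES[5] = buffer[offset + 5];
  FLOAT_BYTES[6] = buffer[offset + 6];
  FLOAT_BYTES[7] = buffer[offset + 7];
  return FLOAT[0];
}
```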
For bigints we can split the value into two 32-bit halves and use JavaScript's truncation behavior to pull out the bytes as we shift down through each segment of the number.
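The bigint half of the change can be sketched like this; `int64ToBytes` is a hypothetical name for illustration, not the library's actual helper:

```javascript
// Sketch: serialize a 64-bit BigInt as 8 little-endian bytes by splitting
// it into two unsigned 32-bit halves, then peeling off one byte at a time
// with shifts and masks instead of DataView.setBigInt64.
function int64ToBytes(value) {
  const masked = BigInt.asUintN(64, value); // two's-complement wrap to u64
  const lo = Number(masked & 0xffffffffn);  // low 32 bits as a Number
  const hi = Number(masked >> 32n);         // high 32 bits as a Number
  return Uint8Array.of(
    lo & 0xff, (lo >>> 8) & 0xff, (lo >>> 16) & 0xff, (lo >>> 24) & 0xff,
    hi & 0xff, (hi >>> 8) & 0xff, (hi >>> 16) & 0xff, (hi >>> 24) & 0xff
  );
}
```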
Is there new documentation needed for these changes?
No
What is the motivation for this change?
Creating DataViews has an impact on performance, and we can easily avoid using them altogether. Generally, we see better performance the more of this logic we write directly in JavaScript.
Benchmarks
See below
Release Highlight
Improved the performance of serializing and deserializing doubles and bigints
We now use bit shifting and multiplication in place of DataView getX/setX calls to parse and serialize bigints, and a Float64Array to convert a double to bytes. This change has been shown to increase deserialization performance ~1.3x and serialization performance ~1.75x.
Double check the following
- Ran the `npm run check:lint` script
- PR title follows the format `type(NODE-xxxx)[!]: description`
  - Example: `feat(NODE-1234)!: rewriting everything in coffeescript`