perf(NODE-5557): move DataView and Set allocation used for double parsing and utf8 validation to nested path #611

billouboq · 2023-08-20T14:02:39Z

Description

This is a small performance improvement that has no impact on the code's behaviour.

Things could be improved a little more by changing the UTF-8 validation option, but that is more work; I will see if I can test it later.

What is changing?

Remove unneeded Set and DataView creation.

Also added a new benchmark with a more "natural" object to serialize

Is there new documentation needed for these changes?

No documentation changes needed

What is the motivation for this change?

Increasing deserialization performance

Release Highlight

Deserialization performance increased

If BSON data does not contain Doubles and UTF-8 validation is disabled, the deserializer is careful not to allocate the data structures needed to support that functionality. This has been shown to greatly increase (1.3x-2x) the performance of the deserializer.
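A minimal sketch of the idea, not the actual js-bson source: the Set used for per-key UTF-8 validation can be created only on the branch that needs it. The names `Utf8Validation`, `ValidationPlan`, and `planUtf8Validation` are hypothetical and exist only for this illustration; the option shape is assumed from the `validation: { utf8 }` option used elsewhere in this thread.

```typescript
// Hypothetical option shape: either a global on/off flag or per-key settings.
type Utf8Validation = boolean | Record<string, boolean>;

interface ValidationPlan {
  // True when every string should be validated.
  validateAll: boolean;
  // Only allocated when per-key settings are supplied; null otherwise.
  keys: Set<string> | null;
}

function planUtf8Validation(utf8: Utf8Validation): ValidationPlan {
  if (typeof utf8 === 'boolean') {
    // Global on/off: no per-key lookup is needed, so no Set is created.
    return { validateAll: utf8, keys: null };
  }
  // Per-key settings: the Set is allocated on this nested path only.
  return { validateAll: false, keys: new Set(Object.keys(utf8)) };
}
```

With `utf8: false` or `utf8: true` the call returns without allocating a Set, which is the case this change targets.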

Thank you @billouboq for this contribution!

Double check the following

Ran npm run check:lint script
Self-review completed using the steps outlined here
PR title follows the correct format: type(NODE-xxxx)[!]: description
- Example: feat([NODE-1234](https://jira.mongodb.org/browse/NODE-1234))!: rewriting everything in coffeescript
Changes are covered by tests
New TODOs have a related JIRA ticket

src/parser/deserializer.ts

nbbeeken

Hey @billouboq thanks so much for contributing! I have a couple questions for you and some cleanup to get this closer to ready

Can you share some performance numbers you're observing after this change?

etc/benchmarks/main.mjs

package.json

src/parser/deserializer.ts

billouboq · 2023-08-23T09:58:16Z

@nbbeeken I just updated the code and added a new benchmark that is really close to my real-world usage.

I ran some benchmarks; here are the mean durations:

- No changes: 548.269 - 555
- Remove Set creations: 543 - 545
- Remove DataView creation: 499 - 502
- All changes: 480 (around 13.6% improvement, might be even greater with changes from that PR: #615)

billouboq · 2023-09-19T18:26:12Z

@W-A-James Hello, any chance this one gets merged?

W-A-James · 2023-09-21T17:31:12Z

Hi @billouboq, apologies for the long turnaround time on this PR. We are currently in the middle of reworking how we do performance testing for js-bson (see NODE-3654). As a result, we're putting a pause on re-reviewing and potentially merging this and other potential performance improvements until that work is done. At that point, with more robust benchmarking infrastructure in place, we will reassess this PR and come to a decision.

Thanks for all your work on this repo.

billouboq · 2023-12-27T09:16:14Z

Hello @W-A-James, do you have any news about this?

It seems people are having performance issues with the new MongoDB driver:
Automattic/mongoose#13456 (comment)

billouboq · 2024-02-11T19:20:36Z

Hello, I would like to know what's holding us back from using that optimisation?

Is there something I can do to make it pass?

nbbeeken · 2024-02-12T15:54:53Z

Hi @billouboq, thanks for reaching out, we recently were unblocked on this (see NODE-5557) with the completion of our suite of BSON benchmarks that Warren mentioned. Now we can see the effect of this change per data type and collectively on mixed documents. Do you think your document represents a case unique enough to be separate from the bestbuy test case? If so we can include this document as its own test.

I am currently working on performance measurement and improvements, so I will be taking another look at this soon. I am interested in seeing the results of the double-focused tests we have; they should reveal whether this change causes a delay due to GC of the DataView on every iteration.

I am thinking we can split the difference here by storing the dataview on a let variable when we create it so that it can be shared after the first double creates it.

billouboq · 2024-02-13T11:04:34Z

Thanks for the update

> Do you think your document represents a case unique enough to be separate from the bestbuy test case? If so we can include this document as its own test.

I created a test that represents something close to what I could use in a production environment, so with lots of different types of values. We can remove it; it was more a test to check how much improvement I could get in our own environment.

> I am thinking we can split the difference here by storing the dataview on a let variable when we create it so that it can be shared after the first double creates it.

That might be a good idea indeed!

Don't hesitate to tell me what I should change, or you can close this PR and make a new one based on my code. Up to you.

nbbeeken · 2024-02-13T15:26:53Z

Awesome thanks for sticking with this after so much time :) Could you rebase this and move the dataview variable back up to the top as a let, and the double code path can use ??= to only create it once?
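A minimal sketch of the requested pattern, assuming a simplified reader rather than the real deserializer: the `DataView` is declared with `let` and left undefined, and the double path uses `??=` so it is created at most once per call. `readDoubles`, `makeView`, and `dataViewAllocations` are hypothetical names used only for this illustration.

```typescript
// Counter used only to make the allocation behaviour observable in this sketch.
let dataViewAllocations = 0;

function makeView(bytes: Uint8Array): DataView {
  dataViewAllocations += 1;
  return new DataView(bytes.buffer, bytes.byteOffset, bytes.byteLength);
}

// Reads a little-endian 64-bit float at each offset, as BSON doubles are stored.
function readDoubles(bytes: Uint8Array, offsets: readonly number[]): number[] {
  // Declared up front, but not allocated until a double is actually read.
  let dataView: DataView | undefined;
  const values: number[] = [];
  for (const offset of offsets) {
    // Created by the first double, then shared by every later one.
    dataView ??= makeView(bytes);
    values.push(dataView.getFloat64(offset, true));
  }
  return values;
}
```

A call with no offsets never creates a `DataView`, and a call with many offsets creates exactly one.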

I will handle running the benchmarks on my end and report back the results, TIA!

billouboq · 2024-02-13T18:19:59Z

I just have a few small possible improvements to test first, based on the same kind of changes as in that PR. I will rebase after testing them.

billouboq · 2024-02-13T21:58:27Z

@nbbeeken I rebased and pushed only the needed changes 👍

Also, I do not have the time to dig further, but I feel like we could improve date deserialisation performance by skipping the creation of a "new Long" and instead creating a small function that takes lowBits and highBits and does only this part, without the "this" context:

  // taken from Long class file
  toNumber(): number {
    if (this.unsigned) return (this.high >>> 0) * TWO_PWR_32_DBL + (this.low >>> 0);
    return this.high * TWO_PWR_32_DBL + (this.low >>> 0);
  }

That way we avoid creating a new Long object just for the sake of converting it directly.

But it should be benchmarked to make sure it's worth it.
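A rough sketch of what such a helper could look like, following the signed branch of the `toNumber` snippet quoted above. `int64BitsToNumber` and `dateFromInt64Bits` are hypothetical names, and `TWO_PWR_32_DBL` is assumed here to be 2^32, as its name suggests; this has not been benchmarked.

```typescript
// Assumed value of the constant referenced in the Long snippet above.
const TWO_PWR_32_DBL = 2 ** 32;

// Combines the low and high 32-bit halves of a signed int64 into a number
// without allocating a Long. Results beyond 2^53 in magnitude lose precision.
function int64BitsToNumber(lowBits: number, highBits: number): number {
  // The low half is treated as unsigned; the high half carries the sign.
  return highBits * TWO_PWR_32_DBL + (lowBits >>> 0);
}

// BSON stores a date as a signed int64 of milliseconds since the Unix epoch.
function dateFromInt64Bits(lowBits: number, highBits: number): Date {
  return new Date(int64BitsToNumber(lowBits, highBits));
}
```

The arithmetic is the same as in the quoted method; the only difference is that the two halves arrive as plain arguments instead of fields on an object.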

nbbeeken

Thanks @billouboq for the help! LGTM

// bytes: { _id: new bson.Int32(2) }
bson.deserialize(bytes, {validation: {utf8:false}})

do_nothing x 851,290,397 ops/sec ±0.05% (196 runs sampled)
bson_deserialize_current x 3,191,767 ops/sec ±0.20% (193 runs sampled)
bson_deserialize_nested_dv x 6,522,438 ops/sec ±0.15% (194 runs sampled)

I ran our granular (per BSON type) benchmarks and I see numbers corroborating this: for non-doubles we get way better performance; it looks like DataView is not too great. I will file follow-up tickets for your suggestion about Dates, and I bet we can get Doubles to be even better if we use the readDoubleLE API that Node.js offers (when running in Node.js).

Filed:

aditi-khare-mongoDB

LGTM, thank you @billouboq!

addaleax reviewed Aug 21, 2023

nbbeeken changed the title ~~Improve deserialisation performance~~ fix(NODE-5557): improve deserialization performance by moving allocations to nested paths Aug 21, 2023

nbbeeken requested changes Aug 21, 2023

billouboq force-pushed the perf-testing-2 branch from 031fc4f to d6d8979 Compare August 25, 2023 16:03

W-A-James added the Blocked label (Blocked on other work) Sep 21, 2023

nbbeeken removed the Blocked label (Blocked on other work) Feb 12, 2024

improve deserialization performance by moving allocations to nested paths

a2e03d1

billouboq force-pushed the perf-testing-2 branch from d6d8979 to a2e03d1 Compare February 13, 2024 21:53

Merge branch 'main' into perf-testing-2

a7cbf6f

billouboq mentioned this pull request Feb 13, 2024

Why do Mongoose v6 and v7 cause a significant increase in garbage collections and event loop latency compared to v5? Automattic/mongoose#13456

Closed

2 tasks

nbbeeken self-assigned this Feb 14, 2024

nbbeeken added the Team Review label (Needs review from team) Feb 14, 2024

nbbeeken approved these changes Feb 14, 2024

nbbeeken changed the title ~~fix(NODE-5557): improve deserialization performance by moving allocations to nested paths~~ perf(NODE-5557): move DataView and Set allocation used for double parsing and utf8 validation to nested path Feb 14, 2024

aditi-khare-mongoDB self-requested a review February 14, 2024 20:13

aditi-khare-mongoDB approved these changes Feb 14, 2024

nbbeeken merged commit 9a150e1 into mongodb:main Feb 14, 2024
4 checks passed

github-actions bot mentioned this pull request Feb 14, 2024

chore(main): release 6.4.0 [skip-ci] #644

Merged

github-actions bot mentioned this pull request May 7, 2024

chore(main): release 7.0.0 [skip-ci] #687

Closed

boytur mentioned this pull request May 25, 2024

[Snyk] Upgrade bson from 6.3.0 to 6.7.0 boytur/server-posyayee-v1#51

Closed

boytur mentioned this pull request May 26, 2024

[Snyk] Upgrade bson from 6.3.0 to 6.7.0 boytur/server-posyayee-v1#54

Open
