Introduce `_VarInfo` internally to reduce memory footprint in value propagation #189

neNasko1 · 2024-11-06T15:54:25Z

Fixes #187 by making the current user-facing Var a wrapper around a helper class VarInfo, so that a Var is a pair (VarInfo, PropValue).

The idea is to have VarInfo be the object that is passed around inside of all functions and be pointed to by all Node-s, so concrete propagated values can be garbage collected correctly.
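A minimal sketch of the idea described above, not the actual Spox implementation. All class names here (`Payload`, `VarInfoSketch`, `NodeSketch`, `VarSketch`) are hypothetical stand-ins: the node keeps only the metadata object alive, so the propagated value can be freed once the user-facing wrapper is gone. The immediate collection shown relies on CPython's reference counting.

```python
from __future__ import annotations

import gc
import weakref
from dataclasses import dataclass, field
from typing import Optional


class Payload:
    """Stand-in for a large propagated value (e.g. a constant array)."""

    def __init__(self, data: list[float]) -> None:
        self.data = data


@dataclass(eq=False)
class VarInfoSketch:
    """Graph-level metadata only; this is what nodes point to."""

    type_name: str
    name: Optional[str] = None


@dataclass(eq=False)
class NodeSketch:
    """A node references metadata objects, never propagated values."""

    op: str
    inputs: list[VarInfoSketch] = field(default_factory=list)


@dataclass(eq=False)
class VarSketch:
    """User-facing pair (VarInfo, PropValue)."""

    var_info: VarInfoSketch
    value: Optional[Payload] = None


def build() -> tuple[NodeSketch, weakref.ref]:
    info = VarInfoSketch("float32[3]")
    var = VarSketch(info, Payload([1.0, 2.0, 3.0]))
    # Only the metadata is handed to the node.
    node = NodeSketch("Identity", inputs=[var.var_info])
    # Track the value weakly so we can observe when it is freed.
    return node, weakref.ref(var.value)


node, value_ref = build()
gc.collect()
value_was_freed = value_ref() is None
```

Here the node still holds its input metadata after `build()` returns, while nothing keeps the payload alive any longer.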

Checklist for implementation

Provide better typing around input_prop_values dictionary
Fix adapt_node
Fix documentation
Add a CHANGELOG.rst entry
Fix Var-s passed to manual inference
Make tests pass

Breaking changes checklist

Examine the performance implications?
Examine downstream repos: does the change break anything we rely on?

cbourjau

Thanks for taking this on, and great work already getting the tests to pass! I am trying to grok the details of the implementation. While doing so I already left a few high-level comments. I am not quite convinced that the use of a metaclass is buying us much with respect to the complexity/scariness it introduces. Would you mind adding type hints for its use and/or pointing out where it would provide a mypy-testable benefit compared to a simpler/dynamic structure such as a dict[str, Var]?

src/spox/_exceptions.py

src/spox/_node.py

src/spox/_fields.py

src/spox/opset/ai/onnx/ml/v3.py

src/spox/_node.py

jbachurski · 2024-11-09T01:52:40Z

Are you sure there isn’t a cleaner way of doing this? Or perhaps some simpler hack. It feels like a significant change.
I wonder if your standpoint on value propagation changed, Christian @cbourjau - it was mostly intended as an experimental side-feature to improve experimentation back when I added it, not something that should be relied on in live systems. So I’m not sure a large change is worth it to improve the GC behaviour.
I’m afraid I won’t be able to help review this as I’m pretty starved on time.

cbourjau · 2024-11-09T10:33:28Z

Thanks for your feedback @jbachurski ! Let me provide a little more context.

> I wonder if your standpoint on value propagation changed, Christian?

Yes, it did indeed! ndonnx makes it very easy to try out NumPy code with constant arrays. So much so that it is perfectly reasonable to quickly convert large NumPy arrays into ndonnx ones and to throw them at our code. While technically still a "debugging" feature, it is now used very commonly during development and on very large graphs. The value propagation in ndonnx is currently re-implemented there, but I believe it would be cleaner to do it properly here in Spox on the operator level.

jbachurski · 2024-11-09T22:30:06Z

> While technically still a "debugging" feature, it is now used very commonly during development and on very large graphs. The value propagation in ndonnx is currently re-implemented there, but I believe it would be cleaner to do it properly here in Spox on the operator level.

Curious. Yes, I noticed that it was early on and found it surprising, but I didn't want to question it :)
I imagine it would be cleaner like so indeed – good luck 🍀

I'll let you know in case I come up with something accidentally but yeah, it's unfortunately an extremely busy period for me to think over this design aspect.

src/spox/_var.py

cbourjau

I'm afraid running this PR against this branch in ndonnx (which makes extensive use of Spox's value propagation) gives rise to a lot of warnings of the type:

spox._exceptions.InferenceWarning:
 Output type for variable expanded of ai.onnx@21::Unsqueeze was not concrete - ValueError: Tensor int32[...] does not specify the shape -- in ?.

These warnings do not appear when running on spox from the main branch.

You can make pytest fail on these warnings in the ndonnx test suite by running it as follows:

pytest tests -W="error:Output type for variable"

Do you think you can take a look into what is going on here?
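For reference, the `-W` option above uses Python's standard warning-filter syntax (`action:message`). A rough in-process equivalent is sketched below. The `InferenceWarning` class and `infer_output_type` function are local stand-ins defined only for this example (the real warning class is `spox._exceptions.InferenceWarning`), and the message text is shortened.

```python
import warnings


class InferenceWarning(Warning):
    """Local stand-in for spox._exceptions.InferenceWarning."""


def infer_output_type() -> None:
    # Hypothetical function emitting a warning shaped like the one reported.
    warnings.warn(
        "Output type for variable expanded of ai.onnx@21::Unsqueeze "
        "was not concrete",
        InferenceWarning,
    )


def raises_under_filter() -> bool:
    with warnings.catch_warnings():
        # Same action and message prefix as -W="error:Output type for variable".
        warnings.filterwarnings("error", message="Output type for variable")
        try:
            infer_output_type()
        except InferenceWarning:
            return True
    return False


escalated = raises_under_filter()
```

With the filter active, the warning is raised as an exception instead of being printed, which is what makes the test run fail.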

src/spox/_var.py

src/spox/_standard.py

neNasko1 · 2024-11-28T00:15:27Z

> I'm afraid running this PR against this branch in ndonnx (which makes extensive use of Spox's value propagation) gives rise to a lot of warnings of the type:
>
> spox._exceptions.InferenceWarning:
>  Output type for variable expanded of ai.onnx@21::Unsqueeze was not concrete - ValueError: Tensor int32[...] does not specify the shape -- in ?.
>
> These warnings do not appear when running on spox from the main branch.
>
> You can make pytest fail on these warnings in the ndonnx test suite by running it as follows:
>
> pytest tests -W="error:Output type for variable"
>
> Do you think you can take a look into what is going on here?

This is caused by the fact that validate_types was being called before actually performing the value propagation.

Currently this type validation is "optional"; however, AFAIK it is not really optional, as validate is not exposed publicly. Should we deprecate this option?
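The ordering issue can be illustrated with a toy sketch. Everything here is hypothetical: `OutputSketch`, `propagate_values`, and this `validate_types` are simplified stand-ins, and the sketch assumes, purely for illustration, that propagation is what makes the output's shape concrete.

```python
from __future__ import annotations

import warnings
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass
class OutputSketch:
    """Hypothetical output whose shape is unknown until values are propagated."""

    shape: Optional[tuple] = None


def propagate_values(out: OutputSketch) -> None:
    # Assumption for this sketch: propagation yields a concrete value,
    # from which a concrete shape can be read off.
    out.shape = (1, 3)


def validate_types(out: OutputSketch) -> None:
    # Stand-in validation: warn when the shape is still unknown.
    if out.shape is None:
        warnings.warn("Output type for variable was not concrete", UserWarning)


def count_warnings(steps: list[Callable[[OutputSketch], None]]) -> int:
    out = OutputSketch()
    with warnings.catch_warnings(record=True) as caught:
        warnings.simplefilter("always")
        for step in steps:
            step(out)
    return len(caught)


# Validating first sees the not-yet-concrete output and warns.
warnings_when_validating_first = count_warnings([validate_types, propagate_values])
# Validating after propagation sees a concrete shape and stays silent.
warnings_when_validating_last = count_warnings([propagate_values, validate_types])
```

Under these assumptions the first ordering produces one warning and the second produces none, which matches the behaviour difference reported between this PR and main.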

cbourjau

Thanks for looking into the warnings on the ndonnx side! I think this PR is converging :). Could you add type hints to all new functions please?

src/spox/_fields.py

src/spox/_node.py

src/spox/_fields.py

cbourjau

Very nice! I have one very tiny nitpick, and then this is ready! Thank you very much!

CHANGELOG.rst

src/spox/_value_prop.py

Co-authored-by: Christian Bourjau <[email protected]>

neNasko1 added 15 commits October 27, 2024 18:48

Init

539ca29

Run linter

b6aa765

Changes

1d6bcbc

Fix ml

3c8e9b1

Fix some tests

096a4f4

Fix some tests

e29a920

Fix more tests

bfaed79

Add some proper typing

49e366c

More initializers

f5af5d9

Fix passing

27c562e

Minor fixes and linter

50e828b

Change initializers name to input_prop_values

d6b59ca

Make tests passing

340d4c2

Hacky fix mypy

3d77a87

Correctly codegen

9060d12

cbourjau requested changes Nov 8, 2024

neNasko1 added 7 commits November 15, 2024 18:20

Comments after code review

6aa1bf4

Improve type checking

07f9676

Pre-commit enable

df0bee9

Update documentation

02e36ac

Fix adapt node

9488f6a

Fix function inputs passing

c23086b

Fix opset generation

1aebc1a

neNasko1 requested a review from cbourjau November 20, 2024 00:20

cbourjau reviewed Nov 20, 2024

src/spox/_var.py

neNasko1 added 2 commits November 20, 2024 15:56

Hint that _VarInfo is private

67d5b3b

Merge branch 'main' into split-value-prop

c9de7ec

cbourjau reviewed Nov 27, 2024

src/spox/_var.py (outdated)

src/spox/_standard.py (outdated)

src/spox/_standard.py (outdated)

Comments after code review

63be89b

Move validation to after propagation

0d5e2c8

neNasko1 requested a review from cbourjau November 28, 2024 00:26

neNasko1 added 2 commits November 28, 2024 14:04

Fix diff

d14e300

Fix diffs

e33b00e

cbourjau requested changes Nov 29, 2024

src/spox/_fields.py

src/spox/_node.py (outdated)

src/spox/_fields.py

src/spox/_fields.py (outdated)

neNasko1 added 13 commits December 4, 2024 14:01

Merge branch 'main' into split-value-prop

b9f922c

Improve type-hinting information

fdf81a3

Remove unneded functions

cfae394

Init

19d7ebb

fix

b9cb099

Final fixes

756f274

Add test for propagation of optional var

ac8807e

Unify logic around VarInfos -> Var

e5c311f

Add comment

7f87559

Merge with main

1b03440

Improve qol

8e5d25a

Update CHANGELOG.rst

4215495

Fix tools/generate

fdb89cf

neNasko1 requested a review from cbourjau December 9, 2024 02:47

cbourjau approved these changes Dec 10, 2024

CHANGELOG.rst (outdated)

src/spox/_value_prop.py (outdated)

neNasko1 and others added 3 commits December 10, 2024 11:09

Update CHANGELOG.rst

07a8f91

Co-authored-by: Christian Bourjau <[email protected]>

Merge branch 'main' into split-value-prop

6360d61

Comments after code-review

40b8b87

neNasko1 changed the title ~~Split value prop~~ Introduce _VarInfo internally to reduce memory footprint in value propagation Dec 10, 2024

neNasko1 mentioned this pull request Dec 10, 2024

Value propagation may produce a large memory footprint #187

Closed

neNasko1 merged commit 320d57f into main Dec 10, 2024
18 checks passed

neNasko1 deleted the split-value-prop branch December 10, 2024 10:03
