
[WIP] Add framework for version converter API #1926

Open
wants to merge 8 commits into base: main

Conversation

shubhambhokare1
Contributor

No description provided.


@github-advanced-security bot left a comment


lintrunner found more than 20 potential problems in the proposed changes. Check the Files changed tab for more details.

)
model = ir.serde.deserialize_model(model_proto)
self.assertEqual(model.graph._nodes[4].op_type, "GridSample")
self.assertEqual(model.graph._nodes[4]._attributes['mode'].value, 'bilinear')
Collaborator

Suggested change
self.assertEqual(model.graph._nodes[4]._attributes['mode'].value, 'bilinear')
self.assertEqual(model.graph[4].attributes['mode'].value, 'bilinear')


codecov bot commented Oct 31, 2024

❌ 15 Tests Failed:

| Tests completed | Failed | Passed | Skipped |
| --- | --- | --- | --- |
| 14275 | 15 | 14260 | 1625 |
View the full list of 3 ❄️ flaky tests
tests.eager_mode_test.TestEagerModeArguments_0_reference_runtime test_function_input_and_attribute_by_kwargs_out_of_order

Flake rate in main: 38.05% (Passed 5976 times, Failed 3671 times)

Stack Traces | 0.002s run time
..../test_torch_nightly/lib/python3.12.../reference/ops/_op.py:91: in run
    res = self._run(x, y)
..../test_torch_nightly/lib/python3.12.../reference/ops/_op.py:139: in _run
    res = (convert_from_ml_dtypes(res[0]),)
..../test_torch_nightly/lib/python3.12.../onnx/reference/custom_element_types.py:50: in convert_from_ml_dtypes
    return array.view(dtype=dtype)
E   ValueError: Changing the dtype of a 0d array is only supported if the itemsize is unchanged

The above exception was the direct cause of the following exception:
tests/eager_mode_test.py:115: in test_function_input_and_attribute_by_kwargs_out_of_order
    self.assertEqual(add_with_alpha(alpha=3.0, other=2.0, this=1.0), 7.0)
onnxscript/values.py:576: in __call__
    return evaluator.default().eval_function(self, args, kwargs)
onnxscript/evaluator.py:307: in eval_function
    result = function.function(*adapted_args, **adapted_kwargs)
tests/eager_mode_test.py:59: in add_with_alpha
    other = op.Mul(other, alpha)
.../onnx_opset/_impl/opset14.py:696: in Mul
    return op(*self._prepare_inputs(schema, A, B))
onnxscript/values.py:304: in __call__
    return evaluator.default().eval(schema, args, kwargs)
onnxscript/evaluator.py:194: in eval
    outputs = self._eval(schema, inputs, attributes, closure)
onnxscript/evaluator.py:524: in _eval
    result = session.run(None, session_run_input)
..../test_torch_nightly/lib/python3.12.../onnx/reference/reference_evaluator.py:599: in run
    outputs = node.run(*inputs, **linked_attributes)
..../test_torch_nightly/lib/python3.12.../reference/ops/_op.py:114: in run
    res = OpRunBinary.run(self, x, y)
..../test_torch_nightly/lib/python3.12.../reference/ops/_op.py:93: in run
    raise TypeError(
E   TypeError: Issues with types <class 'numpy.ndarray'>, <class 'numpy.ndarray'> (binary operator 'Mul').
tests.eager_mode_test.TestEagerModeArguments_0_reference_runtime test_function_some_input_by_kwargs

Flake rate in main: 38.05% (Passed 5976 times, Failed 3671 times)

Stack Traces | 0.002s run time
..../test_torch_nightly/lib/python3.12.../reference/ops/_op.py:91: in run
    res = self._run(x, y)
..../test_torch_nightly/lib/python3.12.../reference/ops/_op.py:139: in _run
    res = (convert_from_ml_dtypes(res[0]),)
..../test_torch_nightly/lib/python3.12.../onnx/reference/custom_element_types.py:50: in convert_from_ml_dtypes
    return array.view(dtype=dtype)
E   ValueError: Changing the dtype of a 0d array is only supported if the itemsize is unchanged

The above exception was the direct cause of the following exception:
tests/eager_mode_test.py:106: in test_function_some_input_by_kwargs
    self.assertEqual(add_with_alpha(1.0, other=2.0), 3.0)
onnxscript/values.py:576: in __call__
    return evaluator.default().eval_function(self, args, kwargs)
onnxscript/evaluator.py:307: in eval_function
    result = function.function(*adapted_args, **adapted_kwargs)
tests/eager_mode_test.py:59: in add_with_alpha
    other = op.Mul(other, alpha)
.../onnx_opset/_impl/opset14.py:696: in Mul
    return op(*self._prepare_inputs(schema, A, B))
onnxscript/values.py:304: in __call__
    return evaluator.default().eval(schema, args, kwargs)
onnxscript/evaluator.py:194: in eval
    outputs = self._eval(schema, inputs, attributes, closure)
onnxscript/evaluator.py:524: in _eval
    result = session.run(None, session_run_input)
..../test_torch_nightly/lib/python3.12.../onnx/reference/reference_evaluator.py:599: in run
    outputs = node.run(*inputs, **linked_attributes)
..../test_torch_nightly/lib/python3.12.../reference/ops/_op.py:114: in run
    res = OpRunBinary.run(self, x, y)
..../test_torch_nightly/lib/python3.12.../reference/ops/_op.py:93: in run
    raise TypeError(
E   TypeError: Issues with types <class 'numpy.ndarray'>, <class 'numpy.ndarray'> (binary operator 'Mul').
tests.eager_mode_test.TestEagerModeArguments_0_reference_runtime test_function_all_input_by_kwargs

Flake rate in main: 38.05% (Passed 5976 times, Failed 3671 times)

Stack Traces | 0.003s run time
..../test_torch_nightly/lib/python3.12.../reference/ops/_op.py:91: in run
    res = self._run(x, y)
..../test_torch_nightly/lib/python3.12.../reference/ops/_op.py:139: in _run
    res = (convert_from_ml_dtypes(res[0]),)
..../test_torch_nightly/lib/python3.12.../onnx/reference/custom_element_types.py:50: in convert_from_ml_dtypes
    return array.view(dtype=dtype)
E   ValueError: Changing the dtype of a 0d array is only supported if the itemsize is unchanged

The above exception was the direct cause of the following exception:
tests/eager_mode_test.py:109: in test_function_all_input_by_kwargs
    self.assertEqual(add_with_alpha(this=1.0, other=2.0), 3.0)
onnxscript/values.py:576: in __call__
    return evaluator.default().eval_function(self, args, kwargs)
onnxscript/evaluator.py:307: in eval_function
    result = function.function(*adapted_args, **adapted_kwargs)
tests/eager_mode_test.py:59: in add_with_alpha
    other = op.Mul(other, alpha)
.../onnx_opset/_impl/opset14.py:696: in Mul
    return op(*self._prepare_inputs(schema, A, B))
onnxscript/values.py:304: in __call__
    return evaluator.default().eval(schema, args, kwargs)
onnxscript/evaluator.py:194: in eval
    outputs = self._eval(schema, inputs, attributes, closure)
onnxscript/evaluator.py:524: in _eval
    result = session.run(None, session_run_input)
..../test_torch_nightly/lib/python3.12.../onnx/reference/reference_evaluator.py:599: in run
    outputs = node.run(*inputs, **linked_attributes)
..../test_torch_nightly/lib/python3.12.../reference/ops/_op.py:114: in run
    res = OpRunBinary.run(self, x, y)
..../test_torch_nightly/lib/python3.12.../reference/ops/_op.py:93: in run
    raise TypeError(
E   TypeError: Issues with types <class 'numpy.ndarray'>, <class 'numpy.ndarray'> (binary operator 'Mul').

To view more test analytics, go to the Test Analytics Dashboard

### Adapters

# Compatibility Adapter
def adapter_compatible(op: ir.Node, target_opset):
Collaborator

nit: I recommend node instead of op. We frequently use op for building a node via the op.MatMul(x, y) syntax.

Collaborator

Furthermore, the general interface for a version-converter would likely need an op/node builder as an input, and that would be best called op. We would need it when creating new nodes as part of the version-conversion (e.g., even in the example below, to create a Constant node from an attribute value).
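A minimal sketch of what such a signature might look like, assuming the builder is passed in alongside the node; the names and types here are illustrative, not the API in this PR:

```python
from __future__ import annotations

from typing import Sequence

from onnxscript import ir


def example_adapter(node: ir.Node, op) -> Sequence[ir.Node] | None:
    """Hypothetical adapter: `node` is the node being upgraded; `op` is a
    node builder used to emit any new nodes (e.g. op.Constant(...) when an
    attribute has to become an input)."""
    # Return None when the node can stay as-is, or a list of replacement nodes.
    return None
```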



_ADAPTERS_18_19 = {
"Equal": adapter_compatible,
Collaborator

Minor nit: I think it would be better not to have to register an adapter for compatible extensions, since it is basically an identity operation.

I can see value in explicitly documenting that a particular opset update is backward-compatible (to catch the case where we forget to register an adapter). We could do that using a separate set of all compatible ops.
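For illustration, such a registry split might look like this (the names are hypothetical, not from this PR):

```python
# Ops whose 18 -> 19 change is a backward-compatible extension: no adapter
# needed, but listing them documents that the update was considered.
_COMPATIBLE_OPS_18_19 = frozenset({"Equal"})

# Only ops that actually need rewriting get an adapter registered.
_ADAPTERS_18_19 = {
    # "SomeOp": adapter_someop,  # placeholder; real adapters go here
}
```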


# Compatibility Adapter
def adapter_compatible(op: ir.Node, target_opset):
op.version = target_opset
Collaborator

Setting the version on a node need not be part of the adapter logic. It should be in the converter logic that calls the adapter, since this logic is the same for all adapters. There is no point in duplicating this line in every adapter. That would also make a compatible adapter an identity function, and we wouldn't even need to register one for compatible extensions.

@gramalingam
Collaborator

One of the main design questions relates to the "adapter" signature: what form should it take? Essentially it is a function that takes a single node as a parameter and modifies it in some form. The changes are typically a simple mutation of a node along with potentially other changes (such as the insertion of extra nodes).

For now, I think it might be fine to follow the pattern used in the optimizer and rewriter, which are based on node-transformers that, given an input node, return a sequence of replacement nodes or None (if no replacement is required). This allows a simple loop over all nodes in the graph that transforms each node in sequence. This can be generalized later if necessary.
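A rough sketch of that pattern, assuming adapters return replacement nodes (or None) and the converter itself records the new opset version, as suggested above; everything here is illustrative, not code from this PR:

```python
from __future__ import annotations

from typing import Callable, Sequence

from onnxscript import ir

# A node-transformer maps one node to its replacement nodes, or None if the
# node is unchanged (a compatible extension).
Adapter = Callable[[ir.Node], Sequence[ir.Node] | None]


def convert_one_step(graph: ir.Graph, adapters: dict[str, Adapter], target: int) -> None:
    for node in list(graph):  # snapshot, since adapters may insert/remove nodes
        adapter = adapters.get(node.op_type)
        replacements = adapter(node) if adapter is not None else None
        if replacements is not None:
            # Splicing the replacements into the graph is left to the IR API;
            # each replacement node gets the target version too.
            for new_node in replacements:
                new_node.version = target
        else:
            # No rewrite needed: the converter, not the adapter, bumps the version.
            node.version = target
```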

name=_attr.name,
)
# Add the ir.Value as inputs to the node and graph
node._inputs = node._inputs + (attr_as_input,)
Collaborator

note: avoid using and modifying private fields. To change inputs to a node, always initialize a new node to replace the current one.
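A sketch of the suggested approach; the exact ir.Node constructor arguments are an assumption about the IR API, not code from this PR:

```python
from onnxscript import ir


def with_extra_input(node: ir.Node, attr_as_input: ir.Value) -> ir.Node:
    """Return a replacement for `node` with `attr_as_input` appended to its inputs."""
    return ir.Node(
        node.domain,
        node.op_type,
        inputs=[*node.inputs, attr_as_input],
        attributes=list(node.attributes.values()),
        num_outputs=len(node.outputs),
    )
```

The replacement node would then be spliced in place of the original instead of mutating node._inputs.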

Collaborator

+1 ... we could do what the optimizer and rewriter currently do: use a more generic interface for a node-adapter that returns a list of replacement nodes (or None if no replacement is needed).

Collaborator

Further, we can't just create a value as above ... we need to create a constant value with the given value. For now, the simplest way is to create a new Constant node. (Orthogonally to this, we should extend the "builder" API we currently use to create initializers as well, but that's a separate issue; for now, Constant nodes should be fine.)
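For example, the attribute value could be materialized as a Constant node along these lines; the ir.tensor and ir.AttrTensor helpers here are assumptions about the IR, so treat this as a sketch only:

```python
import numpy as np

from onnxscript import ir


def constant_from_attr(attr) -> ir.Node:
    """Build a Constant node carrying the attribute's value (sketch only)."""
    # Assumed helpers: ir.tensor(...) wraps a numpy array and ir.AttrTensor(...)
    # builds a tensor attribute; adjust to whatever the IR actually provides.
    return ir.Node(
        "",
        "Constant",
        inputs=[],
        attributes=[ir.AttrTensor("value", ir.tensor(np.array(attr.value)))],
        num_outputs=1,
    )
```

The Constant node's single output would then be wired in as the extra input, rather than a bare ir.Value.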

self.assertEqual(nodes[1].version, 19)
self.assertEqual(nodes[4].op_type, "GridSample")
self.assertEqual(nodes[4].version, 20)
self.assertEqual(model.graph._nodes[4]._attributes["mode"].value, "cubic")
Collaborator

note: Avoid accessing internal fields

@shubhambhokare1 self-assigned this on Nov 6, 2024
@shubhambhokare1 added the enhancement (New feature or request), topic: api, and topic: IR (Intermediate representation) labels on Nov 6, 2024
self.custom_adapters = custom_adapter_list

def graph_version_convert(self, graph: ir.Graph, target_version: int) -> None:
if self.model_version == target_version:
Collaborator

(Extension) I think we will soon need to support the case where the incoming model has nodes with different opset versions. At that point, such checks should happen at the node level, not at the model level.


# Iterate from current model version -> target version
# Updating each node based on the correct adapter [ver->ver+1]
for opset_version in range(self.model_version, target_version):
Collaborator

It may be better to explicitly check for target_version being > or < the current version. (It's ok to focus on up-conversion in the first version, but it may be better to emit an error/warning message if we run into down-conversion while it is unimplemented.)

Collaborator

Actually, this could be bundled into a check after pick_adapter_set to more generally handle the case when we don't have an adapter set (for either 23 to 24 or for 18 to 17).
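Something along these lines, as a sketch; pick_adapter_set is the helper referenced above, but the tuple-keyed registry and function names are assumptions:

```python
import warnings


def check_direction(model_version: int, target_version: int) -> None:
    # Fail fast on down-conversion until it is implemented.
    if target_version < model_version:
        raise NotImplementedError(
            f"Down-conversion from opset {model_version} to {target_version} "
            "is not supported yet."
        )


def get_adapter_set(adapter_sets: dict, opset_version: int):
    # Handles both unimplemented steps (e.g. 23 -> 24) and down-conversion
    # (e.g. 18 -> 17) uniformly: no registered adapter set means warn and skip.
    adapters = adapter_sets.get((opset_version, opset_version + 1))
    if adapters is None:
        warnings.warn(
            f"No adapters registered for opset {opset_version} -> {opset_version + 1}"
        )
    return adapters
```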

@justinchuby
Collaborator

justinchuby commented Nov 7, 2024

Questions related to the design doc that come to mind:

  1. How do we ensure down-conversion can be supported in the future?
  2. What invariants do we preserve in the nodes? An opset version may or may not be associated with an ir.Node; how are both cases handled?
  3. How does the design relate to the rewriter? Conceptually version conversion is a model rewriting process. How do we plan to maintain a consistent dev/user experience when authoring and debugging subgraph replacement logic?
  4. When version conversion fails, how should it fail? Succeed partially, abort, etc.? What are the guarantees/invariants of the model state when conversion is not possible?
  5. Any performance considerations?
  6. What is the path for supporting new opsets?
  7. How is the conversion logic tested to ensure it is robust? How is it designed so that future maintenance is simple and scalable?

def __init__(self, target_version: int):
self.target_version = target_version

def process_node(self, node: ir.Node, opset_version):

Check notice — Code scanning / CodeQL: Explicit returns mixed with implicit (fall-through) returns

Mixing implicit and explicit returns may indicate an error, as implicit returns always return None.
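A minimal illustration of the finding (not the actual process_node in this PR): the first variant falls through and implicitly returns None on one path, the second makes every return explicit.

```python
def process_node_implicit(node, opset_version):
    if node.op_type in ("Equal", "GridSample"):
        return [node]
    # fall-through: implicitly returns None


def process_node_explicit(node, opset_version):
    if node.op_type in ("Equal", "GridSample"):
        return [node]
    return None  # explicit: no replacement needed
```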
model = ir.serde.deserialize_model(model_proto)
target_version = 17
version_converter.convert_version(model, target_version=target_version)
nodes = model.graph._nodes

Check notice — Code scanning / CodeQL: Unused local variable

Variable nodes is not used.