[optimizer] Allow optimizer to function when external data is not available #1792

justinchuby · 2024-08-08T15:02:26Z

Models with external data may not have their data available when being passed into the optimizer. We should still optimize what we can without raising an error.

…perly (#1801) Implement efficient save/load and handle loading external data properly in the IR. Before this change, when a ModelProto containing external data is converted to IR, the external tensor objects will load the data from a path relative to the working directory, not the ONNX file. This is because we do not store the onnx file path and thus have no way to look for the external data file. With the change, a `base_dir` property is added to ExternalTensor that we can set, in a separate pass when the directory is available, so the object has full information to find the data file on disk. The base_dir is not serialized to the proto to maintain a relative path in the "location" field in TensorProto. #1701, #1792 Example: ``` >>> m.graph.initializers["model.model.decoder.layers.2.encoder_attn.v_proj.weight"].const_value.display() ExternalTensor<FLOAT,[512,512]>(path='model.onnx.data', name='model.model.decoder.layers.2.encoder_attn.v_proj.weight', offset=245864448, length=1048576, base_dir='/home/justinchu/dev/ONNXConverter/docker/dump_bash_bench/BlenderbotSmallForConditionalGeneration-torch -onnx-detailed-cpu-') Min: -0.08586505800485611, Max: 0.09103105217218399, NaN count: 0, Inf count: 0 Sparsity (abs<1e-06): 0.00 Histogram: 11504 ┼ 10226 ┤ ╭───────╮ 8948 ┤ ╭─╯ ╰─╮ 7670 ┤ ╭─╯ ╰─╮ 6392 ┤ ╭─╯ ╰─╮ 5113 ┤ ╭─╯ ╰─╮ 3835 ┤ ╭─╯ ╰─╮ 2557 ┤ ╭──╯ ╰─╮ 1279 ┤ ╭────╯ ╰────╮ 1 ┼────────────────╯ ╰─────────────────── -0.0859 -0.0682 -0.0505 -0.0306 -0.0129 0.0070 0.0225 0.0402 0.0557 0.0733 0.0910 ```

justinchuby · 2024-09-27T01:12:50Z

cc @gramalingam

gramalingam · 2024-10-02T23:34:23Z

I took a quick look at the IR-based constant-folder/optimizer. I think it should work, as long as the numpy() method returns None or value.const_value is None. We can change it to use whatever appropriate methods the IR exposes to access such external tensors, allowing for possibility that the value may not be available.

On a different note: I just remembered that ir.Value.const_value is used for initializers that are graph inputs as well. This could break a few things. The optimizer etc. assume .const_value has a value only for constants, but graph inputs that are initializers should NOT be treated as constants.

gramalingam · 2024-10-03T00:01:51Z

Is this manifesting itself in any benchmark run (or another model)? Perhaps for proto-based optimization? We should deprecate that anyway.

justinchuby · 2024-10-03T01:23:43Z

I took a quick look at the IR-based constant-folder/optimizer. I think it should work, as long as the numpy() method returns None or value.const_value is None. We can change it to use whatever appropriate methods the IR exposes to access such external tensors, allowing for possibility that the value may not be available.

On a different note: I just remembered that ir.Value.const_value is used for initializers that are graph inputs as well. This could break a few things. The optimizer etc. assume .const_value has a value only for constants, but graph inputs that are initializers should NOT be treated as constants.

Right. We shouldn’t assume only Constants have const_value. Any value can have constant values.

justinchuby added enhancement New feature or request topic: optimizer labels Aug 8, 2024

justinchuby changed the title ~~Allow optimizer to function when external data is not available~~ [optimizer] Allow optimizer to function when external data is not available Aug 8, 2024

justinchuby mentioned this issue Aug 13, 2024

[IR] Implement save/load functions in IR and handle external data properly #1801

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[optimizer] Allow optimizer to function when external data is not available #1792

[optimizer] Allow optimizer to function when external data is not available #1792

justinchuby commented Aug 8, 2024

justinchuby commented Sep 27, 2024

gramalingam commented Oct 2, 2024

gramalingam commented Oct 3, 2024

justinchuby commented Oct 3, 2024

[optimizer] Allow optimizer to function when external data is not available #1792

[optimizer] Allow optimizer to function when external data is not available #1792

Comments

justinchuby commented Aug 8, 2024

justinchuby commented Sep 27, 2024

gramalingam commented Oct 2, 2024

gramalingam commented Oct 3, 2024

justinchuby commented Oct 3, 2024