
[NPU] Use zero tensor to get correct data #27980

Open
pereanub wants to merge 6 commits into master from use_correct_tensor
Conversation

pereanub (Contributor) commented Dec 9, 2024

Details:

  • The plugin cannot deallocate and re-allocate new memory for a tensor when updating mutable command lists is not supported. The new memory must be updated in the graph, and that can be done only through the mutable command list feature.
  • To check whether the memory was re-allocated, we compare the unique ID that the driver assigns when the memory is created (a sketch of this check follows below).

Tickets:

  • E#134453
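
A minimal sketch of the re-allocation check described in the details above, using the standard Level Zero zeMemGetAllocProperties query. The helper name, the stored-id bookkeeping, and the surrounding logic are illustrative assumptions, not the PR's actual code.

    #include <ze_api.h>

    // Hypothetical helper: query the unique id the driver assigns to an allocation.
    // A changed id means the tensor memory was re-allocated, so the new address
    // would have to be patched into the graph, which requires the mutable
    // command list feature.
    static uint64_t get_allocation_id(ze_context_handle_t context, const void* ptr) {
        ze_memory_allocation_properties_t props = {};
        props.stype = ZE_STRUCTURE_TYPE_MEMORY_ALLOCATION_PROPERTIES;
        zeMemGetAllocProperties(context, ptr, &props, nullptr);
        return props.id;
    }

    // Sketch of the check (previously_stored_id is a placeholder):
    // if (get_allocation_id(context, tensor->data()) != previously_stored_id) {
    //     // memory was re-allocated; the graph arguments must be updated
    // }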

github-actions bot added the category: NPU OpenVINO NPU plugin label Dec 9, 2024
pereanub changed the title Use zero tensor to get correct data → [NPU] Use zero tensor to get correct data Dec 9, 2024
pereanub changed the title [NPU] Use zero tensor to get correct data → [DO NOT MERGE][NPU] Use zero tensor to get correct data Dec 10, 2024
pereanub marked this pull request as ready for review December 10, 2024 13:37
pereanub requested review from a team as code owners December 10, 2024 13:37
pereanub force-pushed the use_correct_tensor branch 11 times, most recently from c718ff5 to b1afd2c on December 16, 2024 14:55
pereanub changed the title [DO NOT MERGE][NPU] Use zero tensor to get correct data → [NPU] Use zero tensor to get correct data Dec 18, 2024
razvanapetroaie (Contributor) left a comment:

Halfway through, nothing major so far.

razvanapetroaie (Contributor) left a comment:

@MirceaDan99 Please let us know if you have anything before merging this.

src/plugins/intel_npu/src/backend/src/zero_pipeline.cpp — outdated comment thread (resolved)
continue;
}

if (std::dynamic_pointer_cast<ZeroRemoteTensor>(levelZeroTensor.at(SINGLE_TENSOR)) != nullptr) {
Contributor:

Since these three ifs share the same body, you may consider writing them in a more compact manner, i.e. if (condition1 || condition2 || condition3) { common_body; }.

I've also noticed you're using std::dynamic_pointer_cast<ZeroRemoteTensor> quite often to determine whether the tensor is remote. You may consider adding a static method to ZeroRemoteTensor, perhaps called is_remote_tensor, that does this check.

Contributor (Author):

I'm not sure about the second part of the comment. We still need to do the cast, correct? The tensor is an ITensor by default. Can I call methods from the derived class in this case?

Contributor:

Was thinking you could do something like:

class ZeroRemoteTensor {
public:
    // Returns true when the generic ITensor actually wraps a ZeroRemoteTensor.
    static bool is_remote_tensor(const std::shared_ptr<ov::ITensor>& tensor) {
        return std::dynamic_pointer_cast<ZeroRemoteTensor>(tensor) != nullptr;
    }
};

And call it like ZeroRemoteTensor::is_remote_tensor(tensor), so in this case you would practically just place the helper function inside the ZeroRemoteTensor "namespace". I'm not sure this is the best approach or the best place for the helper; the suggestion is mainly about defining a helper for this check wherever you think it belongs best.
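
Put together, the compacted check from the earlier comment could look roughly like this; the condition names are taken from the snippets quoted in this review and may not match the final code exactly:

    // Hypothetical compaction of the three branches that share the same body,
    // using the suggested helper (names taken from the quoted snippets):
    if (inputDescriptor.isShapeTensor || inputDescriptor.isStateInput ||
        ZeroRemoteTensor::is_remote_tensor(levelZeroTensor.at(SINGLE_TENSOR))) {
        continue;
    }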

}

const auto inputDescriptor = _metadata.inputs.at(inputIndex);
if (inputDescriptor.isShapeTensor || inputDescriptor.isStateInput) {
Contributor:

Are we sure we should skip state tensors? Just asking whether you've considered this case thoroughly; I haven't checked it yet. The tensor can be retrieved via SyncInferRequest::query_state -> get_state.
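
For reference, this is roughly how a state tensor is reached from user code through the public OpenVINO API (a sketch; compiled_model is assumed to already exist):

    #include <openvino/openvino.hpp>

    // Sketch: obtaining the state tensor the comment above refers to.
    void inspect_states(ov::CompiledModel& compiled_model) {
        ov::InferRequest request = compiled_model.create_infer_request();
        for (ov::VariableState& state : request.query_state()) {
            // get_state() exposes the tensor backing the state; if state inputs
            // are skipped when refreshing graph arguments, changes to this
            // memory could be missed by the graph.
            ov::Tensor state_tensor = state.get_state();
            (void)state_tensor;  // placeholder: real code would inspect or update it
        }
    }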

Contributor (Author):

Not sure yet how this is supposed to work with state tensors.

pereanub force-pushed the use_correct_tensor branch 6 times, most recently from f505588 to 1b6d092 on January 8, 2025 10:25
pereanub force-pushed the use_correct_tensor branch from 1b6d092 to 71035d6 on January 8, 2025 10:39
pereanub force-pushed the use_correct_tensor branch from 71035d6 to ae58e16 on January 8, 2025 11:26
pereanub force-pushed the use_correct_tensor branch from ae58e16 to bd6c07b on January 8, 2025 11:52