
[Good First Issue]: Support aten::avg_poolNd and aten::max_poolNd with no batch #20927

Closed
mvafin opened this issue Nov 7, 2023 · 50 comments
Labels: category: PyTorch FE OpenVINO PyTorch Frontend · good first issue Good for newcomers · no_stale Do not mark as stale
@mvafin (Contributor) commented Nov 7, 2023

Context

The OpenVINO component responsible for supporting PyTorch models is called the PyTorch Frontend (PT FE). PT FE converts a model represented as a TorchScript model to a model in the OpenVINO opset.

What needs to be done?

Example Pull Requests

No response

Resources

Contact points

@openvinotoolkit/openvino-pytorch-frontend-maintainers

Ticket

No response

@mvafin mvafin added good first issue Good for newcomers no_stale Do not mark as stale labels Nov 7, 2023
@github-project-automation github-project-automation bot moved this to Contributors Needed in Good first issues Nov 7, 2023
@mlukasze mlukasze added the category: PyTorch FE OpenVINO PyTorch Frontend label Nov 7, 2023
@ahmadchalhoub (Contributor)

.take

github-actions bot:

Thank you for looking into this issue! Please let us know if you have any questions or require any help.

@ilya-lavrenov ilya-lavrenov moved this from Contributors Needed to Assigned in Good first issues Dec 13, 2023
@p-wysocki (Contributor)

Hello @ahmadchalhoub! Thank you for taking a look; please let us know if you have any questions. Just yesterday our CONTRIBUTING.md was updated with a technical guide - I highly recommend checking it out. :)

@ahmadchalhoub (Contributor) commented Dec 15, 2023

@p-wysocki Thanks for your input! I'll definitely take a look at the updates :). I already started working on the PR for this but it might take me a few days.

@p-wysocki (Contributor)

Hello @ahmadchalhoub, do you need any help? Are you still working on that issue?

I am happy to announce that we have created a channel dedicated to Good First Issues support on our Intel DevHub Discord server! Join it to receive support, engage in discussions, ask questions and talk to OpenVINO developers.

@ahmadchalhoub (Contributor)

Hi @p-wysocki! Very sorry for the delay; I got caught up with some things and will not be able to work on this issue. I will un-assign myself so someone else can pick it up :)

@ahmadchalhoub ahmadchalhoub removed their assignment Jan 5, 2024
@p-wysocki (Contributor)

No worries, you're welcome to pick up another issue anytime. :)

@p-wysocki p-wysocki moved this from Assigned to Contributors Needed in Good first issues Jan 5, 2024
@Sar2580P commented Jan 7, 2024

@p-wysocki I am new to open-source, could you please provide some guidance relevant to the above issue?
I want to get started. I think the task is just to reshape the input twice, but I need some understanding of the code...

@mlukasze (Contributor) commented Jan 8, 2024

@mmikolajcz could you help here?

@p-wysocki (Contributor)

Hello @Sar2580P, you can start with our technical guide in CONTRIBUTING.md :)

You're also welcome to join our Intel DevHub Discord server, where you can receive chat support from OpenVINO developers.

@mvafin (Contributor, Author) commented Jan 8, 2024

> @p-wysocki I am new to open-source, could you please provide some guidance relevant to the above issue? I want to get started. I think the task is just to reshape the input twice, but I need some understanding of the code...

That is exactly right: you need to reshape the input twice. The main problem is calculating the shape to reshape to when the input shape is only known at runtime.
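The two reshapes can be illustrated with a plain numpy sketch (a hypothetical helper for intuition, not the PT FE code; the hard part discussed above, runtime-only shapes, does not arise in eager numpy):

```python
import numpy as np

def pool_without_batch(chw, pool):
    # Hypothetical helper illustrating the two reshapes: add a batch dim of 1,
    # run the pooling (which assumes NCHW), then drop the batch dim again.
    nchw = np.expand_dims(chw, axis=0)   # CHW  -> 1CHW
    pooled = pool(nchw)                  # pooling in NCHW mode
    return np.squeeze(pooled, axis=0)    # 1CHW -> CHW

# Stand-in "pooling" that halves the spatial dims by strided slicing.
x = np.zeros((3, 4, 4), dtype=np.float32)
y = pool_without_batch(x, lambda t: t[:, :, ::2, ::2])
print(y.shape)  # (3, 2, 2)
```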

@SpaceDigi

I am very sorry to ask, but is there a timeline for fixing this bug?

@krish1209

.take

github-actions bot:

Thank you for looking into this issue! Please let us know if you have any questions or require any help.

@krish1209 commented Jan 16, 2024

Hi! I am new to open-source contributing, but I am excited to start. I am also going to participate in GSoC '24, choosing the OpenVINO organization. I am currently pursuing a Bachelor's degree in CSE (AI and Machine Learning) and have some experience with deep learning and neural networks.

The problem here is that we need batch-size initialization. Channel refers to the image components of an image, e.g. RGB, CMY, grayscale; in a CNN it typically means the depth dimension of the input tensor.

OpenVINO's MaxPool and AvgPool operations assume that the batch and channel dimensions exist. Inputs that lack a batch dimension need to be assigned a batch of 1, and after the operation the output should be restored to its original shape.

We can try adding a block of code that checks whether the input has a batch dimension and, if not, assigns a batch of 1.

I am writing the code here since I don't know if there is a separate section for providing code as solutions. Also, if any mentor could help me with how to use pull requests and how to know whether an issue is solved, it would be really helpful :)

This would modify the following file (this is my first open-source contribution, please be kind :) ):
https://github.com/openvinotoolkit/openvino/blob/master/src/frontends/pytorch/src/op/avg_poolnd.cpp

OutputVector translate_avg_poolnd(const NodeContext& context) {
    // Same code as in the linked file; only the batch check below is added.
    auto input = context.get_input(0);

    // Check whether the input tensor has a batch dimension: the rank (number
    // of dimensions) must be known at compile time and be less than 3.
    if (input.get_partial_shape().rank().is_static() && input.get_partial_shape().rank().get_length() < 3) {
        // Add a batch size of 1.
        auto batch_size = context.mark_node(v0::Constant::create(element::i32, Shape{}, {1}));
        input = context.mark_node(std::make_shared<v1::Broadcast>(input, batch_size));
    }

    // The rest of the code is the same as in the repository.

    return {context.mark_node(
        std::make_shared<v1::AvgPool>(input, strides, pads, pads, kernel, !count_include_pad, rounding_type))};
}

Please feel free to guide me more and notify me if anything comes up.

@mvafin (Contributor, Author) commented Jan 16, 2024

@krish1209 Please check the contribution guide for tips on how to contribute and create pull requests.

The code you provided will not actually work as you expect: Broadcast accepts a shape to broadcast to, so when you provide 1 you ask to broadcast the tensor to a scalar, which should fail. I would recommend using the Unsqueeze operation instead. The main difficulty in this task is that pooling can be done for any of the shapes NCH, CH, NCHW, CHW, NCHWD, CHWD. You may want to use information about which torch operation is used, e.g. aten::max_pool2d or aten::max_pool1d, to understand which dimensions need to be pooled.

Also, please write tests for all of these cases first and make sure your solution passes them.

If you have any questions please feel free to ask.
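The Broadcast-vs-Unsqueeze point, and how the aten op name hints at the expected rank, can be illustrated with a numpy analogue (a sketch of the semantics, not OpenVINO code; the `expected_rank` mapping is illustrative):

```python
import numpy as np

x = np.zeros((3, 8, 8))  # a CHW tensor with no batch dimension

# Broadcast takes a *target shape*: asking for shape (1,) means "make this a
# 1-element tensor", which is invalid for a (3, 8, 8) input.
try:
    np.broadcast_to(x, (1,))
    broadcast_worked = True
except ValueError:
    broadcast_worked = False
print("broadcast to (1,) worked:", broadcast_worked)  # False

# Unsqueeze instead inserts a new leading axis of size 1.
print(np.expand_dims(x, axis=0).shape)  # (1, 3, 8, 8)

# The aten op name tells us the expected full rank (illustrative mapping):
expected_rank = {"aten::max_pool1d": 3, "aten::max_pool2d": 4, "aten::max_pool3d": 5}
print(x.ndim < expected_rank["aten::max_pool2d"])  # True: batch dim is missing
```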

@mvafin mvafin moved this from Contributors Needed to Assigned in Good first issues Jan 16, 2024
@krish1209

Okay, I'll wrap my head around it. Meanwhile, could you let me know how you would approach this?

@cannguyen275

Hello, I'm facing this issue and looking forward to a fix. My MaxPool3d layer is causing it:

nn.MaxPool3d(kernel_size=(1, 3, 3), stride=(1, 1, 1))

Is there any way to work around this issue instead of waiting for an update?

@mvafin (Contributor, Author) commented Apr 15, 2024

@cannguyen275 You could modify the model code to unsqueeze before the pool and squeeze after, so pooling runs in NCHWD mode instead of CHWD.
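The suggested model-side workaround can be sketched as a small PyTorch wrapper (a hypothetical module, not part of OpenVINO; it uses the exact pool layer quoted above):

```python
import torch
import torch.nn as nn

class PoolCHWD(nn.Module):
    """Hypothetical wrapper showing the workaround: unsqueeze a batch
    dimension before the pool and squeeze it away afterwards, so the
    pooling itself always runs in NCHWD mode."""
    def __init__(self):
        super().__init__()
        self.pool = nn.MaxPool3d(kernel_size=(1, 3, 3), stride=(1, 1, 1))

    def forward(self, x):        # x: (C, D, H, W), no batch dimension
        x = x.unsqueeze(0)       # -> (1, C, D, H, W)
        x = self.pool(x)         # pool with a batch dimension present
        return x.squeeze(0)      # -> (C, D', H', W')

y = PoolCHWD()(torch.randn(2, 4, 8, 8))
print(tuple(y.shape))  # (2, 4, 6, 6): kernel (1, 3, 3), stride 1
```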

@mlukasze mlukasze moved this from In Review to Contributors Needed in Good first issues Jun 18, 2024
@tianyiSKY1 (Contributor)

.take

github-actions bot commented Sep 3, 2024

Thank you for looking into this issue! Please let us know if you have any questions or require any help.

@mlukasze mlukasze moved this from Contributors Needed to Assigned in Good first issues Sep 3, 2024
@tianyiSKY1 (Contributor)

@mlukasze Hello, I’m new to open source. Could you help me understand how avg_poolnd.cpp and max_poolnd.cpp handle inputs of varying dimensions?

@p-wysocki (Contributor)

Hello @tianyiSKY1! Thank you for taking a look at the task.

In the files you mentioned, the input is retrieved using auto input = context.get_input(0); and passed directly into the operator constructor:

auto res = context.mark_node(std::make_shared<v14::MaxPool>(context.get_input(0),

As for inputs of varying dimensions, you're most likely talking about PartialShapes. A PartialShape is a shape which can have a dynamic rank (we know pretty much nothing about the data) or a static rank. In the case of static rank, the shape is a collection of Dimension objects.

Dimensions can be static (for example {5}), dynamic ({?}) or an interval ({3, 6}). In this task, I suppose the case you mostly care about is whether the input has a static rank, and what that rank is. In practice, when the graph has a dynamic input shape we make it static at runtime, once the input data is given and its shape is known; since this task concerns a frontend, that is not something that should concern you.

In the snippet in #20927 (comment) you can see how to access the input's PartialShape and rank.

Additionally, it's worth checking out @mvafin's very informative comment about handling different data layouts: #20927 (comment).

Please let us know if you have any other questions.

@tianyiSKY1 (Contributor)

@p-wysocki Hi, I ran a preliminary test and wrote some test code. However, every time I input a case with CHWD dimensions, the test fails with an 'is_conversion_successful' error. Is there a part of the code that automatically detects the input shape, which might be preventing me from proceeding with further testing?

class TestAvgPool3D(PytorchLayerTest):
    
    def _prepare_input(self):
        return (self.input_tensor, )
    
    def create_model(self, kernel_size, stride, padding, ceil_mode=True, count_include_pad=True):
        class aten_avg_pool3d(torch.nn.Module):

            def __init__(self, kernel_size, stride, padding, ceil_mode, count_include_pad) -> None:
                super().__init__()
                self.kernel_size = kernel_size
                print(kernel_size)
                self.stride = stride
                print(stride)
                self.padding = padding
                self.ceil_mode = ceil_mode
                self.count_include_pad = count_include_pad

            def forward(self, input_tensor):
                return torch.nn.functional.avg_pool3d(input_tensor, self.kernel_size, self.stride, self.padding, self.ceil_mode,
                                                      self.count_include_pad)
        ref_net = None

        return aten_avg_pool3d(kernel_size, stride, padding, ceil_mode=True, count_include_pad=True), ref_net, "aten::avg_pool3d"

    @pytest.mark.parametrize('input_shape', [[1, 3, 15, 15, 15], [3, 15, 15, 15]])
    @pytest.mark.parametrize("params", d3_params)
    @pytest.mark.parametrize("ceil_mode", [True, False])
    @pytest.mark.parametrize("count_include_pad", [True, False])
    @pytest.mark.nightly
    @pytest.mark.precommit
    @pytest.mark.precommit_torch_export
    @pytest.mark.precommit_fx_backend
    def test_avg_pool3d(self, params, ceil_mode, count_include_pad, ie_device, precision, ir_version, input_shape):
        self.input_tensor = np.random.randn(*input_shape).astype(np.float32)
        self._test(*self.create_model(**params, ceil_mode=ceil_mode, count_include_pad=count_include_pad),
                   ie_device, precision, ir_version, trace_model=True, dynamic_shapes=False)
FAILED test_avg_pool.py::TestAvgPool3D::test_avg_pool3d[ ie_device:CPU - precision:FP16 - count_include_pad:False - ceil_mode:False - params:{'kernel_size': [3, 2, 1], 'stride': None, 'padding': [0, 0, 0]} - input_shape:[3, 15, 15, 15] ] - openvino._pyopenvino.OpConversionFailure: Check 'is_conversion_successful' failed at src/frontends/pytorch/src/frontend.cpp:166:

@p-wysocki (Contributor)

Judging by the check it seems the operator for some reason hasn't been converted:

bool is_conversion_successful = unconverted_ops.size() == 0 && norm_err.empty();
FRONT_END_OP_CONVERSION_CHECK(is_conversion_successful, pack_detailed_failure_report(unconverted_ops, norm_err));

I'd use a debugger to see exactly what's going on in the function during the test, with a breakpoint at the beginning of the conversion function. Perhaps there's some assert the operator doesn't pass, causing the conversion function to fail?

@tianyiSKY1 (Contributor) commented Sep 9, 2024

Hi @p-wysocki, I found the check that prevents my model from converting:

NODE_VALIDATION_CHECK(op,
data_rank.is_dynamic() || num_spatial == (data_shape.size() - spatial_dim_offset),
"Expected kernel size to be equal to input size - 2. Got: ",
num_spatial);

I'm trying to force the model conversion by commenting out this check. I recompiled after making the change, but it still runs the original code when I test. I've also tried deleting the build directory and recompiling, but that doesn't help.

E       Check 'data_rank.is_dynamic() || num_spatial == (data_shape.size() - spatial_dim_offset)' failed at src/core/shape_inference/include/pooling_shape_inference_util.hpp:56:
E       While validating node 'opset1::AvgPool AvgPool_156823 (opset1::Pad aten::avg_pool3d/Pad[0]:f32[?,?,?,?]) -> (dynamic[...])' with friendly_name 'AvgPool_156823':
E       Expected kernel size to be equal to input size - 2. Got: 3

@mvafin (Contributor, Author) commented Sep 9, 2024

@tianyiSKY1 You shouldn't modify the shape inference code. The problem that needs to be solved is reshaping the CHW input to 1CHW, and reshaping the output of the operation back by removing the 1.

@tianyiSKY1 (Contributor)

@mvafin I have tried reshaping the CHW input to 1CHW and reshaping the output of the operation back by removing the 1. But in the testing phase, the check on the input shape prevents the model conversion.

OutputVector translate_avg_pool3d(const NodeContext& context) {
    auto input = context.get_input(0);
    auto input_shape = input.get_partial_shape();
    
    auto kernel = context.const_input<Shape>(1);

    // Check if the input has a batch dimension
    bool has_batch_dim = input_shape.rank().is_static() && input_shape.rank().get_length() == 5;

    if (!has_batch_dim) {
        // If there is no batch dimension, add one by unsqueezing the input tensor
        auto zero = v0::Constant::create(element::i32, Shape{}, {0});
        auto unsqueezed_input = context.mark_node(std::make_shared<v0::Unsqueeze>(input, zero));
        input = unsqueezed_input;
    }

    auto pooled_output = translate_avg_pool_base(context, input);

    auto const_0 = context.mark_node(v0::Constant::create(element::i32, Shape{1}, {0}));
    auto const_1 = context.mark_node(v0::Constant::create(element::i32, Shape{1}, {1}));
    auto slice_end = context.mark_node(v0::Constant::create(element::i32, Shape{1}, {-4}));


    if (!has_batch_dim) {
        // If there was no batch dimension, remove the added dimension by slicing the output tensor
        auto sliced_output = context.mark_node(std::make_shared<v8::Slice>(pooled_output[0], const_0, slice_end, const_1, const_0));
        Output<Node> pooled_output = sliced_output;
    }

    return {pooled_output}; 
};

@mvafin (Contributor, Author) commented Sep 9, 2024

Did you add dynamic_shapes=False to the test class?

Your solution uses get_partial_shape, which is not recommended because it will not work for dynamic shapes.

Also, you use Slice on the pool output, which is wrong; you need Squeeze(0) or Reshape.

To make your solution work on dynamic shapes, you need to do something like this, assuming the 3d case:
Before the pool: Reshape(input, [-1,0,0,0,0], true), or calculate the shape similarly as for the output.
After the pool: ShapeOf(input) -> Slice(0, -3, 1); ShapeOf(pooled_output) -> Slice(-3, -1, 1); Concat -> Reshape.
This should work for dynamic shapes, but it creates additional nodes, so ideally you would combine the two solutions using if (partial_shape.is_dynamic()).
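The ShapeOf/Slice/Concat recipe amounts to the following shape arithmetic (a plain-Python sketch of the idea, not graph code; in the actual frontend these shapes are only available as graph nodes):

```python
def restored_shape(input_shape, pooled_shape, num_spatial=3):
    # Take the leading (batch/channel) dims from the original input and the
    # trailing spatial dims from the pool output, then concatenate them.
    leading = list(input_shape)[:-num_spatial]
    spatial = list(pooled_shape)[-num_spatial:]
    return leading + spatial

# CHWD input that was reshaped to NCHWD (N = 1) before pooling:
print(restored_shape([3, 10, 10, 10], [1, 3, 8, 8, 8]))     # [3, 8, 8, 8]
# NCHWD input: the same arithmetic leaves the layout unchanged.
print(restored_shape([2, 3, 10, 10, 10], [2, 3, 8, 8, 8]))  # [2, 3, 8, 8, 8]
```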

@tianyiSKY1 (Contributor)

@mvafin Yes, I added dynamic_shapes=False to the test class.
Thank you very much for your guidance, I will keep trying.

@tianyiSKY1 (Contributor)

@mvafin Hi, sorry to bother you again. Can you please guide me a little further?
I tried to use Reshape to add the batch dimension while keeping the dynamic shape of the input, but it doesn't seem to work.

OutputVector translate_avg_pool3d(const NodeContext& context) {
    auto input = context.get_input(0);
    
    auto kernel = context.const_input<Shape>(1);

    // Check if the input has a batch dimension
    auto input_shape_of = context.mark_node(std::make_shared<v3::ShapeOf>(input));


    auto has_batch_dim = context.mark_node(std::make_shared<v1::Equal>(
        input_shape_of,  
        context.mark_node(v0::Constant::create(element::i64, Shape{1}, {5}))
    ));

    if (!has_batch_dim) {
        // If there is no batch dimension, add one by reshaping the input tensor
        auto reshape_pattern = context.mark_node(v0::Constant::create(element::i32, Shape{5}, {-1, 0, 0, 0, 0}));
        input = context.mark_node(std::make_shared<v1::Reshape>(input, reshape_pattern, true));
    }

    auto pooled_output = translate_avg_pool_base(context, input);

    if (!has_batch_dim) {
        // If there was no batch dimension, remove the added dimension by slicing the output tensor
        auto input_shape_of = context.mark_node(std::make_shared<v3::ShapeOf>(input));
        auto pooled_output_shape_of = context.mark_node(std::make_shared<v3::ShapeOf>(pooled_output[0]));

        auto slice_input_shape = context.mark_node(std::make_shared<v8::Slice>(input_shape_of, 
            context.mark_node(v0::Constant::create(element::i64, Shape{1}, {0})), 
            context.mark_node(v0::Constant::create(element::i64, Shape{1}, {-3})), 
            context.mark_node(v0::Constant::create(element::i64, Shape{1}, {1})), 
            context.mark_node(v0::Constant::create(element::i64, Shape{1}, {0}))));

        auto slice_pooled_output_shape = context.mark_node(std::make_shared<v8::Slice>(pooled_output_shape_of, 
            context.mark_node(v0::Constant::create(element::i64, Shape{1}, {-3})), 
            context.mark_node(v0::Constant::create(element::i64, Shape{1}, {-1})), 
            context.mark_node(v0::Constant::create(element::i64, Shape{1}, {1})), 
            context.mark_node(v0::Constant::create(element::i64, Shape{1}, {0}))));

        auto concat_shape = context.mark_node(std::make_shared<v0::Concat>(OutputVector{slice_input_shape, slice_pooled_output_shape}, 0));
        for (auto& node : pooled_output) {
            node = context.mark_node(std::make_shared<v1::Reshape>(node, concat_shape, true));
        }
        
    }

    return {pooled_output}; 
};

@mvafin (Contributor, Author) commented Sep 10, 2024

@tianyiSKY1 No worries, thank you for working on this issue.

You seem to misunderstand how operations work: std::make_shared<v1::Equal> returns an operation, not the result of the comparison, so !has_batch_dim will always evaluate to false. For the dynamic case you therefore have to use the same code for both the NCHW and CHW cases, which is why I wrote about the inefficiency of this approach: it creates more nodes than we have today for operations that didn't need them before. That is why I recommend separate if branches on partial_shape.is_static() (true: static, false: dynamic), to optimize the static case, where we know for sure whether we need to reshape or not.

As for the code under the if, it seems right in concept, but it needs to be tested.
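The distinction between a graph node and a conversion-time value can be sketched in plain Python (hypothetical helper names; the real branching happens in the C++ translator):

```python
# A graph op like v1::Equal only *builds a node*; it evaluates nothing at
# conversion time, so it can never drive a host-language `if`. The decision
# must come from compile-time rank info instead (hypothetical helper):
def choose_branch(rank_is_static, rank, full_rank=5):
    if rank_is_static:
        # Static case: we already know whether the batch dim is missing, so we
        # can emit either a plain pool or an Unsqueeze + pool + Squeeze chain.
        return "unsqueeze+pool+squeeze" if rank < full_rank else "pool"
    # Dynamic case: emit the generic ShapeOf/Slice/Concat subgraph that
    # handles both layouts at runtime (more nodes, but always correct).
    return "generic-subgraph"

print(choose_branch(True, 4))      # unsqueeze+pool+squeeze
print(choose_branch(True, 5))      # pool
print(choose_branch(False, None))  # generic-subgraph
```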

@tianyiSKY1 (Contributor)

@mvafin Hi, I followed your instructions and tried again. I even tried expanding the batch dimension for every input tensor unconditionally, but it still doesn't work. Did I fail to expand the dimension?

OutputVector translate_avg_pool3d(const NodeContext& context) {
    auto input = context.get_input(0);
    
    auto kernel = context.const_input<Shape>(1);

    // Check if the input has a batch dimension
    auto input_shape = input.get_partial_shape();

    if (input_shape.is_static()) {
        bool has_batch_dim = input_shape.rank().get_length() == 5;
        if (!has_batch_dim) {
            // Reshape input to add a batch dimension
            auto reshape_pattern = context.mark_node(v0::Constant::create(element::i64, Shape{5}, {-1, 0, 0, 0, 0}));
            input = context.mark_node(std::make_shared<v1::Reshape>(input, reshape_pattern, true));
        }
    } else {
        // If the input shape is dynamic, add a batch dimension by default
        auto reshape_pattern = context.mark_node(v0::Constant::create(element::i64, Shape{5}, {-1, 0, 0, 0, 0}));
        input = context.mark_node(std::make_shared<v1::Reshape>(input, reshape_pattern, true));
    }

    auto pooled_output = translate_avg_pool_base(context, input);

    
    // If there was no batch dimension, remove the added dimension by slicing the output tensor
    auto input_shape_of = context.mark_node(std::make_shared<v3::ShapeOf>(input));
    auto pooled_output_shape_of = context.mark_node(std::make_shared<v3::ShapeOf>(pooled_output[0]));

    auto slice_input_shape = context.mark_node(std::make_shared<v8::Slice>(input_shape_of, 
        context.mark_node(v0::Constant::create(element::i64, Shape{1}, {0})), 
        context.mark_node(v0::Constant::create(element::i64, Shape{1}, {-3})), 
        context.mark_node(v0::Constant::create(element::i64, Shape{1}, {1})), 
        context.mark_node(v0::Constant::create(element::i64, Shape{1}, {0}))));

    auto slice_pooled_output_shape = context.mark_node(std::make_shared<v8::Slice>(pooled_output_shape_of, 
        context.mark_node(v0::Constant::create(element::i64, Shape{1}, {-3})), 
        context.mark_node(v0::Constant::create(element::i64, Shape{1}, {-1})), 
        context.mark_node(v0::Constant::create(element::i64, Shape{1}, {1})), 
        context.mark_node(v0::Constant::create(element::i64, Shape{1}, {0}))));

    auto concat_shape = context.mark_node(std::make_shared<v0::Concat>(OutputVector{slice_input_shape, slice_pooled_output_shape}, 0));
    for (auto& node : pooled_output) {
        node = context.mark_node(std::make_shared<v1::Reshape>(node, concat_shape, true));
    }
        
    return pooled_output; 
};
self = <test_avg_pool.TestAvgPool3D object at 0x7f800871b730>
om = <Model: 'Model58'
inputs[
<ConstOutput: names[input_tensor] shape[?,?,?,?] type: f32>
]
outputs[
<ConstOutput: names[] shape[?,?,?,?,?] type: dynamic>
]>
ov_inputs = [array([[[[-0.44533446,  1.8727436 , -0.47645602, ..., -0.37609327,
           0.87295103, -1.6756757 ],
         [-0....      [ 1.0659894 , -0.909589  , -0.8698702 , ...,  1.6324621 ,
          -2.3695803 , -0.50536054]]]], dtype=float32)]
dynamic_shapes = True

    def _resolve_input_shape_dtype(self, om, ov_inputs, dynamic_shapes):
        params = list(om.inputs)
        for i in range(len(ov_inputs)):
            inp = ov_inputs[i]
            if isinstance(inp, list):
                ov_inputs[i] = np.array(inp)
                if ov_inputs[i].dtype == np.int64:
                    ov_inputs[i] = ov_inputs[i].astype(np.int32)
                inp = ov_inputs[i]
            assert inp.dtype.name in self._type_map, f"Unknown type {inp.dtype}."
            if params[i].get_node().get_element_type().is_dynamic():
                params[i].get_node().set_element_type(
                    self._type_map[inp.dtype.name])
            shape = [-1] * len(inp.shape) if dynamic_shapes else inp.shape
            params[i].get_node().set_partial_shape(PartialShape(shape))
>       om.validate_nodes_and_infer_types()
E       RuntimeError: Check 'data_rank.is_dynamic() || num_spatial == (data_shape.size() - spatial_dim_offset)' failed at src/core/shape_inference/include/pooling_shape_inference_util.hpp:56:
E       While validating node 'opset1::AvgPool aten::avg_pool3d/AvgPool (opset1::Pad aten::avg_pool3d/Pad[0]:f32[?,2..,2..,2..]) -> (dynamic[?,?,?,?,?])' with friendly_name 'aten::avg_pool3d/AvgPool':
E       Expected kernel size to be equal to input size - 2. Got: 3

pytorch_layer_test_class.py:262: RuntimeError

@tianyiSKY1 (Contributor)

@mvafin Hello, I was wondering if there is a way to inspect the variables in avg_poolnd.cpp using std::cout while running the pytest tests.

@mlukasze (Contributor)

@mvafin gently ping

@mvafin (Contributor, Author) commented Sep 18, 2024

@tianyiSKY1 From what I can see, your code should work. You can use std::cout in the code to debug; if you don't see the debug prints you inserted, it may mean your code is not being used. You could also prepare a PR; that would make it easier to understand the issues you are having.

@tianyiSKY1 (Contributor)

Hi @mvafin, I've solved the problem of the code not being used. After successfully running it, I realized that I can't expand the input's dimensions with v1::Reshape:

auto reshape_pattern = context.mark_node(v0::Constant::create(element::i64, Shape{5}, {-1, 0, 0, 0, 0}));
input = context.mark_node(std::make_shared<v1::Reshape>(input, reshape_pattern, true));

Expanding the dimension this way reports an error:

E       RuntimeError: Check 'i < input_shape.size()' failed at src/core/shape_inference/include/reshape_shape_inference.hpp:306:
E       While validating node 'opset1::Reshape aten::avg_pool3d/Reshape (opset1::Parameter input_tensor[0]:f32[?,?,?,?], opset1::Constant aten::avg_pool3d/Constant[0]:i64[5]) -> (dynamic[?,?,?,?,?])' with friendly_name 'aten::avg_pool3d/Reshape':
E       Shape inference input shapes {[?,?,?,?],[5]}
E       '0' dimension is out of range

I managed to expand the dimensions with v0::Unsqueeze instead, and it passed the tests for inputs without a batch dimension.

auto unsqueeze_axis = context.mark_node(v0::Constant::create(element::i64, Shape{}, {0}));
input = context.mark_node(std::make_shared<v0::Unsqueeze>(input, unsqueeze_axis));

However, since I cannot inspect dynamic shapes, this would also expand inputs that already have a batch dimension.
So I would like some guidance on whether I'm using v1::Reshape incorrectly, or whether there's a way to ascertain the rank of a dynamic shape.

@mvafin (Contributor, Author) commented Sep 24, 2024

@tianyiSKY1 It is my bad: I suggested using Reshape with zeros, but that will not work in this case. Using Unsqueeze is a good idea if you can detect the existence of the batch dimension during conversion. In case you cannot, I would recommend using Reshape and calculating the required shape with ShapeOf: Slice the shape with negative start and stop, and use ScatterElementsUpdate to insert the output of the Slice at the end of a tensor filled with -1, producing the target shape [-1,C,H,W...].
That is how to get the input shape.
The output shape can be calculated similarly: Slice the input shape with start=0 and end=-N, which yields the batch dimension (or an empty tensor if it doesn't exist). After that, use Concat to concatenate this batch tensor with the rest of the output shape (with the batch removed), which you can again get using Slice.
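The Slice + scatter trick for the input side reduces to the following shape arithmetic (a plain-Python sketch for intuition; in the frontend this is built from ShapeOf/Slice/ScatterElementsUpdate nodes, since the concrete shape is unknown at conversion time):

```python
def reshape_pattern(input_shape, num_spatial=3):
    # Start from a pattern of length num_spatial + 2 filled with -1, then
    # overwrite its tail with the last num_spatial + 1 known dims
    # (C plus the spatial dims), giving [-1, C, ...spatial].
    pattern = [-1] * (num_spatial + 2)
    tail = list(input_shape)[-(num_spatial + 1):]
    pattern[-len(tail):] = tail
    return pattern

# CHWD input: Reshape with this pattern makes -1 absorb the (empty) leading
# part, so a batch dimension of 1 is inferred.
print(reshape_pattern([3, 10, 10, 10]))      # [-1, 3, 10, 10, 10]
# NCHWD input: the same pattern lets -1 recover the existing batch dim.
print(reshape_pattern([2, 3, 10, 10, 10]))   # [-1, 3, 10, 10, 10]
```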

github-merge-queue bot pushed a commit that referenced this issue Oct 17, 2024
#20927 
### Details:
 - *add batch dimension before pool*
 - *remove batch dimension after pool*
@mvafin mvafin closed this as completed Oct 18, 2024
@github-project-automation github-project-automation bot moved this from Assigned to Closed in Good first issues Oct 18, 2024
@mlukasze mlukasze added this to the 2024.5 milestone Oct 21, 2024