[Fix] Fix dimension error when using slide inference with Mask2Former head by Joris-Kuehl-TU-Berlin · Pull Request #3752 · open-mmlab/mmsegmentation

Joris-Kuehl-TU-Berlin · 2024-08-05T14:39:16Z

Motivation

The method predict in mmseg/models/decode_heads/mask2former_head.py is incompatible with the 'slide' inference mode.

To elaborate, on line 280 of slide_inference in mmseg/models/segmentors/encoder_decoder.py, the key 'img_shape' of batch_img_metas[0] is overwritten with the shape of the cropped image / sliding window.

In line 353 of mmseg/models/decode_heads/decode_head.py, 'img_shape' is the first key used to get the target size for upsampling. As such, crop_seg_logits will have the same shape as crop_img.

In mmseg/models/decode_heads/mask2former_head.py, the first shape that is referenced as a target for upsampling is 'pad_shape' instead. Since slide_inference does not overwrite this key, each cropped image is upsampled to the size of the full image, and then further padded with zeros by slide_inference, leading to a dimension mismatch when trying to add up the padded crop_seg_logits. This leads to the issue described in #3666.
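The shape arithmetic behind this mismatch can be illustrated with a minimal, framework-free sketch. It assumes that slide_inference zero-pads each window's logits by the window's offsets within the full image; the helper name `padded_shape` and the image and window sizes are hypothetical and chosen only for illustration.

```python
def padded_shape(logit_hw, window, full_hw):
    """Shape of a window's logits after zero-padding back onto the full canvas.

    `window` is (y1, y2, x1, x2). The padding adds y1 rows above, H - y2 rows
    below, x1 columns to the left and W - x2 columns to the right.
    This helper is hypothetical and only mirrors the arithmetic.
    """
    y1, y2, x1, x2 = window
    full_h, full_w = full_hw
    logit_h, logit_w = logit_hw
    return (logit_h + y1 + (full_h - y2), logit_w + x1 + (full_w - x2))


full_hw = (1024, 2048)     # assumed full (padded) image size
window = (0, 512, 0, 512)  # assumed 512x512 window at the top-left corner

# Logits resized to the window's own shape: padding restores the full size.
ok = padded_shape((512, 512), window, full_hw)

# Logits resized to the full image (stale 'pad_shape'): padding overshoots,
# so the result can no longer be added to the full-size accumulator.
bad = padded_shape(full_hw, window, full_hw)

print(ok, bad)
```

In the first case the padded logits match the full image, while in the second they are larger by the amount of padding, which is the kind of mismatch that surfaces when the padded crop_seg_logits are summed.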

Modification

I have simply adjusted the size selection in mmseg/models/decode_heads/mask2former_head.py to match that of mmseg/models/decode_heads/decode_head.py. As a result, a dimension mismatch no longer occurs when using slide inference with a Mask2Former head.
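As a rough sketch of why the order of the key lookup matters, the snippet below contrasts the two orderings on a metadata dict whose 'img_shape' has been overwritten for a window. Both helper functions are hypothetical simplifications written for this illustration; the actual checks in decode_head.py and mask2former_head.py may include further conditions.

```python
def size_img_shape_first(img_meta):
    """Hypothetical: prefer 'img_shape', fall back to 'pad_shape'."""
    if 'img_shape' in img_meta:
        return tuple(img_meta['img_shape'][:2])
    return tuple(img_meta['pad_shape'][:2])


def size_pad_shape_first(img_meta):
    """Hypothetical: prefer 'pad_shape', fall back to 'img_shape'."""
    if 'pad_shape' in img_meta:
        return tuple(img_meta['pad_shape'][:2])
    return tuple(img_meta['img_shape'][:2])


# Metadata for the full image (sizes are assumed for illustration).
full_meta = {'img_shape': (1024, 2048), 'pad_shape': (1024, 2048)}

# During slide inference only 'img_shape' is overwritten with the window shape.
window_meta = dict(full_meta, img_shape=(512, 512))

new_size = size_img_shape_first(window_meta)  # follows the window
old_size = size_pad_shape_first(window_meta)  # still the full image

print(new_size, old_size)
```

With 'img_shape' consulted first, the upsampling target tracks the sliding window; with 'pad_shape' consulted first, it stays at the full-image size.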

Checklist

Pre-commit or other linting tools are used to fix the potential lint issues.
The modification is covered by complete unit tests. If not, please add more unit tests to ensure correctness.
If the modification has potential influence on downstream projects, this PR should be tested with downstream projects, like MMDet or MMDet3D.
The documentation has been modified accordingly, like docstring or example tutorials.

londumas · 2024-11-08T14:23:10Z

Thank you for this PR, it works.

Fix open-mmlab#3666

59c67b0

Joris-Kuehl-TU-Berlin wants to merge 1 commit into open-mmlab:dev-1.x from Joris-Kuehl-TU-Berlin:make-mask2former-slide-compatible