多标签语义分割任务 multi-label semantic segmentation #2174

Wulx2050 · 2022-05-30T13:53:13Z

PaddleSeg 里面的任务和模型好像都是默认一个像素一个标签，但是有时候会碰到一个像素有多个标签的情况，比如这里：
https://www.kaggle.com/competitions/uw-madison-gi-tract-image-segmentation/overview

这个竞赛是一个多标签语义分割(multi-label semantic segmentation)任务，有些像素同时有两个以上的标签，输出是 Hight×Width×Num_class。Num_class>=2

我没搜索到 PaddleSeg 里面相关的教程和算法，请问我怎么处理这种多标签语义分割任务?

Wulx2050 · 2022-05-30T13:57:50Z

而且现在很多数据集的标注文件使用RLE(run-length encoding，游程编码，行程长度编码)的格式，为了方便麻烦加两个编码和解码的函数到 PaddleSeg 中。

import numpy as np

# ref: https://www.kaggle.com/paulorzp/run-length-encode-and-decode
# modified from: https://www.kaggle.com/inversion/run-length-decoding-quick-start
def rle_decode(mask_rle, shape, color=1):
    """ TBD
    
    Args:
        mask_rle (str): run-length as string formated (start length)
        shape (tuple of ints): (height,width) of array to return 
    
    Returns: 
        Mask (np.array)
            - 1 indicating mask
            - 0 indicating background

    """
    # Split the string by space, then convert it into a integer array
    s = np.array(mask_rle.split(), dtype=int)

    # Every even value is the start, every odd value is the "run" length
    starts = s[0::2] - 1
    lengths = s[1::2]
    ends = starts + lengths

    # The image image is actually flattened since RLE is a 1D "run"
    if len(shape)==3:
        h, w, d = shape
        img = np.zeros((h * w, d), dtype=np.float32)
    else:
        h, w = shape
        img = np.zeros((h * w,), dtype=np.float32)

    # The color here is actually just any integer you want!
    for lo, hi in zip(starts, ends):
        img[lo : hi] = color
        
    # Don't forget to change the image back to the original shape
    return img.reshape(shape)

# https://www.kaggle.com/namgalielei/which-reshape-is-used-in-rle
def rle_decode_top_to_bot_first(mask_rle, shape):
    """ TBD
    
    Args:
        mask_rle (str): run-length as string formated (start length)
        shape (tuple of ints): (height,width) of array to return 
    
    Returns:
        Mask (np.array)
            - 1 indicating mask
            - 0 indicating background

    """
    s = mask_rle.split()
    starts, lengths = [np.asarray(x, dtype=int) for x in (s[0:][::2], s[1:][::2])]
    starts -= 1
    ends = starts + lengths
    img = np.zeros(shape[0]*shape[1], dtype=np.uint8)
    for lo, hi in zip(starts, ends):
        img[lo:hi] = 1
    return img.reshape((shape[1], shape[0]), order='F').T  # Reshape from top -> bottom first

# ref.: https://www.kaggle.com/stainsby/fast-tested-rle
def rle_encode(img):
    """ TBD
    
    Args:
        img (np.array): 
            - 1 indicating mask
            - 0 indicating background
    
    Returns: 
        run length as string formated
    """
    
    pixels = img.flatten()
    pixels = np.concatenate([[0], pixels, [0]])
    runs = np.where(pixels[1:] != pixels[:-1])[0] + 1
    runs[1::2] -= runs[::2]
    return ' '.join(str(x) for x in runs)

juncaipeng · 2022-05-31T02:06:41Z

PaddleSeg 里面的任务和模型好像都是默认一个像素一个标签，但是有时候会碰到一个像素有多个标签的情况，比如这里： https://www.kaggle.com/competitions/uw-madison-gi-tract-image-segmentation/overview

这个竞赛是一个多标签语义分割(multi-label semantic segmentation)任务，有些像素同时有两个以上的标签，输出是 Hight×Width×Num_class。Num_class>=2

我没搜索到 PaddleSeg 里面相关的教程和算法，请问我怎么处理这种多标签语义分割任务?

你好，目前paddleseg不支持多标签语义分割。如果需要使用，需要手动修改模型结构、loss计算、训练和预测过程。

alexhmyang · 2023-08-16T03:53:21Z

PaddleSeg 里面的任务和模型好像都是默认一个像素一个标签，但是有时候会碰到一个像素有多个标签的情况，比如这里： https://www.kaggle.com/competitions/uw-madison-gi-tract-image-segmentation/overview
这个竞赛是一个多标签语义分割(multi-label semantic segmentation)任务，有些像素同时有两个以上的标签，输出是 Hight×Width×Num_class。Num_class>=2
我没搜索到 PaddleSeg 里面相关的教程和算法，请问我怎么处理这种多标签语义分割任务?

你好，目前paddleseg不支持多标签语义分割。如果需要使用，需要手动修改模型结构、loss计算、训练和预测过程。

一年了还不支持？ EISeg 可以标注分割类多标签，我标完了，结果 paddleseg报错不支持？

MINGtoMING · 2023-08-29T03:29:43Z

@alexhmyang 你好，我目前正在做paddleseg多标签语义分割的支持。多标签语义分割任务中图像上的某个像素点可以同时对应多个类别(普通语义分割是只能指向一个类别)，这样的话图像上的不同object之间可能会出现重叠，但EISeg现在并不支持不同object之间可以重叠这种模式。所以可以说一下你是怎样进行重叠区域d标注的吗？还有你的标注好的annotation的格式是什么样的？这样的话我可以提供更加便利的数据读取接口。谢谢！

Wulx2050 added the enhancement New feature or request label May 30, 2022

juncaipeng self-assigned this May 31, 2022

juncaipeng closed this as completed Jul 28, 2022

This was referenced Aug 16, 2023

新增需求征集（Collect Feature Request） #3363

Closed

为PaddleSeg添加多标签语义分割的功能 #3456

Closed

This was referenced Aug 23, 2023

🏅️飞桨套件快乐开源常规赛 PaddlePaddle/PaddleOCR#10223

Closed

CV套件建设专项活动 #3333

Closed

Ligoml mentioned this issue Aug 27, 2023

飞桨快乐开源活动全新升级🔥 PaddlePaddle/Paddle#56689

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

多标签语义分割任务 multi-label semantic segmentation #2174

多标签语义分割任务 multi-label semantic segmentation #2174

Wulx2050 commented May 30, 2022

Wulx2050 commented May 30, 2022 •

edited

Loading

juncaipeng commented May 31, 2022

alexhmyang commented Aug 16, 2023

MINGtoMING commented Aug 29, 2023

多标签语义分割任务 multi-label semantic segmentation #2174

多标签语义分割任务 multi-label semantic segmentation #2174

Comments

Wulx2050 commented May 30, 2022

Wulx2050 commented May 30, 2022 • edited Loading

juncaipeng commented May 31, 2022

alexhmyang commented Aug 16, 2023

MINGtoMING commented Aug 29, 2023

Wulx2050 commented May 30, 2022 •

edited

Loading