PaddleHub/hub_module/modules/image/semantic_segmentation/deeplabv3p_xception65_humanseg out-of-memory issue #603

livingbody · 2020-05-16T18:15:21Z

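Environment: as of May 17, 2020
Module: the PaddleHub human segmentation module
PaddleHub/hub_module/modules/image/semantic_segmentation/deeplabv3p_xception65_humanseg
Symptom: the process runs out of memory when the amount of data is large.
Root-cause analysis: when segmenting, the module lets you set a batch size and processes the images a few at a time over many passes, but the return value hands back all of the data at once, including the processed images as numpy binary data. That is a very large amount of data, so memory runs out easily.
I have read the code, and that is where the problem is.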
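
For a rough sense of scale, here is a back-of-the-envelope sketch of how much memory the returned masks alone would need. It assumes each result is a single-channel uint8 array of shape [H, W] (one byte per pixel), which matches the 0-255 alpha description in the module's API text; the frame count and resolution are made-up illustrative values, and the real dtype may differ.

```python
def estimate_result_bytes(num_images, height, width, bytes_per_pixel=1):
    """Bytes needed to hold one mask per image, all in memory at once."""
    return num_images * height * width * bytes_per_pixel


# Hypothetical workload: 10,000 video frames at 1920x1080.
total = estimate_result_bytes(10_000, 1080, 1920)
print(f"{total / 2**30:.1f} GiB")
```

Under these assumptions the masks alone come to roughly 19 GiB, before counting the decoded input images or any copies made along the way.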

Steffy-zxf · 2020-05-18T01:54:18Z

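Hello! The segmentation interface of deeplabv3p_xception65_humanseg returns the segmentation results for all of the input images. If returning the results causes an out-of-memory error, you can try processing in batches, for example by passing only a single image in images.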

livingbody · 2020-05-18T04:38:35Z

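> Hello! The segmentation interface of deeplabv3p_xception65_humanseg returns the segmentation results for all of the input images. If returning the results causes an out-of-memory error, you can try processing in batches, for example by passing only a single image in images.

You are right, passing the input in batches does solve the problem.
However, segmentation already applies a batch size to its input, so if I also have to split the input myself, the batch size is basically pointless. Alternatively, it might be better for the return value to contain only the paths of the saved images.
It comes down to which approach to choose: either split the input manually and call the interface many times with a few images each, or keep the current behavior of passing a large amount of data to segmentation but return only the file names of the generated images after they are saved.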
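
The first option, splitting the input manually, can be sketched with a small generator so that only one chunk's results exist at a time. This is an illustration only: `chunked`, `segment_in_chunks`, and `segment_fn` are hypothetical names, and `segment_fn` stands in for whatever call into the module the caller actually makes.

```python
from typing import Callable, Iterator, List, Sequence


def chunked(items: Sequence[str], size: int) -> Iterator[List[str]]:
    """Yield consecutive slices of `items` with at most `size` elements each."""
    if size < 1:
        raise ValueError("size must be at least 1")
    for start in range(0, len(items), size):
        yield list(items[start:start + size])


def segment_in_chunks(segment_fn: Callable[[List[str]], list],
                      paths: Sequence[str],
                      chunk_size: int = 10) -> Iterator[list]:
    """Call `segment_fn` once per chunk and yield that chunk's results.

    `segment_fn` is a hypothetical stand-in for the real segmentation call.
    Because this is a generator, the previous chunk's results can be garbage
    collected as soon as the caller stops referencing them.
    """
    for chunk in chunked(paths, chunk_size):
        yield segment_fn(chunk)
```

A caller would loop over `segment_in_chunks(...)`, use or save each chunk's results, and move on without collecting them into one big list.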

livingbody · 2020-05-18T04:43:16Z

My temporary workaround:

import os

import numpy as np
import paddlehub as hub


def GetHumanSeg(in_path, out_path):
    # Load the model
    module = hub.Module(name="deeplabv3p_xception65_humanseg")
    # Collect the paths of all input images
    frame_path = in_path
    test_img_path = [os.path.join(frame_path, fname) for fname in os.listdir(frame_path)]
    print('file len: %d' % len(test_img_path))

    # Pass in 10 images per batch
    total_num = len(test_img_path)
    loop_num = int(np.ceil(total_num / 10))

    for iter_id in range(loop_num):
        handle_id = iter_id * 10
        # Slicing stops at the end of the list, so the last batch may be shorter
        batch_data = test_img_path[handle_id:handle_id + 10]
        for offset, path in enumerate(batch_data):
            print(handle_id + offset)
            print(path)
        batch_input_dict = {"image": batch_data}
        # The batch size of segmentation is basically useless at this point
        results = module.segmentation(data=batch_input_dict, use_gpu=True, visualization=True, output_dir=out_path)

Steffy-zxf · 2020-05-18T08:07:58Z

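> However, segmentation already applies a batch size to its input, so if I also have to split the input myself, the batch size is basically pointless. Alternatively, it might be better for the return value to contain only the paths of the saved images.

Thanks for the feedback! batch_size is the number of samples processed in one run of the program, while the segmentation prediction interface returns the prediction results for all samples (the segmentation results as well as the paths where the segmented images are saved).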

livingbody · 2020-05-18T09:04:52Z

The segmentation results in there include the binary image data, which is extremely large.

livingbody · 2020-05-18T09:16:45Z

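def segmentation(images=None, paths=None, batch_size=1, use_gpu=False, visualization=False, output_dir='humanseg_output')

Prediction API, used for human portrait segmentation.

Parameters
- images (list[numpy.ndarray]): image data; ndarray.shape is [H, W, C], BGR format;
- paths (list[str]): paths of the images;
- batch_size (int): size of a batch;
- use_gpu (bool): whether to use the GPU;
- visualization (bool): whether to save the results as image files;
- output_dir (str): directory in which the images are saved.

Returns
- res (list[dict]): list of results; each element is a dict with the keys 'save_path' and 'data', whose values are:
  - save_path (str, optional): path where the visualization image is saved (present only when visualization=True);
  - data (numpy.ndarray): the portrait segmentation result, containing only the alpha channel, with values from 0 to 255 (0 is fully transparent, 255 is opaque). In other words, the larger a pixel's value, the more likely it is part of a person; the smaller the value, the more likely it is background.

As this shows, the input is small, just image file names, while the output is large: data is the alpha-channel segmentation result of the matting, which is very big, and it is returned together with everything else. It would be better if it were saved to disk and only the file name returned, the same way the input is given.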
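
The idea of handing back only file names can be approximated on the caller's side without changing the module. The sketch below is not part of PaddleHub: `keep_save_paths` is a hypothetical helper, and it relies only on the result layout quoted above (a list of dicts with an optional 'save_path' key and a 'data' key).

```python
from typing import Dict, Iterable, List


def keep_save_paths(results: Iterable[Dict]) -> List[str]:
    """Return only the 'save_path' entries, leaving each 'data' array behind."""
    paths = []
    for item in results:
        # 'save_path' is present only when visualization=True, per the API text.
        path = item.get("save_path")
        if path is not None:
            paths.append(path)
    return paths
```

This only lowers memory use if it is applied to each small batch of results and the original result list is then dropped; applied once to the full return value, the large arrays have already been allocated.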

sunjunling666 · 2020-09-21T10:03:52Z

A follow-up question: does this module have to be run every time?
