psroi_pool

psroi_pool

paddle.fluid.layers.psroi_pool(input, rois, output_channels, spatial_scale, pooled_height, pooled_width, name=None)[源代码]

注意 rois必须为2维LoDTensor，lod_level为1

该OP执行PSROIPooling运算，是位置敏感的感兴趣区域池化方法（Position sensitive region of interest pooling，也称为PSROIPooling）。输入input是位置敏感的评分图，输入rois是感兴趣区域的位置坐标。PSROIPooling不同于普通ROIPooling的地方在于，输入input特征图的不同通道会跟输出特征图上的位置区域相关联，该方法是在R-FCN模型中首次提出来的，更多详细信息请参阅 https://arxiv.org/abs/1605.06409。

样例：

Given:
  input.shape = [2, 490, 28, 28]
  rois.shape = [5, 4], rois.lod = [[3, 2]]
  output_channels = 10
  pooled_height = 7
  pooled_width = 7
Return:
  out.shape = [5, 10, 7, 7], out.lod = [[3, 2]]

参数

input (Variable) - 输入特征图，4-D Tensor，格式是NCHW。其中N是batch_size，C是输入通道的数量，H是输入特征图的高度，W是特征图宽度。数据类型是float32或者float64
rois (Variable) - 感兴趣区域，2-D LoDTensor，形状为(num_rois, 4)，lod_level为1。形式如[x1, y1, x2, y2], …]，其中(x1, y1)为左上角坐标，(x2, y2)为右下角坐标。数据类型与input相同
output_channels (int) - 输出特征图的通道数。对于共C个种类的图像分类任务，output_channels应该是 (C + 1) ，其中1代表背景
spatial_scale (float) - 空间跨度因子，用于将 rois 中的坐标从其输入尺寸按比例映射到 input 特征图的尺寸
pooled_height (int) - 池化输出的高度
pooled_width (int) - 池化输出的宽度
name (str|None) - 该参数供开发人员打印调试信息时使用，具体用法请参见 Name ，默认值为None。

经过PSROIPooling之后的结果，形状为(num_rois，output_channels，pooled_height，pooled_width) 的4维LoDTensor，lod_level为1，数据类型与input相同，与rois具有相同的lod信息。

返回类型

Variable

代码示例

import paddle.fluid as fluid
x = fluid.layers.data(name='x', shape=[490, 28, 28], dtype='float32')
rois = fluid.layers.data(name='rois', shape=[4], lod_level=1, dtype='float32')
pool_out = fluid.layers.psroi_pool(x, rois, 10, 1.0, 7, 7)