Apply the max roi pooling to input. [Girshick, 2015].

The boxes should be packed with a shape like \((N, 5)\), where \(N\) is the number of boxes.

Each box is a 5d sequence containing (batch_index, x1, y1, x2, y2).

  • input (dragon.vm.torch.Tensor) – The input tensor.
  • boxes (dragon.vm.torch.Tensor) – The box coordinates.
  • output_size (Sequence[int]) – The output height and width.
  • spatial_scale (float, optional, default=1.) – The input scale to the size of boxes.

dragon.vm.torch.Tensor – The output tensor.