Interest Grouping (RoI):
This is a type of union layer that performs maximum union at the inputs (here, convnet function maps) of uneven size and creates a small map of objects of a fixed size (say, 7x7). The choice of this fixed size is a hyperparameter of the network and is predetermined.
The main goal of such an association is to accelerate the time of training and testing, as well as to train the entire system from end-to-end (in a joint way).
Due to the use of this combining layer, the training and testing time is faster compared to the original (vanilla?) R-CNN architecture and, therefore, the name Fast R-CNN.
A simple example (from Area of Interests Explained by deepsense.io ):

source share