Resumen
In the field of computer vision, training a well-performing model on a dataset with a long-tail distribution is a challenging task. To address this challenge, image resampling is usually introduced as a simple and effective solution. However, when performing instance segmentation tasks, there may be multiple classes in one image. Hence, image resampling alone is not enough to obtain a sufficiently balanced distribution at the level of target data volume. In this paper, we propose an improved instance segmentation method for long-tail datasets based on Mask R-CNN. Specifically, an object-centric memory bank is used to establish an object-centric storage strategy that can solve the imbalance problem with respect to categories. In the testing phase, a post-processing calibration is used to adjust each class logit to change the confidence score, which improves the prediction score of tail classes. A discrete cosine transform-based mask is used to obtain high-quality masks, which improves segmentation accuracy. The evaluation of the proposed method on the LVIS dataset demonstrates its effectiveness. The proposed method improves the AP performance of EQL by 2.2%.