Self-supervised learning

Space-Time Crop & Attend: Improving Cross-modal Video Representation Learning

The quality of the image representations obtained from self-supervised learning depends strongly on the type of data augmentations used in the learning formulation. Recent papers have ported these methods from still images to videos and found that …

On Compositions of Transformations in Contrastive Self-Supervised Learning

In the image domain, excellent representations can be learned by inducing invariance to content-preserving transformations via noise contrastive learning. In this paper, we generalize contrastive learning to a wider set of transformations, and their …