Siamese v.s. Vanilla MoCo v.s. Enhanced MoCo
There are 2 metrics that we care about the most for the actual downstream applications. The total amount of identical items that we could capture through vision: reflected by Coverage. Among these recalled items, how many of them are true positive matches: reflected by Precision.
- Baseline model performs poorly on hard-dataset mainly due to vision disturbances.
- Vanilla MoCo shows great improvement. Good choice as requires no additional fine-tuning.
- Enhanced MoCo demonstrate the best performance in both coverage and precision