Vision 15
- Dynamic Head: Unifying Object Detection Heads with Attentions
- COOT:Cooperative Hierarchical Transformer for Video-Text Representation Learning
- Stanford CS231n Lec 02. Image Classification
- Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks
- Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks (DCGAN)
- Unsupervised Intra-domain Adaptation for Semantic Segmentation
- Multimodal Unsupervised Image-to-Image Translation (MUNIT)
- DETR:End-to-End Object Detection with Transformers
- MMDetection 사용하기
- An Image is Worth 16x16 Words:Transformers for Image Recognition at Scale(ViT)
- StyleGAN:A Style-Based Generator Architecture for Generative Adversarial Networks
- HarDNet:A Low Memory Traffic network
- Generative Adversarial Nets
- You Only Look Once(YOLO):Unified, Real-Time Object Detection
- Faster R-CNN:Towards Real-Time Object Detection with Region Proposal Networks