Intuitive Classification: A visual explanation of neural networks.
Image-to-Text Generation with GPT-2: Align CLIP's visual representation with GPT-2.