HyperAI

Fine-tuning Pix2Struct Using Hugging Face Transformers and Datasets

This tutorial is based heavily on the GiT tutorial and shows how to fine-tune GiT on a custom image captioning dataset.