Search for a command to run...
Vision Transformer Based Model for Describing a Set of Images as a Story