Search for a command to run...
LAVT: Language-Aware Vision Transformer for Referring Image Segmentation