Given a query composed of a reference image and a relative caption, the goal of composed image retrieval is to retrieve images that are visually similar to the reference image and integrate the modifications expressed by the caption. Given that recent research has demonstrated the efficacy of larg