Medicine, by its nature, is a multifaceted domain that requires the synthesis of information across various modalities. medical generative vision-language models (VLMs) make a first step in this direction and promise many exciting clinical applications. However, existing models typical