Rapid advances in Large Language Models (LLMs) and Vision-Language Models (VLMs) have shown great potential in medical diagnostics, particularly in radiology, where datasets pair images such as X-rays with human-generated diagnostic reports. However, a significant research gap e