Foundation models such as the recently introduced Segment Anything Model (SAM) have achieved remarkable results in image segmentation tasks. However, these models typically require user interaction through handcrafted prompts such as bounding boxes, which limits their deployment to dow