When labeled data is scarce for a specific target task, transfer learning
often offers an effective solution by utilizing data from a related source
task. However, when transferring knowledge from a less related source, it may
inversely hurt the target performance, a phenomenon known a