dueling bandits is a prominent framework for decision-making involving
preferential feedback, a valuable feature that fits various applications
involving human interaction, such as ranking, information retrieval, and
recommendation systems. While substantial efforts have been made to m