BriefGPT.xyz
Jan, 2024
Image Safeguarding: Reasoning with Conditional Vision Language Model and Obfuscating Unsafe Content Counterfactually
Mazal Bethany, Brandon Wherry, Nishant Vishwamitra, Peyman Najafirad
TL;DR
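Social media platforms combine AI and human moderation to obfuscate images containing unsafe content and so improve user safety. This work studies how to give a rationale for obfuscating an image and how to keep the obfuscated region minimal, and it demonstrates the effectiveness of the proposed method experimentally.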
Abstract
Social media platforms are being increasingly used by malicious actors to share unsafe content, such as images depicting sexual activity, cyberbullying, and self-harm. Consequently, major platforms use
→