May 16, 2025, Adversarial Preference Learning for Robust LLM Alignment is accepted by ACL2025.