Shengfang Zhai's picture

4

Shengfang Zhai

zsf

https://scholar.google.com/citations?user=bJYY-tIAAAAJ&hl=en

zhaisf

AI & ML interests

Trustworthy AI, Generative Models, AI Privacy, Backdoor Attacks

Organizations

None yet

upvoted a paper 2 months ago

Imperceptible Jailbreaking against Large Language Models

Paper • 2510.05025 • Published Oct 6 • 33

upvoted 2 papers 9 months ago

Membership Inference on Text-to-Image Diffusion Models via Conditional Likelihood Discrepancy

Paper • 2405.14800 • Published May 23, 2024 • 1

Text-to-Image Diffusion Models can be Easily Backdoored through Multimodal Data Poisoning

Paper • 2305.04175 • Published May 7, 2023 • 1

upvoted a paper 10 months ago

GuardReasoner: Towards Reasoning-based LLM Safeguards

Paper • 2501.18492 • Published Jan 30 • 88