Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Shengfang Zhai's picture
4

Shengfang Zhai

zsf
https://scholar.google.com/citations?user=bJYY-tIAAAAJ&hl=en
  • zhaisf

AI & ML interests

Trustworthy AI, Generative Models, AI Privacy, Backdoor Attacks

Organizations

None yet

upvoted a paper 2 months ago

Imperceptible Jailbreaking against Large Language Models

Paper • 2510.05025 • Published Oct 6 • 33
upvoted 2 papers 9 months ago

Membership Inference on Text-to-Image Diffusion Models via Conditional Likelihood Discrepancy

Paper • 2405.14800 • Published May 23, 2024 • 1

Text-to-Image Diffusion Models can be Easily Backdoored through Multimodal Data Poisoning

Paper • 2305.04175 • Published May 7, 2023 • 1
upvoted a paper 10 months ago

GuardReasoner: Towards Reasoning-based LLM Safeguards

Paper • 2501.18492 • Published Jan 30 • 88
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs