The models of the paper "X-Boundary: Establishing Exact Safety Boundary to Shield LLMs from Multi-Turn Jailbreaks without Compromising Usability".
Xiaoya Lu
Ursulalala
·
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 20 hours ago
DeepSight: An All-in-One LM Safety Toolkit
upvoted
a
paper
18 days ago
AgentDoG: A Diagnostic Guardrail Framework for AI Agent Safety and Security
authored
a paper
30 days ago
Geometrically-Constrained Agent for Spatial Reasoning