sumitranjan
/

PromptShield

Text Classification

Model card Files Files and versions

sumitranjan commited on May 19

Commit

bcd7c78

·

verified ·

1 Parent(s): 5e5543f

Update README.md

Files changed (1) hide show

README.md +17 -2

README.md CHANGED Viewed

@@ -1,3 +1,19 @@
 # 🛡️ PromptShield
 **PromptShield** is a prompt classification model designed to detect **unsafe**, **adversarial**, or **prompt injection** inputs. Built on the `xlm-roberta-base` transformer, it delivers high-accuracy performance in distinguishing between **safe** and **unsafe** prompts — achieving **99.33% accuracy** during training.
@@ -105,5 +121,4 @@ print("🟢 Safe" if prediction == 0 else "🔴 Unsafe")
 📄 License
-MIT License

+---
+license: mit
+datasets:
+- xTRam1/safe-guard-prompt-injection
+language:
+- en
+metrics:
+- accuracy
+base_model:
+- FacebookAI/xlm-roberta-base
+pipeline_tag: text-classification
+library_name: keras
+tags:
+- cybersecurity
+- llmsecurity
+---
 # 🛡️ PromptShield
 **PromptShield** is a prompt classification model designed to detect **unsafe**, **adversarial**, or **prompt injection** inputs. Built on the `xlm-roberta-base` transformer, it delivers high-accuracy performance in distinguishing between **safe** and **unsafe** prompts — achieving **99.33% accuracy** during training.
 📄 License
+MIT License