Update README.md
Browse files
README.md
CHANGED
|
@@ -1,3 +1,19 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
# π‘οΈ PromptShield
|
| 2 |
|
| 3 |
**PromptShield** is a prompt classification model designed to detect **unsafe**, **adversarial**, or **prompt injection** inputs. Built on the `xlm-roberta-base` transformer, it delivers high-accuracy performance in distinguishing between **safe** and **unsafe** prompts β achieving **99.33% accuracy** during training.
|
|
@@ -105,5 +121,4 @@ print("π’ Safe" if prediction == 0 else "π΄ Unsafe")
|
|
| 105 |
|
| 106 |
π License
|
| 107 |
|
| 108 |
-
MIT License
|
| 109 |
-
|
|
|
|
| 1 |
+
---
|
| 2 |
+
license: mit
|
| 3 |
+
datasets:
|
| 4 |
+
- xTRam1/safe-guard-prompt-injection
|
| 5 |
+
language:
|
| 6 |
+
- en
|
| 7 |
+
metrics:
|
| 8 |
+
- accuracy
|
| 9 |
+
base_model:
|
| 10 |
+
- FacebookAI/xlm-roberta-base
|
| 11 |
+
pipeline_tag: text-classification
|
| 12 |
+
library_name: keras
|
| 13 |
+
tags:
|
| 14 |
+
- cybersecurity
|
| 15 |
+
- llmsecurity
|
| 16 |
+
---
|
| 17 |
# π‘οΈ PromptShield
|
| 18 |
|
| 19 |
**PromptShield** is a prompt classification model designed to detect **unsafe**, **adversarial**, or **prompt injection** inputs. Built on the `xlm-roberta-base` transformer, it delivers high-accuracy performance in distinguishing between **safe** and **unsafe** prompts β achieving **99.33% accuracy** during training.
|
|
|
|
| 121 |
|
| 122 |
π License
|
| 123 |
|
| 124 |
+
MIT License
|
|
|