Spaces:
Running
Running
Update README.md
Browse files
README.md
CHANGED
|
@@ -8,7 +8,7 @@ pinned: false
|
|
| 8 |
short_description: Unsupervised Human Preference Learning
|
| 9 |
---
|
| 10 |
|
| 11 |
-
# [LINK TO PAPER](https://
|
| 12 |
|
| 13 |
|
| 14 |
# ⚙️ Preference Agents
|
|
@@ -54,7 +54,15 @@ Our approach was evaluated using GPT-4o and human evaluations, demonstrating sig
|
|
| 54 |
If you use our resources in your research or applications, please cite our paper:
|
| 55 |
|
| 56 |
```bibtex
|
| 57 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 58 |
```
|
| 59 |
|
| 60 |
## 🙏 Acknowledgements
|
|
|
|
| 8 |
short_description: Unsupervised Human Preference Learning
|
| 9 |
---
|
| 10 |
|
| 11 |
+
# [LINK TO PAPER](https://arxiv.org/abs/2410.03731)
|
| 12 |
|
| 13 |
|
| 14 |
# ⚙️ Preference Agents
|
|
|
|
| 54 |
If you use our resources in your research or applications, please cite our paper:
|
| 55 |
|
| 56 |
```bibtex
|
| 57 |
+
@misc{shashidhar2024unsupervisedhumanpreferencelearning,
|
| 58 |
+
title={Unsupervised Human Preference Learning},
|
| 59 |
+
author={Sumuk Shashidhar and Abhinav Chinta and Vaibhav Sahai and Dilek Hakkani-Tür},
|
| 60 |
+
year={2024},
|
| 61 |
+
eprint={2410.03731},
|
| 62 |
+
archivePrefix={arXiv},
|
| 63 |
+
primaryClass={cs.CL},
|
| 64 |
+
url={https://arxiv.org/abs/2410.03731},
|
| 65 |
+
}
|
| 66 |
```
|
| 67 |
|
| 68 |
## 🙏 Acknowledgements
|