| --- |
| base_model: |
| - Delta-Vector/Austral-Xgen-9B-Winton |
| library_name: transformers |
| tags: |
| - fine-tuning |
| - prose |
| - GRPO |
| - axolotl |
| - finetune |
| - roleplaying |
| - creative-writing |
| datasets: |
| - PocketDoc/Dans-Systemmaxx |
| --- |
| <style> |
| body { |
| font-family: 'Quicksand', sans-serif; |
| background: linear-gradient(135deg, #f9ffd1 0%, #e2fab5 100%); |
| color: #000000; |
| margin: 0; |
| padding: 0; |
| font-size: 16px; |
| } |
| .container { |
| margin: 20px; |
| background-color: rgba(255, 255, 255, 0.9); |
| padding: 20px; |
| border-radius: 12px; |
| box-shadow: 0 4px 20px rgba(0, 0, 0, 0.3); |
| border: 3px solid #000000; |
| position: relative; |
| } |
| .header h1 { |
| font-size: 28px; |
| color: #000000; |
| margin: 0 0 20px 0; |
| text-align: center; |
| text-decoration: underline; |
| } |
| .section { |
| margin-top: 30px; |
| } |
| .section h2 { |
| font-size: 24px; |
| color: #000000; |
| text-align: center; |
| text-decoration: underline; |
| } |
| .info p { |
| color: #000000; |
| line-height: 1.6; |
| font-size: 16px; |
| } |
| .info img { |
| width: 85%; |
| border-radius: 10px; |
| margin: 0 auto 15px; |
| display: block; |
| box-shadow: 0 0 20px rgba(0, 0, 0, 0.3); |
| border: 1px solid #000000; |
| } |
| a { |
| color: #000000; |
| text-decoration: none; |
| transition: color 0.2s ease; |
| } |
| a:hover { |
| color: #538125; |
| } |
| .button { |
| display: inline-block; |
| background-color: rgba(106, 168, 79, 0.8); |
| color: #000000; |
| padding: 10px 20px; |
| border-radius: 5px; |
| cursor: pointer; |
| text-decoration: none; |
| transition: background-color 0.2s ease, box-shadow 0.2s ease; |
| } |
| .button:hover { |
| background-color: #538125; |
| box-shadow: 0 0 15px rgba(106, 168, 79, 0.5); |
| } |
| pre { |
| background-color: rgba(240, 248, 225, 0.95); |
| padding: 15px; |
| border-radius: 5px; |
| overflow-x: auto; |
| border: 1px solid #000000; |
| } |
| code { |
| font-family: 'Courier New', monospace; |
| color: #000000; |
| } |
| .info-card { |
| background: rgba(249, 255, 235, 0.95); |
| border: 1px solid #000000; |
| border-radius: 8px; |
| overflow: hidden; |
| } |
| .info-header { |
| background: rgba(106, 168, 79, 0.1); |
| padding: 20px; |
| border-bottom: 1px solid #000000; |
| } |
| .info-header h3 { |
| color: #000000; |
| margin: 0 0 10px 0; |
| font-size: 20px; |
| text-align: center; |
| text-decoration: underline; |
| } |
| .model-tags { |
| display: flex; |
| gap: 8px; |
| flex-wrap: wrap; |
| } |
| .model-tag { |
| background: rgba(106, 168, 79, 0.1); |
| color: #000000; |
| padding: 4px 8px; |
| border-radius: 4px; |
| font-size: 12px; |
| border: 1px solid #000000; |
| } |
| .model-composition { |
| padding: 20px; |
| border-bottom: 1px solid #000000; |
| } |
| .model-composition h4 { |
| color: #000000; |
| margin: 0 0 15px 0; |
| font-size: 16px; |
| text-align: center; |
| text-decoration: underline; |
| } |
| .composition-list { |
| list-style: none; |
| padding: 0; |
| margin: 0; |
| display: grid; |
| gap: 10px; |
| } |
| .composition-list li { |
| color: #000000; |
| display: flex; |
| align-items: baseline; |
| gap: 8px; |
| } |
| .model-component { |
| font-weight: 500; |
| min-width: 120px; |
| } |
| .model-description { |
| padding: 20px; |
| background: rgba(255, 255, 255, 0.5); |
| } |
| .metrics-section { |
| margin-bottom: 30px; |
| } |
| .metrics-section details { |
| background: rgba(249, 255, 235, 0.95); |
| border: 1px solid #000000; |
| border-radius: 8px; |
| padding: 15px; |
| margin-bottom: 15px; |
| } |
| .metrics-section summary { |
| color: #000000; |
| font-size: 18px; |
| cursor: pointer; |
| outline: none; |
| padding: 5px 0; |
| text-align: center; |
| } |
| .creator-section { |
| margin: 20px 0; |
| } |
| .creator-badge { |
| display: inline-flex; |
| align-items: center; |
| background: rgba(249, 255, 235, 0.95); |
| border: 1px solid #000000; |
| border-radius: 8px; |
| padding: 10px 15px; |
| } |
| .creator-label { |
| color: #000000; |
| font-size: 14px; |
| margin-right: 8px; |
| } |
| .creator-link { |
| display: flex; |
| align-items: center; |
| gap: 5px; |
| color: #000000; |
| text-decoration: none; |
| transition: all 0.2s ease; |
| } |
| .creator-name { |
| font-weight: 600; |
| } |
| .creator-arrow { |
| font-size: 16px; |
| transition: transform 0.2s ease; |
| } |
| .creator-link:hover .creator-arrow { |
| transform: translateX(3px); |
| } |
| .link-arrow { |
| display: inline-block; |
| transition: transform 0.2s ease; |
| } |
| a:hover .link-arrow { |
| transform: translateX(3px); |
| } |
| .axolotl-container { |
| text-align: center; |
| margin: 30px 0; |
| display: flex; |
| justify-content: center; |
| align-items: center; |
| } |
| .axolotl-container img { |
| max-width: 200px; |
| width: 200px; |
| height: auto; |
| border-radius: 8px; |
| box-shadow: 0 2px 10px rgba(0, 0, 0, 0.3); |
| border: 1px solid #000000; |
| background-color: #000000; |
| padding: 15px; |
| object-fit: contain; |
| display: block; |
| margin: 0 auto; |
| } |
| </style> |
| <html lang="en"> |
| <head> |
| <meta charset="UTF-8"> |
| <meta name="viewport" content="width=device-width, initial-scale=1.0"> |
| <title>Nanuq-R1 9B</title> |
| <link href="https://fonts.googleapis.com/css2?family=Quicksand:wght@400;500;600&display=swap" rel="stylesheet"> |
| </head> |
| <body> |
| <div class="container"> |
| <div class="header"> |
| <h1>Nanuq-R1 9B</h1> |
| </div> |
| <div class="info"> |
| <img src="https://files.catbox.moe/i77ubj.webp" alt="Model banner"> |
| <div style="text-align: center;"> |
| <div class="creator-section"> |
| <div class="creator-badge"> |
| <span class="creator-label">Created by</span> |
| <a href="https://huggingface.co/Delta-Vector" target="_blank" class="creator-link"> |
| <span class="creator-name">Delta-Vector</span> |
| <span class="creator-arrow">→</span> |
| </a> |
| </div> |
| </div> |
| <div class="model-info"> |
| <h2>Model Information</h2> |
| <div class="info-card"> |
| <div class="info-header"> |
| <h3>Nanuq-R1 9B</h3> |
| <div class="model-tags"> |
| <span class="model-tag">GRPO Experiment</span> |
| <span class="model-tag">Austral finetune</span> |
| <span class="model-tag">Creative &amp; refreshing prose with deep instruction following</span> |
| </div> |
| </div> |
| <div class="model-description"> |
| <p>A sequel! The new Nanuq series is meant to serve as a testing ground for my GRPO experiments. This model is meant to have strong instruction following and system prompt adherence in creative scenarios.</p> |
| <p>It is built on top of Austral Xgen 9B. I made an RL environment using PrimeIntellect-ai/verifiers and implemented InternLM/POLAR in it, then fine-tuned the model for 150 steps on PocketDoc's Systemmaxx dataset. This model is the result.</p> |
| <p>There are a lot of things I could do differently, as the reward flattens out almost as soon as warm-up ends. Still, the model turned out pretty decent, so I decided to release it. Hope people enjoy it!</p> |
| </div> |
| </div> |
| </div> |
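| <p>For readers unfamiliar with GRPO, the core idea can be sketched in a few lines: several completions are sampled for the same prompt, each is scored, and every score is normalized against its own group. The snippet below is only an illustration of that normalization step, not the training code used for this model and not the verifiers API. The function name, the example rewards, the <code>eps</code> value, and the use of the population standard deviation are all assumptions.</p> |

```python
from statistics import mean, pstdev
from typing import List


def group_relative_advantages(rewards: List[float], eps: float = 1e-6) -> List[float]:
    """Normalize each reward against the mean and spread of its own group.

    Hypothetical helper for illustration; `eps` avoids division by zero
    when every completion in the group receives the same reward.
    """
    mu = mean(rewards)
    sigma = pstdev(rewards)  # population std; some implementations use the sample std
    return [(r - mu) / (sigma + eps) for r in rewards]


# Made-up rewards for four completions sampled from one prompt.
rewards = [0.2, 0.5, 0.9, 0.4]
advantages = group_relative_advantages(rewards)

# A group where every completion scores the same carries no learning signal.
flat_advantages = group_relative_advantages([1.0, 1.0])
```

| <p>Completions that score above their group's mean get a positive advantage and those below get a negative one. When all rewards in a group are identical, every advantage is zero.</p> |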
| <div class="section"> |
| <h2>Quantized Versions</h2> |
| <div class="info-card"> |
| <div class="model-composition"> |
| <h4>Available Downloads</h4> |
| <ul class="composition-list"> |
| <li><span class="model-component"><a href="" target="_blank">GGUF Format</a></span>For use with llama.cpp and forks (coming soon!)</li> |
| <li><span class="model-component"><a href="" target="_blank">EXL2 Format</a></span>For use with TabbyAPI (coming soon!)</li> |
| </ul> |
| </div> |
| </div> |
| </div> |
| <div class="section"> |
| <h2>Prompting</h2> |
| <p>The model has been tuned with ChatML formatting. A typical input looks like this:</p> |
| <pre><code>"""<|im_start|>user |
| Hi there!<|im_end|> |
| <|im_start|>assistant |
| Nice to meet you!<|im_end|> |
| <|im_start|>user |
| Can I ask a question?<|im_end|> |
| <|im_start|>assistant |
| """</code></pre> |
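| <p>If you are assembling prompts by hand instead of relying on a tokenizer chat template, the format above can be produced with a small helper. This is a minimal sketch: <code>build_chatml_prompt</code> and its parameters are hypothetical names, not part of any library.</p> |

```python
from typing import Dict, List

IM_START = "<|im_start|>"
IM_END = "<|im_end|>"


def build_chatml_prompt(messages: List[Dict[str, str]],
                        add_generation_prompt: bool = True) -> str:
    """Render a list of {"role", "content"} messages as a ChatML string.

    Hypothetical helper for illustration only.
    """
    parts = []
    for message in messages:
        # Each turn: start token, role, newline, content, end token, newline.
        parts.append(f"{IM_START}{message['role']}\n{message['content']}{IM_END}\n")
    if add_generation_prompt:
        # Leave an open assistant turn for the model to complete.
        parts.append(f"{IM_START}assistant\n")
    return "".join(parts)


messages = [
    {"role": "user", "content": "Hi there!"},
    {"role": "assistant", "content": "Nice to meet you!"},
    {"role": "user", "content": "Can I ask a question?"},
]
prompt = build_chatml_prompt(messages)
print(prompt)
```

| <p>A system prompt can be supplied the same way, as a first message with the role <code>system</code>.</p> |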
| </div> |
| <div class="section"> |
| <h2>System Prompting</h2> |
| <p>I would highly recommend using either Euryale's system prompt or the EVA system prompt with the model.</p> |
| <div class="metrics-section"> |
| <details> |
| <summary>See Sao10k's Euryale System Prompt</summary> |
| <pre><code>Currently, your role is {{char}}, described in detail below. As {{char}}, continue the narrative exchange with {{user}}. |
| <Guidelines> |
| • Maintain the character persona but allow it to evolve with the story. |
| • Be creative and proactive. Drive the story forward, introducing plotlines and events when relevant. |
| • All types of outputs are encouraged; respond accordingly to the narrative. |
| • Include dialogues, actions, and thoughts in each response. |
| • Utilize all five senses to describe scenarios within {{char}}'s dialogue. |
| • Use emotional symbols such as "!" and "~" in appropriate contexts. |
| • Incorporate onomatopoeia when suitable. |
| • Allow time for {{user}} to respond with their own input, respecting their agency. |
| • Act as secondary characters and NPCs as needed, and remove them when appropriate. |
| • When prompted for an Out of Character [OOC:] reply, answer neutrally and in plaintext, not as {{char}}. |
| </Guidelines> |
| |
| <Forbidden> |
| • Using excessive literary embellishments and purple prose unless dictated by {{char}}'s persona. |
| • Writing for, speaking, thinking, acting, or replying as {{user}} in your response. |
| • Repetitive and monotonous outputs. |
| • Positivity bias in your replies. |
| • Being overly extreme or NSFW when the narrative context is inappropriate. |
| </Forbidden> |
|
|
| Follow the instructions in <Guidelines></Guidelines>, avoiding the items listed in <Forbidden></Forbidden>.</code></pre> |
| </details> |
| </div> |
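| <p>The <code>{{char}}</code> and <code>{{user}}</code> placeholders in the prompt above are normally filled in by the frontend. If you call the model directly, you may need to substitute them yourself. The sketch below shows one way to do that; <code>fill_placeholders</code> and the names "Aria" and "Sam" are made up for illustration.</p> |

```python
def fill_placeholders(template: str, char: str, user: str) -> str:
    """Replace the {{char}} and {{user}} placeholders in a system prompt.

    Hypothetical helper for illustration; frontends usually do this for you.
    """
    return template.replace("{{char}}", char).replace("{{user}}", user)


# First line of the Euryale system prompt, used here as a short example.
template = (
    "Currently, your role is {{char}}, described in detail below. "
    "As {{char}}, continue the narrative exchange with {{user}}."
)
system_prompt = fill_placeholders(template, char="Aria", user="Sam")
print(system_prompt)
```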
| </div> |
| <div class="section"> |
| <h2>Training</h2> |
| <p>Training ran for 150 steps on 8 x <a href="https://www.nvidia.com/en-us/data-center/h200/">H200</a> GPUs.</p> |
| <p style="text-align: center; margin-top: 20px;"> |
| <div class="axolotl-container"> |
| <a href="https://github.com/PrimeIntellect-ai/verifiers" target="_blank"> |
| <img src="https://files.catbox.moe/j36qbe.png" alt="Built with Verifiers"> |
| </a> |
| </div> |
| <div class="section"> |
| <h2>Credits</h2> |
| <p>Thank you to <a href="https://huggingface.co/intervitens">Intervitens</a>, <a href="https://huggingface.co/cgato">Cgato</a>, <a href="https://huggingface.co/kubernetes-bad">Kubernetes Bad</a>, <a href="https://huggingface.co/snwy">Snwy</a>, <a href="https://huggingface.co/AuriAetherwiing">Auri</a>, <a href="https://x.com/willccbb">Will Brown</a> and most of all: <a href="https://huggingface.co/kalomaze">Kalomaze</a>.</p> |
| </div> |
| </div> |
| </div> |