PyTorch
English
llama
desaifan-mbzuai commited on
Commit
35cf92c
Β·
verified Β·
1 Parent(s): 1720ac7
Files changed (1) hide show
  1. README.md +8 -4
README.md CHANGED
@@ -7,20 +7,24 @@ language:
7
 
8
  # **K2-V2**
9
 
10
- πŸ“š [Tech Report](arxiv_url) - πŸ“ [Code](github_url) - 🏒 [Project Page](self_url)
 
 
 
 
11
 
12
  K2-V2 is our best fully open source model to date and ranked among the best open weight models of its class. As the latest base model in the LLM360's strongest project family, K2 features a dense architecture with 70 billion parameters.
13
 
14
- <img src="figures/sft-models.pdf" alt="k2-sft-aime"/>
15
 
16
  Beyond standard competencies like knowledge and conversation, K2 provides advanced capabilities, including long context consistency, deep mathematical knowledge, and reasoning behaviors. These serve as foundational building blocks that enable sophisticated downstream use cases, such as solving complex math problems and executing agentic workflows.
17
 
18
 
19
- <img src="figures/base-models.pdf" alt="k2-base-gpqa"/>
20
 
21
  ---
22
 
23
- ## **Usage**
24
 
25
  ```python
26
  from transformers import AutoModelForCausalLM, AutoTokenizer
 
7
 
8
  # **K2-V2**
9
 
10
+ πŸ“š [Tech Report](https://www.llm360.ai/reports/K2_V2_report.pdf ) - πŸ“ [Code](github_url) - 🏒 [Project Page](https://huggingface.co/LLM360/K2-V2)
11
+
12
+ <img src="figures/banner.png" alt="k2-banner-placeholder"/>
13
+
14
+ <br>
15
 
16
  K2-V2 is our best fully open source model to date and ranked among the best open weight models of its class. As the latest base model in the LLM360's strongest project family, K2 features a dense architecture with 70 billion parameters.
17
 
18
+ <img src="figures/sft-models.png" width="400" alt="k2-sft-aime"/>
19
 
20
  Beyond standard competencies like knowledge and conversation, K2 provides advanced capabilities, including long context consistency, deep mathematical knowledge, and reasoning behaviors. These serve as foundational building blocks that enable sophisticated downstream use cases, such as solving complex math problems and executing agentic workflows.
21
 
22
 
23
+ <img src="figures/base-models.png" width="400" alt="k2-base-gpqa"/>
24
 
25
  ---
26
 
27
+ ## **Quick Start**
28
 
29
  ```python
30
  from transformers import AutoModelForCausalLM, AutoTokenizer