Update README.md
Browse files
README.md
CHANGED
|
@@ -14,10 +14,10 @@ This repo contains the model weights for [Continue](https://continue.dev)'s stat
|
|
| 14 |
|
| 15 |
**Ollama**: We've released a [Q4_K_M GGUF quantization of Instinct](https://huggingface.co/continuedev/instinct-GGUF) for efficient local inference. Try it with [Continue's Ollama integration](https://docs.continue.dev/guides/ollama-guide).
|
| 16 |
|
| 17 |
-
|
| 18 |
|
| 19 |
-
SGLang
|
| 20 |
-
<br
|
| 21 |
|
| 22 |
## Learn more
|
| 23 |
|
|
|
|
| 14 |
|
| 15 |
**Ollama**: We've released a [Q4_K_M GGUF quantization of Instinct](https://huggingface.co/continuedev/instinct-GGUF) for efficient local inference. Try it with [Continue's Ollama integration](https://docs.continue.dev/guides/ollama-guide).
|
| 16 |
|
| 17 |
+
You can also serve the model using either of the below options, then [connect it with Continue](https://docs.continue.dev/guides/how-to-self-host-a-model).
|
| 18 |
|
| 19 |
+
**SGLang**: `python3 -m sglang.launch_server --model-path continuedev/instinct --load-format safetensors`
|
| 20 |
+
<br>**vLLM**: `vllm serve continuedev/instinct --served-model-name instinct --load-format safetensors`
|
| 21 |
|
| 22 |
## Learn more
|
| 23 |
|