ollama

Devon Rifkin 5f57b0ef42 add thinking support to the api and cli (#10584)	2025-05-28 19:38:52 -07:00

- Both `/api/generate` and `/api/chat` now accept a `"think"` option that allows specifying whether thinking mode should be on or not
- Templates get passed this new option so, e.g., qwen3's template can put `/think` or `/no_think` in the system prompt depending on the value of the setting
- Models' thinking support is inferred by inspecting model templates. The prefix and suffix the parser uses to identify thinking support is also automatically inferred from templates
- Thinking control & parsing is opt-in via the API to prevent breaking existing API consumers. If the `"think"` option is not specified, the behavior is unchanged from previous versions of ollama
- Add parsing for thinking blocks in both streaming/non-streaming mode in both `/generate` and `/chat`
- Update the CLI to make use of these changes. Users can pass `--think` or `--think=false` to control thinking, or during an interactive session they can use the commands `/set think` or `/set nothink`
- A `--hidethinking` option has also been added to the CLI. This makes it easy to use thinking in scripting scenarios like `ollama run qwen3 --think --hidethinking "my question here"` where you just want to see the answer but still want the benefits of thinking models
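The opt-in behaviour of the `"think"` option described in the commit message above can be illustrated from the client side. The following is a minimal sketch, not code from this repository: the `chatMessage` and `chatRequest` types and the `buildChatRequest` helper are hypothetical names, and the request body is trimmed to the fields needed for the illustration. It assumes that omitting `"think"` from the JSON body is how a client leaves the setting unspecified, which is modelled here with a pointer and `omitempty`.

```go
package main

import (
	"encoding/json"
	"fmt"
)

// chatMessage is a hypothetical, trimmed-down message shape for illustration.
type chatMessage struct {
	Role    string `json:"role"`
	Content string `json:"content"`
}

// chatRequest is a hypothetical request body for /api/chat.
// Think is a pointer so that "not specified" (nil) can be told apart
// from an explicit false; omitempty drops the key when it is nil.
type chatRequest struct {
	Model    string        `json:"model"`
	Messages []chatMessage `json:"messages"`
	Think    *bool         `json:"think,omitempty"`
}

// buildChatRequest serializes a single-turn request. Passing think == nil
// leaves the "think" key out of the JSON entirely.
func buildChatRequest(model, prompt string, think *bool) ([]byte, error) {
	req := chatRequest{
		Model:    model,
		Messages: []chatMessage{{Role: "user", Content: prompt}},
		Think:    think,
	}
	return json.Marshal(req)
}

func main() {
	on, off := true, false
	// Unspecified, explicitly on, explicitly off.
	for _, think := range []*bool{nil, &on, &off} {
		body, err := buildChatRequest("qwen3", "my question here", think)
		if err != nil {
			panic(err)
		}
		fmt.Println(string(body))
	}
}
```

The first printed body carries no `"think"` key at all, while the second and third carry `"think":true` and `"think":false`. Using `*bool` rather than `bool` is the design choice that matters here: with a plain `bool` and `omitempty`, an explicit `false` would be dropped and become indistinguishable from an unspecified value.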
..
internal	lint: enable usetesting, disable tenv (#10594)	2025-05-08 11:42:14 -07:00
auth.go	fix nil deref in auth.go	2024-07-26 14:14:48 -07:00
create.go	remove support for multiple ggufs in a single file (#10722)	2025-05-21 13:55:31 -07:00
create_test.go	server: validate local path on safetensor create (#9379)	2025-02-28 16:10:43 -08:00
download.go	server: abort download on empty digest	2025-05-27 11:28:48 -07:00
fixblobs.go	server: replace blob prefix separator from ':' to '-' (#3146)	2024-03-14 20:18:06 -07:00
fixblobs_test.go	server: replace blob prefix separator from ':' to '-' (#3146)	2024-03-14 20:18:06 -07:00
images.go	add thinking support to the api and cli (#10584)	2025-05-28 19:38:52 -07:00
images_test.go	lint: enable usetesting, disable tenv (#10594)	2025-05-08 11:42:14 -07:00
layer.go	One corrupt manifest should not wedge model operations (#7515)	2024-11-05 14:21:45 -08:00
manifest.go	One corrupt manifest should not wedge model operations (#7515)	2024-11-05 14:21:45 -08:00
manifest_test.go	One corrupt manifest should not wedge model operations (#7515)	2024-11-05 14:21:45 -08:00
model.go	tools: refactor tool call parsing and enable streaming (#10415)	2025-05-23 14:19:31 -07:00
modelpath.go	server: add hint to the error message when model path access fails (#10843)	2025-05-24 13:17:04 -07:00
modelpath_test.go	lint: enable usetesting, disable tenv (#10594)	2025-05-08 11:42:14 -07:00
prompt.go	add thinking support to the api and cli (#10584)	2025-05-28 19:38:52 -07:00
prompt_test.go	add thinking support to the api and cli (#10584)	2025-05-28 19:38:52 -07:00
quantization.go	server: improve tensor quantization fallback logic (#10806)	2025-05-22 10:48:08 -07:00
quantization_test.go	ggml: Seperate tensor load from backend creation	2025-05-19 09:54:22 -07:00
routes.go	add thinking support to the api and cli (#10584)	2025-05-28 19:38:52 -07:00
routes_create_test.go	Move quantization to new backend (#10363)	2025-05-06 11:20:48 -07:00
routes_delete_test.go	Update the /api/create endpoint to use JSON (#7935)	2024-12-31 18:02:30 -08:00
routes_generate_test.go	add thinking support to the api and cli (#10584)	2025-05-28 19:38:52 -07:00
routes_list_test.go	Update the /api/create endpoint to use JSON (#7935)	2024-12-31 18:02:30 -08:00
routes_test.go	fix: stream accumulator exits early (#10593)	2025-05-08 13:17:30 -07:00
sched.go	sched: fix runner leak during reloading unload (#10819)	2025-05-22 14:31:36 -07:00
sched_test.go	lint: enable usetesting, disable tenv (#10594)	2025-05-08 11:42:14 -07:00
sparse_common.go	Don't hard fail on sparse setup error	2024-08-09 12:16:19 -07:00
sparse_windows.go	Don't hard fail on sparse setup error	2024-08-09 12:16:19 -07:00
thinking.go	add thinking support to the api and cli (#10584)	2025-05-28 19:38:52 -07:00
thinking_test.go	add thinking support to the api and cli (#10584)	2025-05-28 19:38:52 -07:00
upload.go	server: always print upload/download part info (#8832)	2025-02-04 19:30:49 -08:00