ollama/ml
Daniel Hiltgen 3f30836734
CUDA: filter devices on secondary discovery (#13317)
We now do a deeper probe of CUDA devices to verify that the library version has
the correct compute capability coverage for the device. Because ROCm also
interprets the CUDA visibility env var to filter AMD devices, we try to avoid
setting it, since setting it leads to problems in mixed-vendor systems. However,
without setting it for this deeper probe, each CUDA library subprocess discovers
all CUDA GPUs, and on systems with many GPUs this can lead to hitting timeouts.
The fix is to turn on the CUDA visibility env var just for this deeper probe use case.
2025-12-03 12:58:16 -08:00
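The scoping described in the commit message can be illustrated with a minimal sketch. This is not the code from device.go. It assumes the visibility variable is NVIDIA's standard CUDA_VISIBLE_DEVICES (the commit message does not name it), and probeEnv is a hypothetical helper introduced only for illustration. The idea is to build a per-subprocess environment that exposes a single device, leaving the parent process environment untouched so that ROCm discovery elsewhere is not affected.

```go
package main

import (
	"fmt"
	"os"
	"strings"
)

// Assumption: the "CUDA visibility env var" is the standard NVIDIA variable.
const cudaVisibleDevices = "CUDA_VISIBLE_DEVICES"

// probeEnv (hypothetical helper) returns a copy of base in which the
// visibility variable is set to exactly one device ID. Any inherited value
// is dropped, and base itself is not modified.
func probeEnv(base []string, deviceID string) []string {
	prefix := cudaVisibleDevices + "="
	env := make([]string, 0, len(base)+1)
	for _, kv := range base {
		if strings.HasPrefix(kv, prefix) {
			continue // discard any inherited visibility setting
		}
		env = append(env, kv)
	}
	return append(env, prefix+deviceID)
}

func main() {
	// One environment per deeper-probe subprocess. In real use this slice
	// would be assigned to the Env field of an exec.Cmd; here it is only
	// printed so the sketch runs without a GPU or a probe binary.
	for _, id := range []string{"0", "1"} {
		env := probeEnv(os.Environ(), id)
		fmt.Println(env[len(env)-1])
	}
}
```

Because the variable is set only on the child's environment, each probe enumerates a single GPU rather than every GPU on the host, which is the behavior the commit message credits with avoiding timeouts.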
backend kvcache: Use SetRows to store cache data 2025-11-18 20:42:28 -08:00
nn Add deepseek v3.1 (#13063) 2025-11-17 18:03:21 -08:00
backend.go kvcache: Use SetRows to store cache data 2025-11-18 20:42:28 -08:00
device.go CUDA: filter devices on secondary discovery (#13317) 2025-12-03 12:58:16 -08:00
path.go cpu: always ensure LibOllamaPath included (#12890) 2025-10-31 14:37:29 -07:00