ollama/kvcache
Jesse Gross 01aa788722 ml: Remove Output from Context interface
Model implementations should use Input for all of their tensors
supplied to the model. This includes tensors that relate to the
outputs, which is confusing since there is also an Output funciton.

Since Output is only used internally in GGML and not used by any
model implementations, we can remove it from the interface to
reduce confusion.
2025-03-27 12:19:43 -07:00
..
cache.go kvcache: Pass granular cache size into implementations 2025-03-21 11:20:19 -07:00
causal.go kvcache: Sliding window cache only needs a single batch total 2025-03-26 13:16:03 -07:00
causal_test.go ml: Remove Output from Context interface 2025-03-27 12:19:43 -07:00
encoder.go kvcache: Pass granular cache size into implementations 2025-03-21 11:20:19 -07:00
wrapper.go kvcache: Pass granular cache size into implementations 2025-03-21 11:20:19 -07:00