ThirdPartyComponentArchitecture
Ollama GPU qwen3-coder-next 80B model
The Ollama GPU qwen3-coder-next 80B model is used by the async embedding queue to generate GPU embeddings for text chunks.
The Ollama GPU qwen3-coder-next 80B model is used by the async embedding queue to generate GPU embeddings for text chunks.