We noticed in community discussions that when using Qwen3-embedding's GGUF models, some developers are not appending the special token <|endoftext|> at the end of the context. This can significantly hurt model accuracy. Check our Model Card ( for more.