Kimi-K2-Thinking vLLM self host tool call fail

#40
by CHONGYOEYAT - opened

CUDA ARCH sm120 Driver Version: 580.95.05 CUDA Version: 13.0
OS Ubuntu 22.04 Linux 6.8.0-87-generic x86_64 x86_64
vllm 0.11.2.dev618+ga238cbd89.d20251206.cu130
8x NVIDIA RTX PRO 6000 Blackwell Workstation 96GB

vllm serve Kimi-K2-Thinking
--served-model-name llm_model
--tensor-parallel-size 8
--decode-context-parallel-size 8
--enable-auto-tool-choice
--tool-call-parser kimi_k2
--reasoning-parser kimi_k2
--trust-remote-code
--gpu-memory-utilization 0.90
--max-model-len 262144
--port 9999 \

kimi cli
{
"default_model": "",
"models": {"kimi": {
"provider": "openai",
"model": "llm_model",
"max_context_size": 160000
}},
"providers": {"openai": {
"type": "openai_legacy",
"base_url": "http://localhost:9999/v1",
"api_key": "your-secret-key"
}},
"loop_control": {
"max_steps_per_run": 100,
"max_retries_per_step": 3
},
"services": {}
}

kimi, version 0.61

kimi -m kimi

Image

user@20251130_base_modelโœจ what is this repo about
โ•ญโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ•ฎ
โ”‚ what is this repo about โ”‚
โ•ฐโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ•ฏ
โ€ข I'll help you understand what this repository is about. Let me explore the key files and structure.
โ€ข Used Shell (dir /b)
โ€ข Used ReadFile (client.py)
โ€ข Used ReadFile (solar-system.html)
โ€ข Used Shell (dir temperature_outputs_1)
โ€ข I'll help you understand what this repository is about. Let me explore the key files and structure. <|tool_call_begin|> functions.Shell:0 <|tool_call_argument_begin|> {"command": "dir /b", "timeout": 10} <
|tool_call_end|> <|tool_call_begin|> functions.ReadFile:1 <|tool_call_argument_begin|> {"path": "D:\workspace\20251130_base_model\client.py", "n_lines": 50} <|tool_call_end|> <|tool_call_begin|> functions.R
eadFile:2 <|tool_call_argument_begin|> {"path": "D:\workspace\20251130_base_model\solar-system.html", "n_lines": 50} <|tool_call_end|> <|tool_call_begin|> functions.Shell:3 <|tool_call_argument_begin|> {"co
mmand": "dir temperature_outputs_1", "timeout": 10} <|tool_call_end|>
โ€ข Let me look at the actual content of client.py (beyond the first 50 lines) to see what it does, and possibly look at some of the output files to understand the full context.
โ€ข Used Grep (def )
โ€ข Used ReadFile (temperature_outputs_1\output_temp_0.50.txt)
โ€ข Used ReadFile
Invalid arguments
โ€ข Let me look at the actual content of client.py (beyond the first 50 lines) to see what it does, and possibly look at some of the output files to understand the full context. <|tool_call_begin|> functions
.Grep:4 <|tool_call_argument_begin|> {"pattern": "def ", "path": "D:\workspace\20251130_base_model\client.py", "output_mode": "content", "-n": true} <|tool_call_end|> <|tool_call_begin|> functions.ReadFi
le:5 <|tool_call_argument_begin|> {"path": "D:\workspace\20251130_base_model\temperature_outputs_1\output_temp_0.50.txt"} <|tool_call_end|> <|tool_call_begin|> functions.ReadFile:6 {"path": "D:\workspac
e\20251130_base_model\Data_2013.json", "n_lines": 30} <|tool_call_end|>
LLM provider error: Error code: 400 - {'error': {'message': "1 validation error for ValidatorIterator\n2.function.arguments\n Field required \n For further information visit https://errors.pydantic.dev/2.
12/v/missing None", 'type': 'BadRequestError', 'param': None, 'code': 400}}

issue same for kilo code, cline

Sign up or log in to comment