added "minimal" reasoning option (better for coding)
This commit is contained in:
@@ -93,7 +93,7 @@ curl http://127.0.0.1:8000/v1/chat/completions \
|
||||
|
||||
### Thinking effort
|
||||
|
||||
- `--reasoning-effort` (choice of low,medium,high)<br>
|
||||
- `--reasoning-effort` (choice of minimal,low,medium,high)<br>
|
||||
GPT-5 has a configurable amount of "effort" it can put into thinking, which may cause it to take more time for a response to return, but may overall give a smarter answer. Applying this parameter after `serve` forces the server to use this reasoning effort by default, unless overrided by the API request with a different effort set. The default reasoning effort without setting this parameter is `medium`.
|
||||
|
||||
### Thinking summaries
|
||||
|
||||
Reference in New Issue
Block a user