Add support for GPT-5.2
This commit is contained in:
@@ -115,6 +115,7 @@ curl http://127.0.0.1:8000/v1/chat/completions \
|
||||
# Supported models
|
||||
- `gpt-5`
|
||||
- `gpt-5.1`
|
||||
- `gpt-5.2`
|
||||
- `gpt-5-codex`
|
||||
- `gpt-5.1-codex`
|
||||
- `gpt-5.1-codex-max`
|
||||
@@ -126,7 +127,8 @@ curl http://127.0.0.1:8000/v1/chat/completions \
|
||||
### Thinking effort
|
||||
|
||||
- `--reasoning-effort` (choice of minimal,low,medium,high,xhigh)<br>
|
||||
GPT-5 has a configurable amount of "effort" it can put into thinking, which may cause it to take more time for a response to return, but may overall give a smarter answer. Applying this parameter after `serve` forces the server to use this reasoning effort by default, unless overrided by the API request with a different effort set. The default reasoning effort without setting this parameter is `medium`. The `gpt-5.1` family (including codex) supports `low`, `medium`, and `high` while `gpt-5.1-codex-max` adds `xhigh`; neither offers a `minimal` variant.
|
||||
GPT-5 has a configurable amount of "effort" it can put into thinking, which may cause it to take more time for a response to return, but may overall give a smarter answer. Applying this parameter after `serve` forces the server to use this reasoning effort by default, unless overrided by the API request with a different effort set. The default reasoning effort without setting this parameter is `medium`.<br>
|
||||
The `gpt-5.1` family (including codex) supports `low`, `medium`, and `high` while `gpt-5.1-codex-max` adds `xhigh`. `gpt-5.2` supports `low`, `medium`, `high`, and `xhigh`.
|
||||
|
||||
### Thinking summaries
|
||||
|
||||
@@ -159,7 +161,7 @@ You can enable it by starting the server with this parameter, which will allow O
|
||||
If your preferred app doesn’t support selecting reasoning effort, or you just want a simpler approach, this parameter exposes each reasoning level as a separate, queryable model. Each reasoning level also appears individually under /v1/models, so model pickers in your favorite chat apps will list all reasoning options as distinct models you can switch between.
|
||||
|
||||
## Notes
|
||||
If you wish to have the fastest responses, I'd recommend setting `--reasoning-effort` to minimal, and `--reasoning-summary` to none. <br>
|
||||
If you wish to have the fastest responses, I'd recommend setting `--reasoning-effort` to low, and `--reasoning-summary` to none. <br>
|
||||
All parameters and choices can be seen by sending `python chatmock.py serve --h`<br>
|
||||
The context size of this route is also larger than what you get access to in the regular ChatGPT app.<br>
|
||||
|
||||
@@ -172,4 +174,3 @@ When the model returns a thinking summary, the model will send back thinking tag
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
Reference in New Issue
Block a user