From 54aa6d7a18830974f1a12a931ea19cd0b1d655ea Mon Sep 17 00:00:00 2001 From: Game_Time <108236317+RayBytes@users.noreply.github.com> Date: Sat, 16 Aug 2025 11:57:52 +0500 Subject: [PATCH] Update README.md --- README.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/README.md b/README.md index 96020f4..678ee75 100644 --- a/README.md +++ b/README.md @@ -93,7 +93,7 @@ Models like GPT-5 do not return raw thinking content, but instead return thinkin ## Notes If you wish to have the fastest responses, I'd recommend setting `--reasoning-effort` to low, and `--reasoning-summary` to none. -All parameters and choices can be seen by sending `python chatmock.py serve --h` +All parameters and choices can be seen by sending `python chatmock.py serve --h`
The context size of this route is also larger than what you get access to in the regular ChatGPT app. **When the model returns a thinking summary, the model will send back thinking tags to make it compatible with chat apps. If you don't like this behavior, you can instead set `--reasoning-compat` to legacy, and reasoning will be set in the reasoning tag instead of being returned in the actual response text.**