Not sure if this helps, but gpt-oss, for example, only allows low reasoning, but you can't disable reasoning entirely. This seems to align with OpenAI's documentation on GPT-5 reasoning.
If you're not tied to the OpenAI platform, why not give https://groq.com/ a shot for blazingly fast inference?