This repository was archived by the owner on Mar 9, 2026. It is now read-only.
With the introduction of newer reasoning models in v1.7.0, streaming support for models like o1 was initially unavailable. Support now varies by provider:
OpenAI: Recently added streaming support.
Azure OpenAI: Still lacks streaming support for o1.
Problem
The current implementation assumes all results are streamed, which is incompatible with some o1 models: they require the chat completion to finish before results can be read. This leaves a gap in both configuration and logic, especially for Azure OpenAI users.
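To illustrate the gap, here is a minimal sketch of how the two read paths differ, assuming the OpenAI Python SDK's `chat.completions` interface (the function name `read_completion` is illustrative, not from this repository):

```python
def read_completion(client, stream: bool, **kwargs) -> str:
    """Read a chat completion either incrementally (streamed) or all at once.

    Sketch only: assumes an OpenAI-SDK-shaped client. Non-streaming models
    like o1 on Azure OpenAI must take the `stream=False` branch.
    """
    if stream:
        # Streamed: results arrive as chunks with incremental deltas.
        parts = []
        for chunk in client.chat.completions.create(stream=True, **kwargs):
            delta = chunk.choices[0].delta.content
            if delta:
                parts.append(delta)
        return "".join(parts)
    # Non-streaming: the full completion must finish before anything is read.
    resp = client.chat.completions.create(stream=False, **kwargs)
    return resp.choices[0].message.content
```

A codebase that only implements the first branch cannot serve providers where the second is the only option.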
Proposal
Add a new configuration value:
stream: defaults to true; set to false to disable streaming.
Introduce a CLI flag for manual override:
Use --stream false or --no-stream.
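The precedence implied above (explicit CLI flag beats the config value, which beats the default of `true`) could be resolved with something like the following sketch; the function and config key names are illustrative:

```python
import argparse


def resolve_stream(config: dict, argv: list[str]) -> bool:
    """Resolve the effective 'stream' setting.

    Precedence (sketch, names assumed): CLI flag > config value > default True.
    """
    parser = argparse.ArgumentParser()
    # BooleanOptionalAction provides both --stream and --no-stream;
    # default=None lets us detect whether the user passed either flag.
    parser.add_argument("--stream", action=argparse.BooleanOptionalAction, default=None)
    args, _ = parser.parse_known_args(argv)
    if args.stream is not None:
        return args.stream  # explicit CLI override wins
    return config.get("stream", True)  # config value, defaulting to True
```

For example, `resolve_stream({"stream": False}, ["--stream"])` would re-enable streaming despite the config, while `resolve_stream({}, ["--no-stream"])` disables it.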
Seem reasonable? If so, I'll see if I can knock that out.