O1
The o1 series of models are trained with reinforcement learning to perform complex reasoning. o1 models think before they answer, producing a long internal chain of thought before responding to the user.
200,000 context window 100,000 max output tokens Oct 01, 2023 knowledge cutoff Reasoning token support