Introducing gpt-oss
I ran a few tests on the 20B parameter
Some of the live footage is here:
Overview
- 20 and 120 billion parameters
- June 2024 knowledge cutoff
- Reasoning (low/medium/high)
- 128k context length (same as gpt-4, llama-3.1, mistral-large)
- Tool and Function calling
- Apache 2.0 open source license
- MXFP4 quantization (runs on 16GB of memory)
- Uses o200k_harmony tokenizer
Quirky Prompts
- Give a number between 1 and 100
42