gpt-4o: gpt-4o is the flagship model of this family. It supports a 128k context window and is optimized for dialog.
deepseek-ai/DeepSeek-R1-Distill-Llama-70B-free: The first reasoning model from DeepSeek. Outperforms OpenAI o1 on multiple benchmarks.
Meta-Llama-3.3-70B-Instruct: Generation over generation, Meta Llama 3 demonstrates state-of-the-art performance on a wide range of industry benchmarks and offers new capabilities, including improved reasoning.
Qwen 2.5 72B: Qwen2.5 is the latest series of Qwen large language models. For Qwen2.5, we release a number of base language models and instruction-tuned language models ranging from 0.5 to 72 billion parameters.
DeepSeek V3: DeepSeek V3 achieves state-of-the-art performance among open-source code models on multiple programming languages and various benchmarks.