DeepSeek: R1 Distill Llama 70B

提供商: DeepSeek

模型IDdeepseek/deepseek-r1-distill-llama-70b
上下文长度131K (131,072 tokens)
输入价格$0.70/M
输出价格$0.80/M
模态text->text
分词器Llama3
知识截止2024-07-31
上线时间15月前
工具调用❌ 不支持

简介

DeepSeek R1 Distill Llama 70B is a distilled large language model based on [Llama-3.3-70B-Instruct](/meta-llama/llama-3.3-70b-instruct), using outputs from [DeepSeek R1](/deepseek/deepseek-r1). The model combines advanced distillation techniques to achieve high performance across...