Welcome to our SELF GOVERNING and SUSTAINABLE Community!

MaricelaLi · 2025-02-01 06:28:20

MaricelaLi
New member
Offline

Registered: 2025-02-01
Posts: 3

Topic: DeepSeek's First-generation Reasoning Models

$https://www.ipu.org/sites/default/files/styles/card_image/public/ai_-_brain_banner_1024_x_768_72dpi-01_1.jpg?h\u003dddb1ad0c\u0026itok\u003dEATLRuCr$
DeepSeek's first-generation thinking models, achieving performance comparable to OpenAI-o1 across mathematics, code, and thinking tasks.

Models

DeepSeek-R1

Distilled designs

DeepSeek team has actually shown that the thinking patterns of bigger models can be distilled into smaller designs, leading to better efficiency compared to the thinking patterns discovered through RL on little models.

Below are the designs created by means of fine-tuning versus several dense designs widely used in the research community using thinking information created by DeepSeek-R1. The evaluation results demonstrate that the distilled smaller thick models perform extremely well on criteria.

DeepSeek-R1-Distill-Qwen-1.5 B

DeepSeek-R1-Distill-Qwen-7B

DeepSeek-R1-Distill-Llama-8B

DeepSeek-R1-Distill-Qwen-14B

DeepSeek-R1-Distill-Qwen-32B

DeepSeek-R1-Distill-Llama-70B

License

The design weights are accredited under the MIT License. DeepSeek-R1 series support commercial use, enable for any modifications and acquired works, including, however not limited to, distillation for training other LLMs.

Feel free to surf to my weblog ... ai

Welcome to our SELF GOVERNING and SUSTAINABLE Community!

DeepSeek's First-generation Reasoning Models

Posts: 1

1 Topic by MaricelaLi 2025-02-01 06:28:20

Topic: DeepSeek's First-generation Reasoning Models

Posts: 1