DeepSeek-R1-Lite, a homegrown inference model comparable to o1-preview, is online!

AI News4mos agoupdate Sharenet.ai
1.5K 0
Trae
Yesterday, DeepSeek released DeepSeek-R1-A preview version of Lite, a program that works with the o1 competing autonomic reasoning macrolanguage models, and shows users the complete thought process that o1 does not make public.
Similar to OpenAI's o1-preview, the DeepSeek-R1-Lite preview reasoned about the task, planned ahead, and performed a series of actions to help the model arrive at the answer, and it showed the full thought process.DeepSeek-R1-Lite was trained using reinforcement learning, and the reasoning process included a lot of reflection and validation, with chains of thought tens of thousands of words long. The reasoning process includes a lot of reflection and verification, and the chain of thought is tens of thousands of words long, which makes it more efficient. Currently, it only supports web use, and the official version will be completely open source.
媲美 o1-preview 的国产推理模型——DeepSeek-R1-Lite上线
DeepSeek-R1-Lite Preview excels on math, code, and complex logical reasoning tasks, outperforming o1-preview in some tests. in prestigious reviews such as AIME, the highest difficulty level in the AMC, a U.S. math competition, and codeforces, the world's top programming competition, it outperforms the o1-preview and other models.
媲美 o1-preview 的国产推理模型——DeepSeek-R1-Lite上线
Give it the basic "strawberry test" and it will answer perfectly.
媲美 o1-preview 的国产推理模型——DeepSeek-R1-Lite上线
媲美 o1-preview 的国产推理模型——DeepSeek-R1-Lite上线
媲美 o1-preview 的国产推理模型——DeepSeek-R1-Lite上线
媲美 o1-preview 的国产推理模型——DeepSeek-R1-Lite上线
媲美 o1-preview 的国产推理模型——DeepSeek-R1-Lite上线
Depending on the complexity of the question, DeepSeek-R1 may "think" for tens of seconds before answering, and users have reported longer reasoning times for the same question than o1. Officially, as the length of the chain of thought increases, the longer the reasoning time, the more accurate the results.
媲美 o1-preview 的国产推理模型——DeepSeek-R1-Lite上线
Various tests have been done online, and DeepSeek also makes it easy to jailbreak - i.e. by prompting in a way that ignores security measures. One X user got DeepSeek-R1-Lite to give a detailed recipe for poison by writing special jailbreak prompts.
媲美 o1-preview 的国产推理模型——DeepSeek-R1-Lite上线
Of course, DeepSeek-R1-Lite still had all sorts of flops in online testing, and performed poorly on tic-tac-toe and other logic problems in particular, as did o1.
媲美 o1-preview 的国产推理模型——DeepSeek-R1-Lite上线
Log in to chat.deepseek.com and select "Deep Thinking" mode in the input box to talk to the DeepSeek-R1-Lite preview. The "Deep Thinking" mode is specially designed for complex logical reasoning questions in math, code, etc., and provides more comprehensive, clear, and well-thought-out answers than simple questions.
However, it currently supports web use, does not support API calls for the time being, and has only 50 usage credits per day.
© Copyright notes
AiPPT

Related posts

No comments

none
No comments...