DeepSeek-R1-Lite, a homegrown reasoning model comparable to o1-preview, is online!


Yesterday, DeepSeek released a preview version of DeepSeek-R1-Lite, a reasoning large language model that competes with o1 and shows users the complete thought process that o1 does not make public.

Similar to OpenAI's o1-preview, the DeepSeek-R1-Lite preview reasons about the task, plans ahead, and works through a series of steps to arrive at the answer, and it shows its full thought process. DeepSeek-R1-Lite was trained using reinforcement learning. Its reasoning process includes a lot of reflection and verification, and its chains of thought can run to tens of thousands of words. Currently, it only supports web use, and the official version will be completely open source.


The DeepSeek-R1-Lite preview excels at math, code, and complex logical reasoning tasks, outperforming o1-preview in some tests. On well-regarded benchmarks such as AIME, the highest difficulty level in the AMC series of U.S. math competitions, and Codeforces, one of the world's top programming competition platforms, it scores higher than o1-preview and other models.

Give it the basic "strawberry test" (counting how many times the letter "r" appears in the word "strawberry") and it answers correctly.
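For reference, the expected answer to the strawberry test can be checked with a few lines of Python. This is only a minimal sketch of the ground truth a model's answer is compared against; the helper name `count_letter` is hypothetical and has nothing to do with how DeepSeek-R1-Lite itself reasons.

```python
def count_letter(word: str, letter: str) -> int:
    """Count case-insensitive occurrences of a single letter in a word.

    Hypothetical helper for illustration only.
    """
    return word.lower().count(letter.lower())


# The "strawberry test": how many times does "r" appear in "strawberry"?
print(count_letter("strawberry", "r"))  # prints 3
```

The task is trivial for a program but has tripped up language models, which is why it became a popular quick check.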

Depending on the complexity of the question, DeepSeek-R1-Lite may "think" for tens of seconds before answering, and users have reported longer reasoning times than o1 on the same question. According to DeepSeek, accuracy improves as the chain of thought grows longer and the model spends more time reasoning.

Users have run various tests online, and DeepSeek-R1-Lite has proven easy to jailbreak, that is, to prompt in a way that bypasses its safety measures. One X user got DeepSeek-R1-Lite to give a detailed recipe for poison by writing special jailbreak prompts.

Of course, DeepSeek-R1-Lite still failed in various ways in online testing. Like o1, it performed particularly poorly on tic-tac-toe and other logic problems.

Log in to chat.deepseek.com and select "Deep Thinking" mode in the input box to talk to the DeepSeek-R1-Lite preview. The "Deep Thinking" mode is designed for complex logical reasoning questions in math, code, and similar areas, and is meant to give more comprehensive, clear, and well-thought-out answers to such questions.

However, it currently supports only web use, does not yet support API calls, and is limited to 50 uses per day.

