Search-o1: Giving reasoning models active search capability, so large models can look up external knowledge while they think

Comprehensive Introduction: Search-o1 is an open-source project that aims to improve the performance of large reasoning models (LRMs) by integrating search mechanisms into their reasoning. The core idea is to address the knowledge gaps that arise during reasoning through dynamic search and knowledge integration. The project was developed by sunn...
5mos ago
LiveTalking: An open-source, real-time interactive digital human live-streaming system with synchronized audio and video dialogue

Comprehensive Introduction: LiveTalking is an open-source, real-time interactive digital human system that aims to provide a high-quality digital human live-streaming solution. The project is released under the Apache 2.0 open-source license and integrates a number of cutting-edge technologies, including ER-NeRF rendering, real-time audio and video stream processing ...
5mos ago
JoyGen: An audio-driven, 3D depth-aware talking-portrait video editing tool

Comprehensive Introduction: JoyGen is a two-stage video generation framework for talking faces that focuses on audio-driven facial expression generation. Developed by a team from Jingdong Technology, the project uses 3D reconstruction techniques and audio feature extraction methods to accurately capture the identity characteristics of the speaker and the expression...
5mos ago