Latest AI Resources

Total 2598 articles posts
MagicQuill:智能交互式图像涂鸦编辑系统,精准局部涂鸦编辑

MagicQuill: Intelligent Interactive Image Graffiti Editing System, Precise Localized Graffiti Editing

General Introduction MagicQuill is an open-source AI interactive image editing tool jointly launched by Hong Kong University of Science and Technology (HKUST), Ant Group, Zhejiang University and University of Hong Kong. The tool aims to achieve accurate localized editing of images in an intelligent and interactive way.MagicQuill...
8mos ago
03.4K
SoniTranslate:开源视频翻译配音解决方案,多人配音、调整语速与模仿原声

SoniTranslate: open source video translation and dubbing solution, multi-person dubbing, adjust the speed of speech and mimic the original sound

General Description SoniTranslate is a powerful and user-friendly video multilingual dubbing tool designed to provide a solution for video translation and synchronized audio. It uses advanced speech recognition and machine translation technologies to translate video content into multiple languages and keep the audio synchronized. The program ...
9mos ago
03.1K
LTX Studio:拥有分镜管理工具的AI电影制作平台,可设置多人物保持面部一致

LTX Studio: AI movie-making platform with split-screen management tools to set up multiple characters to keep their faces consistent

General Introduction LTX Studio is an innovative AI-driven video creation platform designed for creators, marketers, filmmakers and studios. It provides full-process operation from story conceptualization, split-screen generation, kinetic effects addition to post-editing, helping users transform creative concepts into...
5mos ago
02.7K
朱雀大模型检测:识别AI生成内容,确保文本和图像真实性

Jubilee Big Model Detection: identifying AI-generated content to ensure text and image authenticity

Comprehensive Introduction Big Model Detection is an AI-generated content detection tool developed by Tencent's hybrid security team, Jubilee Labs. The tool can quickly identify text and images generated by AI and help users distinguish between manually created and AI-generated content. By capturing the differences between AI-generated content and real content...
4mos ago
02.6K
CosyVoice:阿里推出的3秒急速语音克隆开源项目,支持情感控制标签

CosyVoice: 3-second rush voice cloning open source project launched by Ali with support for emotionally controlled tags

Comprehensive Introduction CosyVoice is a multilingual large-scale speech generation model that provides full-stack capabilities from inference, training to deployment. Developed by the FunAudioLLM team, it aims to achieve high quality speech through advanced autoregressive transformers and ODE-based diffusion models...
6mos ago
02.5K
豆包:抖音旗下AI智能助手

Doubao: Shake's AI Intelligent Assistant

Beanbag Comprehensive Introduction Beanbag is an artificial intelligence AI assistant developed by a subsidiary of Jitterbug, the domestic version of which uses the latest Lark Large model. It is an intelligent assistant tool that can help users solve problems, get information and improve efficiency. Beanbag supports Chinese and English, can be used online, and provides web version, Android...
7mos ago
02.5K
Browser Use Web UI:运行AI智能体浏览网页,让AI能够自动操作网页的开源框架

Browser Use Web UI: an open source framework for running AI intelligences to browse the web, allowing AI to automatically manipulate web pages

Comprehensive Introduction Browser Use Web UI is an innovative open source project focused on providing AI agents with a graphical interface tool for browser interaction capabilities. The project is built on top of the browser-use core framework, built with Gradio ...
2mos ago
02.4K
即梦AI:一站式AI创作平台, 图像生成, 智能画布, 视频生成, 音乐生成

Instant Dream AI: One-stop AI creation platform, image generation, smart canvas, video generation, music generation

Comprehensive Introduction Instant Dream AI is a one-stop AI creation platform designed to provide users with versatile and powerful creation tools. Whether it's image generation, smart canvas, video generation or music generation, Instant Dream AI can help users easily realize their creativity. The platform supports multiple creation modes, including AI drawing...
7mos ago
02.4K
PDFMathTranslate:保留PDF完整排版的AI翻译工具

PDFMathTranslate: AI translation tool that preserves the full typography of PDFs

Comprehensive introduction PDFMathTranslate is an open source tool focusing on the translation of scientific papers , PDF documents can be translated in full and generate a bilingual version . It uses AI technology to retain the full layout of the original document , including formulas , diagrams , tables of contents and notes , support ...
2mos ago
02.4K
PraisonAI:低代码多智能体框架,简化复杂任务的自动化解决方案

PraisonAI: A Low-Code Multi-Intelligent Body Framework to Simplify Automation Solutions for Complex Tasks

Comprehensive Introduction PraisonAI is an out-of-the-box multi-intelligence body framework for production environments, designed to create AI intelligences to automate and solve problems ranging from simple tasks to complex challenges. The framework provides a low-code solution that simplifies the building of multi-intelligent body LLM systems and...
6mos ago
02.3K
MMAudio:为视频画面生成同步音效与配乐,视频到音频的多模态联合训练工具

MMAudio: generating synchronized sound effects and soundtracks for video footage, video-to-audio multimodal co-training tool

General Introduction MMAudio is an open-source project aiming to generate high-quality synchronized audio through joint multimodal training. Developed by Ho Kei Cheng et al. at the Chinese University of Hong Kong, the project's main function is to generate synchronized audio based on video and/or text input.MM...
8mos ago
02.3K
百度作家:一站式小说创作与投稿平台,免费AI智能小说写作工具

Baidu Writer: one-stop novel creation and submission platform, free AI intelligent novel writing tool

Comprehensive Introduction Baidu Writers Platform is a one-stop creation and submission platform under Baidu for online literature writers. Writers can create short stories and novels on the platform, make submissions, manage their works, and view revenue data. The platform relies on Baidu's powerful AI capabilities to provide intelligent creation tools to help...
10mos ago
02.3K
IOPaint:全能AI图像处理工具,擦除、扩图、替换元素与绘制文本

IOPaint: All-around AI image processing tool, erasing, expanding, replacing elements and drawing text.

综合介绍 IOPaint是一款免费且开源的AI图像处理工具,支持图像擦除、修复和扩展等功能。它采用最先进的AI模型,能够帮助用户轻松移除图像中的不需要对象、修复瑕疵、添加新内容,甚至扩大图像。IOPa...
9mos ago
02.3K
ElizaOS:构建自主执行的多智能体,功能完备的开源AI智能体开发框架

ElizaOS: Building Autonomously Executing Multi-Intelligents, a Fully Functional Open Source AI Intelligent Body Development Framework

Comprehensive introduction Eliza is an advanced multi-intelligent body (Multi-Agent) development framework , is committed to simplifying the construction and deployment of autonomous intelligent body (Autonomous Agent) process . It supports the deployment of multiple intelligent bodies with different role settings , can realize intelligent ...
7mos ago
02.2K
FunASR:开源语音识别工具包,说话人分离/ 多人对话语音识别

FunASR: Open Source Speech Recognition Toolkit, Speaker Separation / Multi-Person Conversation Speech Recognition

Comprehensive Introduction FunASR is an open source speech recognition toolkit developed by Alibaba's Dharma Institute to bridge academic research and industrial applications. It supports a wide range of speech recognition features, including speech recognition (ASR), voice endpoint detection (VAD), punctuation recovery, language modeling, speaking...
10mos ago
02.2K