Latest AI Resources

Total 2602 articles posts
卡卡字幕助手(VideoCaptioner):基于LLM的智能字幕助手,一键生成高质量字幕

VideoCaptioner: LLM-based intelligent captioning assistant, generating high-quality captions with one click!

General Introduction Kaka Caption Assistant (VideoCaptioner) is an intelligent video caption processing tool based on the Large Language Model (LLM). It can generate high-quality subtitles in one click without high-performance GPU, and supports the whole process of subtitle generation, sentence breaking, optimization and translation. It supports the whole process of subtitle generation, sentence breaking, optimization and translation...
9mos ago
01.8K
BuffGPT:企业级生成式AI应用低代码开发平台

BuffGPT: A Low-Code Development Platform for Enterprise-Grade Generative AI Applications

Comprehensive Introduction BuffGPT is an open source AI application development platform based on the Large Language Model (LLM), providing out-of-the-box features such as data processing, model invocation, RAG retrieval, and visual workflow orchestration to help users easily build and operate generative AI applications. The platform supports privatization...
5mos ago
01.8K
TXYZ:用于学术研究的AI文献搜索和快速阅读助手

TXYZ: An AI Literature Search and Fast Reading Assistant for Academic Research

General Introduction TXYZ is an AI-enhanced research platform that aims to improve the efficiency of academic research through AI technology. Users can quickly find relevant literature through natural language search and extract the best content from it. The platform also provides personalized daily recommendations, literature management and classification functions to help users...
10mos ago
01.8K
DeOldify:使用AI技术为黑白照片和视频上色的经典开源工具

DeOldify: the classic open-source tool for colorizing black-and-white photos and videos using AI technology

Comprehensive Introduction DeOldify is an open source project based on deep learning technology, specifically designed for intelligent colorization and restoration of black and white photos and videos. The project uses an innovative NoGAN training method to successfully solve the common defects of traditional GAN networks in the image coloring process...
8mos ago
01.8K
DreamTalk:使用一张头像图片即可生成表情丰富的说话视频

DreamTalk: Generate expressive talking videos with a single avatar image!

DreamTalk Comprehensive Introduction DreamTalk is a diffusion model-driven expression talking head generation framework, jointly developed by Tsinghua University, Alibaba Group and Huazhong University of Science and Technology. It mainly consists of three parts: a noise reduction network, a style-aware lip expert and a style predictor, and can be based on...
8mos ago
01.8K
OpenBB:开源金融数据分析平台,集成私有数据集和 AI 来增强投资决策

OpenBB: Open Source Financial Data Analytics Platform Integrates Private Datasets and AI to Enhance Investment Decisions

General Introduction OpenBB is a free and fully open source financial data analytics platform designed to provide easy access to financial data and analytics tools for all. The platform integrates over 100 different data sources covering stocks, options, cryptocurrencies, forex, macroeconomic indicators, fixed...
6mos ago
01.8K
Clone AI(小冰数字人):集成多种数字人制作与发布解决方案(付费/不推荐)

Clone AI (Little Ice Digitizer): integrates multiple digitizer production and publishing solutions (paid/not recommended)

Comprehensive Introduction The Xiaobing Digital Human Operation Center (FutureAI) is a cloud-based intelligent video creation platform that integrates Xiaobing Digital Human, Interactive Digital Human, Material Collection, AI Dubbing, AI Painting, Video Editing and other functions to provide a comprehensive digital human production and publishing solution. The platform is suitable for ...
8mos ago
01.8K
ChatTTS:模仿真人说话声音的语音生成模型(ChatTTS一键加速包)

ChatTTS: a speech generation model that mimics the voice of a real person speaking (ChatTTS one-click acceleration package)

General Introduction ChatTTS is a generative speech model designed for conversational scenarios. It generates natural and expressive speech, supports multiple languages and multiple speakers, and is suitable for interactive conversations. The model does this by predicting and controlling fine-grained prosodic features such as laughter, pauses and interjections, sup...
6mos ago
01.8K
RMBG-2-Studio:批量移除图像和视频背景的开源程序,基于RMBG 2.0优化

RMBG-2-Studio: open source program for batch removal of image and video backgrounds, optimized for RMBG 2.0

General Introduction RMBG-2-Studio is an enhanced background removal and replacement application developed based on the BRIA-RMBG-2.0 model. The application is designed to provide users with efficient and accurate image background processing capabilities for a variety of image types, including e-commerce, gaming and...
8mos ago
01.8K
Novelcrafter:专业小说创作工具,利用AI在创作各阶段提供构思和到成书

Novelcrafter: a professional novel creation tool that uses AI to provide ideas at all stages of creation and through to book completion

General Description Novelcrafter is an online creative writing platform for writers that offers a range of tools and resources to help authors at every stage of the process, from idea to finished novel. You can plan where your story is going, create character profiles, and even work with an AI assistant to expand your creativity. Whether...
11mos ago
01.8K
Coqui TTS(xTTS):文本到语音生成的深度学习工具包,支持多种语言和声音克隆功能

Coqui TTS (xTTS): Deep Learning Toolkit for Text-to-Speech Generation with Multiple Language Support and Voice Cloning Capabilities

Comprehensive Introduction Coqui TTS is an open source advanced text-to-speech (TTS) generation toolkit based on deep learning techniques. It has been battle-tested in both research and production environments, and provides a rich set of features and models that support text-to-speech conversion in multiple languages.Coqui TTS...
6mos ago
01.8K
TTSMaker:免费的在线文本转语音工具

TTSMaker: free online text-to-speech tool

General Introduction TTSMaker is a free online text-to-speech tool that supports more than 100 languages and 300 speech styles. Users can convert text to natural and smooth speech and download audio files for commercial use. The tool is suitable for video dubbing, audiobooks, education and training...
11mos ago
01.8K
Step-Audio:多模态语音交互框架,识别语音并使用克隆语音交流等功能

Step-Audio: a multimodal voice interaction framework that recognizes speech and communicates using cloned speech, among other features

Comprehensive Introduction Step-Audio is an open source intelligent speech interaction framework designed to provide out-of-the-box speech understanding and generation capabilities for production environments. The framework supports multi-language dialog (e.g., Chinese, English, Japanese), emotional speech (e.g., happy, sad), regional dialects (e.g., Cantonese, Szechuan ...
6mos ago
01.8K
ViiTor AI:音频/视频多语言翻译合成与语音克隆服务

ViiTor AI: Audio/Video Multilingual Translation Synthesis and Speech Cloning Service

Comprehensive Introduction ViiTor AI is a powerful artificial intelligence platform focused on providing high-quality video translation, voice cloning, AI-generated avatar videos, and speech synthesis services. The platform supports multiple languages and is designed to help users easily realize multilingual content creation.ViiTo...
7mos ago
01.8K
番茄创作工具:将授权小说和短剧文稿转视频,生成短视频用于推广引流

Tomato Creation Tool: Convert licensed novels and short play scripts to video, generating short videos for promotion and traffic generation

Comprehensive Introduction Tomato Darling Center's Copy to Video Creation Tool is a powerful AIGC (Artificial Intelligence Generated Content) tool designed to help content creators quickly convert written copy to video. The tool simplifies the production of copy to video through semantic analysis, illustration generation and video export...
10mos ago
01.8K
触手AI:简单易上手的AI绘图工具,支持训练自己的图像风格

Tentacle AI: simple and easy to use AI drawing tools, support training your own image style

Comprehensive Introduction Touch AI is a professional AI creation platform under Jellyfish Intelligence, providing AI painting, online drawing and massive models and other functions. The platform supports minimalist and professional modes with strong ease of use, provides a variety of drawing styles and design models, rich plug-in options, and allows users to experience AIGC creation capabilities online...
11mos ago
01.8K