Latest AI Resources

Total 2603 articles posts
OpenCreator:整合多种AI模型生成创意视频

OpenCreator: integrating multiple AI models to generate creative videos

Comprehensive Introduction OpenCreator is an online tool designed for creators with the core function of integrating more than 20 generative AI models together. Users can use it to easily generate creative videos without switching between platforms or paying multiple subscriptions. It has a simple interface and supports one-click...
4mos ago
0928
自动解析PDF内容并提取文字与表格的开源服务

Automatically parse PDF content and extract text and tables of open source services

Comprehensive Introduction It can automatically analyze the layout of PDF documents, identify text, titles, images, tables, formulas and other elements in the page, and determine their correct order. The tool supports OCR functionality and can convert scanned PDF to searchable text. It runs on Docker and provides two models...
4mos ago
0818
Internet.io:聚合多AI模型答案的智能工作平台

Internet.io: an intelligent work platform that aggregates answers from multiple AI models

General Introduction Internet.io is an intelligent platform that aggregates answers from multiple top AI models. It is designed to solve the problem that a single AI answer may be inaccurate or inconsistent. Users can simply ask a question and get answers from multiple leading AI models at the same time, making it easy to compare...
4mos ago
0790
Recall:浏览网页时显示个人知识库相关信息

Recall: display information about your personal knowledge base when browsing the web

Comprehensive Introduction Recall is an artificial intelligence tool that enhances your browsing experience by quickly summarizing and depositing web pages, videos, PDFs and more into your personal knowledge base. The core function is to help you display relevant information in real time while browsing, and organize fragmented content into an orderly knowledge network. It consists of ...
4mos ago
0750
WeClone:用微信聊天记录和语音训练数字分身

WeClone: training digital doppelgangers with WeChat chats and voices

Comprehensive introduction WeClone is an open source project that uses WeChat chat logs and voice messages, combined with large language models and speech synthesis technology, to allow users to create personalized digital doppelgangers. The project can analyze the user's chat habits to train the model , but also a small number of voice samples to generate realistic sound...
4mos ago
0917
KrillinAI:一键翻译和配音的视频多语言全球化工具

KrillinAI: Multilingual Globalization Tool for Video with One-Click Translation and Dubbing

Comprehensive Introduction KrillinAI is an open-source video processing tool focused on using artificial intelligence to help users translate videos and automatically dub them. It can start from the video download, all the way to generating the finished product adapted to different platforms, the whole process is just a few clicks. The developers are available on GitHub...
2mos ago
0984
Headshotly:快速生成职业装头像的AI工具

Headshotly: an AI tool for quickly generating professional headshots

General Introduction Headshotly is an online tool that utilizes artificial intelligence technology to quickly generate professional headshots. Its core function is to allow users to upload a few ordinary selfies, which are then processed by AI to generate high-quality professional headshots. The website focuses on simple operation and efficient experience, suitable for those who need...
4mos ago
0740
AnimeGamer:用语言指令生成动漫视频和角色互动的开源工具

AnimeGamer: An Open Source Tool for Generating Anime Videos and Character Interactions with Language Commands

AnimeGamer is an open source tool launched by Tencent ARC Lab. Users can generate anime videos with simple language commands, such as "Sousuke drive around in a purple car", as well as allow different anime characters to interact with each other, such as Kiki from The Witch's House, and Sky City...
4mos ago
0815
Agent S:像人类一样操作电脑的开源智能体框架

Agent S: An Open Source Framework for Intelligent Bodies to Operate Computers Like Humans

General Introduction Agent S is an open-source framework developed by Simular AI that lets intelligences operate computers like humans through a graphical user interface (GUI). It uses a multimodal large language model and empirical learning techniques to accomplish tasks such as browsing the web, editing documents, using software...
4mos ago
0904
TwinMind:免费离线语音转录文字的APP

TwinMind: free offline voice to text transcription app

General Introduction TwinMind is a smart tool developed by ThirdEar AI, Inc. that "helps you remember everything". TwinMind is a smart tool developed by ThirdEar AI, Inc. that "remembers everything for you". It can record conversations, meetings, or lectures in real time and convert them to text in more than 100 languages, even with your cell phone in your pocket...
4mos ago
0822
VideoMind:视频按时间戳定位内容与问答的开源项目

VideoMind: video by timestamp positioning content and Q&A open source project

General Introduction VideoMind is an open source multimodal AI tool focused on inference, Q&A and summary generation for long videos. It was developed by Ye Liu of the Hong Kong Polytechnic University and a team from Show Lab at the National University of Singapore. The tool mimics human understanding of video...
2mos ago
0878
SegAnyMo:从视频中自动分割任意运动物体的开源工具

SegAnyMo: open source tool to automatically segment arbitrary moving objects from video

General Introduction SegAnyMo is an open source project developed by a team of researchers at UC Berkeley and Peking University, including members such as Nan Huang. This tool focuses on video processing and can automatically recognize and segment arbitrary moving objects in a video, such as people, animals or...
4mos ago
0908
GenXD:生成任意3D和4D场景视频的开源框架

GenXD: open source framework for generating videos of arbitrary 3D and 4D scenes

General Introduction GenXD is an open source project, developed by the National University of Singapore (NUS) and Microsoft team. It focuses on generating arbitrary 3D and 4D scenes , to solve the real-world 3D and 4D generation due to insufficient data and model design complexity brought about by the problem . The project was developed by ...
4mos ago
0903