Latest AI Resources

Total 2603 articles posts
自动解析PDF内容并提取文字与表格的开源服务

Automatically parse PDF content and extract text and tables of open source services

Comprehensive Introduction It can automatically analyze the layout of PDF documents, identify text, titles, images, tables, formulas and other elements in the page, and determine their correct order. The tool supports OCR functionality and can convert scanned PDF to searchable text. It runs on Docker and provides two models...
4mos ago
0818
Internet.io:聚合多AI模型答案的智能工作平台

Internet.io: an intelligent work platform that aggregates answers from multiple AI models

General Introduction Internet.io is an intelligent platform that aggregates answers from multiple top AI models. It is designed to solve the problem that a single AI answer may be inaccurate or inconsistent. Users can simply ask a question and get answers from multiple leading AI models at the same time, making it easy to compare...
4mos ago
0790
WeClone:用微信聊天记录和语音训练数字分身

WeClone: training digital doppelgangers with WeChat chats and voices

Comprehensive introduction WeClone is an open source project that uses WeChat chat logs and voice messages, combined with large language models and speech synthesis technology, to allow users to create personalized digital doppelgangers. The project can analyze the user's chat habits to train the model , but also a small number of voice samples to generate realistic sound...
4mos ago
0917
Headshotly:快速生成职业装头像的AI工具

Headshotly: an AI tool for quickly generating professional headshots

General Introduction Headshotly is an online tool that utilizes artificial intelligence technology to quickly generate professional headshots. Its core function is to allow users to upload a few ordinary selfies, which are then processed by AI to generate high-quality professional headshots. The website focuses on simple operation and efficient experience, suitable for those who need...
4mos ago
0740
AnimeGamer:用语言指令生成动漫视频和角色互动的开源工具

AnimeGamer: An Open Source Tool for Generating Anime Videos and Character Interactions with Language Commands

AnimeGamer is an open source tool launched by Tencent ARC Lab. Users can generate anime videos with simple language commands, such as "Sousuke drive around in a purple car", as well as allow different anime characters to interact with each other, such as Kiki from The Witch's House, and Sky City...
4mos ago
0815
Agent S:像人类一样操作电脑的开源智能体框架

Agent S: An Open Source Framework for Intelligent Bodies to Operate Computers Like Humans

General Introduction Agent S is an open-source framework developed by Simular AI that lets intelligences operate computers like humans through a graphical user interface (GUI). It uses a multimodal large language model and empirical learning techniques to accomplish tasks such as browsing the web, editing documents, using software...
4mos ago
0954
Motionvid.ai:用文字或草图快速生成演示动画视频

Motionvid.ai: Quickly generate animated demo videos with text or sketches

General Introduction Motionvid.ai is an online tool that utilizes artificial intelligence to help users quickly create professional animated videos. Its best feature is to generate animations with smooth dynamics and high-quality visual effects in seconds through text descriptions or hand-drawn sketches. Users don't need to master complex...
4mos ago
0784
TwinMind:免费离线语音转录文字的APP

TwinMind: free offline voice to text transcription app

General Introduction TwinMind is a smart tool developed by ThirdEar AI, Inc. that "helps you remember everything". TwinMind is a smart tool developed by ThirdEar AI, Inc. that "remembers everything for you". It can record conversations, meetings, or lectures in real time and convert them to text in more than 100 languages, even with your cell phone in your pocket...
4mos ago
0822
SegAnyMo:从视频中自动分割任意运动物体的开源工具

SegAnyMo: open source tool to automatically segment arbitrary moving objects from video

General Introduction SegAnyMo is an open source project developed by a team of researchers at UC Berkeley and Peking University, including members such as Nan Huang. This tool focuses on video processing and can automatically recognize and segment arbitrary moving objects in a video, such as people, animals or...
4mos ago
0908
GenXD:生成任意3D和4D场景视频的开源框架

GenXD: open source framework for generating videos of arbitrary 3D and 4D scenes

General Introduction GenXD is an open source project, developed by the National University of Singapore (NUS) and Microsoft team. It focuses on generating arbitrary 3D and 4D scenes , to solve the real-world 3D and 4D generation due to insufficient data and model design complexity brought about by the problem . The project was developed by ...
4mos ago
0978
Avcado AI:扫描食品标签并分析成分的健康助手

Avcado AI: A health assistant that scans food labels and analyzes ingredients

General Introduction Avcado AI is a smart tool that helps users understand the contents of food products. Its core function is to quickly identify ingredients, nutritional information and additives by taking a picture and scanning the food label. Users simply take a picture of the label on the package with their phone, and the website analyzes and displays detailed information about the food...
4mos ago
01K
Agent Laboratory:为研究人员提供自动化代码及研究报告撰写助手

Agent Laboratory: automated code and study writing assistant for researchers

Comprehensive Introduction Agent Laboratory is an end-to-end autonomous research workflow designed to help researchers realize their research ideas. The system consists of dedicated agents driven by large language models that support the entire research workflow - from conducting literature reviews and developing plans to executing...
4mos ago
01.4K
Research Rabbit:使用本地LLM进行网页研究和报告撰写,自动深入用户指定主题并生成总结。

Research Rabbit: Web research and report writing using native LLM, automatically drilling down into user-specified topics and generating summaries.

General Introduction Research Rabbit is a native LLM (Large Language Model) based web research and summarization assistant. After the user provides a research topic, Research Rabbit generates a search query, obtains relevant web results, and summarizes those results...
4mos ago
01.6K
Company Researcher:公司研究工具,输入公司网址以获取详细研究信息

Company Researcher: A company research tool, enter a company's web address for detailed research information.

General Description Company Researcher (Company Researcher) is a free and open source tool designed to help users get a quick and comprehensive overview of any company. Simply enter the company's URL and the tool will gather comprehensive information from the web, presenting information about the organization, its products...
4mos ago
01.3K
Deep Research:基于AI的深度研究助手,提供高效的研究工具和报告生成功能

Deep Research: an AI-based deep research assistant that provides efficient research tools and report generation capabilities

General Introduction Deep Research is an AI-based research assistant designed to perform iterative deep research by combining search engines, web crawling, and large language models. The project was released by dzhng on GitHub with the goal of providing an easy-to-use deep research genera...
4mos ago
01.1K
GPT Researcher:利用本地和网络数据,生成全面、详实的研究报告

GPT Researcher: Generate comprehensive, detailed research reports utilizing local and web-based data

Comprehensive Introduction GPT Researcher is an autonomous agent tool based on the Large Language Model (LLM) designed to perform local and web research and generate detailed research reports. The tool provides stable performance and faster speed by parallelizing agent work, ensuring that the information is accurate...
4mos ago
01.2K
STORM:基于Topic搜索网络数据,生成带引用的论文、长文报告

STORM: Search web data based on Topic to generate papers with citations, long paper reports

General Introduction STORM is a knowledge integration and article generation system developed by the Oval team at Stanford University. It focuses on generating exhaustive Wikipedia-like articles (systematic papers) from scratch. The system utilizes large-scale language models for topic research, preparing synopses and simulating actual interconnected...
4mos ago
01.8K
朱雀大模型检测:识别AI生成内容,确保文本和图像真实性

Jubilee Big Model Detection: identifying AI-generated content to ensure text and image authenticity

Comprehensive Introduction Big Model Detection is an AI-generated content detection tool developed by Tencent's hybrid security team, Jubilee Labs. The tool can quickly identify text and images generated by AI and help users distinguish between manually created and AI-generated content. By capturing the differences between AI-generated content and real content...
4mos ago
02.7K
Sonic:音频驱动肖像图片生成面部表情生动的数字人口播视频

Sonic: Audio-driven portrait images generate digital demo videos with vivid facial expressions

General Introduction Sonic is an innovative platform focusing on global audio perception designed to generate vivid portrait animations driven by audio. Developed by a team of researchers from Tencent and Zhejiang University, the platform utilizes audio information to control facial expressions and head movements to generate natural and smooth animated videos.S...
4mos ago
01.6K