Comprehensive Introduction AIstudioProxyAPI is an open source project that uses Node.js and Playwright technology to emulate the OpenAI API by mimicking the Google AI Studio web version of...
General Introduction Step1X-Edit is an open source image editing framework developed by the Stepfun AI team and hosted on GitHub.It combines a multimodal large language model (Qwen-VL) and a diffusion transformer (DiT) to allow users to create images through a simple and natu...
Comprehensive Introduction Klavis AI is an open source platform focused on simplifying the use and integration of the Model Context Protocol (MCP), an open standard that allows AI applications to dynamically connect with external tools and data sources.Klavis AI provides Slack...
General Introduction RealtimeVoiceChat is an open source project focused on real-time, natural conversations with artificial intelligence via voice. Users use a microphone to input their voice, and the system captures the audio through a browser, quickly converts it to text, and a large-scale language model (LLM) generates back...
General Introduction MiMo is an open source large language modeling project developed by Xiaomi, focusing on mathematical reasoning and code generation. The core product is the MiMo-7B family of models, which contains a base model (Base), a supervised fine-tuning model (SFT), a strong chemical trained from the base model...
Comprehensive Introduction GraphGen is an open source framework developed by OpenScienceLab, an AI lab in Shanghai, hosted on GitHub, focused on optimizing supervised fine-tuning of Large Language Models (LLMs) by guiding synthetic data generation through knowledge graphs. It was developed from ...
Synthesis Muyan-TTS is an open source text-to-speech (TTS) model designed for podcasting scenarios. It is pre-trained with over 100,000 hours of podcast audio data and supports zero-sample speech synthesis to generate high-quality natural speech. The model is based on Llama-3.2-3...
General Introduction ACI.dev is an open source infrastructure platform designed to provide AI intelligences with rapid integration to over 600 tools. It ensures secure access to tools such as Google Calendar, S...
General Introduction llm.pdf is an open source project that allows users to run Large Language Models (LLMs) directly in PDF files. Developed by EvanZhouDev and hosted on GitHub, this project demonstrates an innovative approach: by Em...
General Introduction CAD-MCP is an open source project that allows users to control CAD software drawing operations through natural language commands. It combines natural language processing and CAD automation technology , so that users do not need to manually operate the CAD interface , just enter simple text commands that ...
General Introduction Abogen is an open source tool designed to quickly convert ePub, PDF or plain text files to high quality audio. It uses the Kokoro-82M model to generate natural and smooth speech, and supports synchronized subtitle generation, which is suitable for producing audiobooks...
General Introduction Local Deep Research is an open source AI research assistant designed to help users conduct deep research and generate detailed reports for complex problems. It supports local operation, allowing users to accomplish research tasks without relying on cloud services. The tool ...
General Introduction DeepWiki is a free tool from Cognition AI focused on generating structured, Wikipedia-like documentation for GitHub repositories. It analyzes code, README files, and configuration files to automatically create detailed...
General Introduction Trackers is an open source Python tool library focused on multi-object tracking in video. It integrates several leading tracking algorithms, such as SORT and DeepSORT, and allows users to combine different object detection models (such as YOLO...
Comprehensive Introduction Kimi-Audio is an open source audio base model developed by Moonshot AI that focuses on audio understanding, generation and dialog. It supports a wide range of audio processing tasks such as speech recognition, audio Q&A and speech emotion recognition. The model has been tested over 130...
General Introduction Describe Anything is an open source project developed by NVIDIA and several universities, with the Describe Anything Model (DAM) at its core. This tool can be based on the user in the image or video tagged...
General Introduction Cooragent is an open source AI agent collaboration framework developed by LeapLab at Tsinghua University and hosted on GitHub.It allows users to create intelligent AI agents with a one-sentence description and supports multiple agents to collaborate on complex tasks. The framework provides two...
General Introduction InstantCharacter is an open source project developed by Tencent Hunyuan and the InstantX team, hosted on GitHub. It generates consistent-looking character maps with a reference image and a text description...
Comprehensive Introduction MCP Server Deep Research is an open source tool that automatically generates structured research reports for complex problems through artificial intelligence and web search. Users enter a research question, and the tool breaks down the question, searches for authoritative information, assesses source credibility...
Comprehensive Introduction Deep Recall is an open source, enterprise-class memory framework designed for large-scale language models (LLMs). It provides hyper-personalized responsiveness through efficient contextual retrieval and integration. The framework uses a three-tier architecture, including a memory service, a reasoning service, and a coordinator, supporting...
General Introduction CleverBee is an open source AI research assistant hosted on GitHub and developed by SureScaleAI. It helps users by combining web browsing technology with large language models (such as Gemini and Claude)...
General Introduction FantasyTalking is an open source project developed by the Fantasy-AMAP team, focusing on generating realism talking portrait videos through audio drive. The project is based on the advanced video diffusion model Wan2.1 , combined with the audio encoder Wa...
General Introduction Paper2Code is an open source project that aims to solve the problem of lack of code implementations for machine learning papers. It automatically transforms scientific papers into runnable code repositories through the multi-agent Large Language Modeling (LLM) system PaperCoder. The system uses planning ...
Comprehensive Introduction DeepWiki-Open is an open source project designed to automatically generate structured documentation for code repositories on GitHub, GitLab and Bitbucket. It uses AI technology to analyze the code structure , file content and logical relationships , rapid generation ...
General Introduction Audibit is an open source project, the core function is to Hacker News, TechCrunch and other popular technology articles automatically turned into audio podcasts, so that users in the commute, fitness, or busy when listening to information through the Web or mobile. The project makes ...
Comprehensive Introduction Google Labs' Little Language Lessons (LLL) is an interactive English learning platform based on Gemini AI that provides a series of small experiments to help users improve their English through fun conversations and scenario-based practice...
Comprehensive Introduction On-Device AI is an AI app that runs completely offline and is designed for Apple devices with support for iOS, macOS, and visionOS.It provides local large-scale language model (LLM) running, real-time speech transcription, document analysis, and more, without the need to link...
Comprehensive Introduction VoltAgent is an open source TypeScript framework designed for developers to help rapidly build and orchestrate AI intelligences. It provides modular tools and a standardized development model that simplifies interacting with large language models (LLMs), state...
General Quick Prompt is an open source browser extension that focuses on prompt word (Prompt) management and fast input. Users can create, organize and store libraries of Prompts and quickly insert predefined Prompt content into the input box of any web page. This tool is especially ...
General Introduction Suna is an open source general-purpose AI agent developed by Kortix AI, hosted on GitHub, based on the Apache 2.0 license, which allows users to download, modify and self-host it for free. It uses natural language dialog to help users with...
General Introduction Corgea is an AI-based code security platform focused on helping developers and security teams discover, analyze, and automatically fix vulnerabilities in their code. It does this by working with a set of existing static application security testing (SAST) tools such as Snyk and Semgrep...
General Introduction Spring.new is an AI-based online platform focused on helping marketing managers and product managers quickly build customized workflows and small applications. It allows users to describe requirements through natural language input, automatically generating connections Notion, Airtabl...
General Introduction Strawberry is a smart browser with a built-in AI assistant designed to help users automate their daily tasks and improve efficiency. It differs from traditional browsers by integrating AI technology that understands web content in real-time and performs complex tasks such as quick research, content writing...
General Introduction PostRoast is an online tool that uses artificial intelligence to help users optimize social media content, focusing on post analysis for Platform X (formerly Twitter). Users can upload post content and PostRoast will analyze it with AI algorithms...
Comprehensive Introduction InternVL is an open source multimodal big model project developed by Shanghai Artificial Intelligence Laboratory (OpenGVLab) and hosted on GitHub. It integrates visual and linguistic processing capabilities to support the comprehensive understanding and generation of images, videos and texts.In...
Introduction Roop-Unleashed is a Python based open source AI face changing tool, inherited from s0md3v's Roop project, by the developer C0untFloyd continue to maintain and renamed Roop-Unle...
Comprehensive Introduction Potpie AI is an open source platform focused on providing developers with customized AI engineering assistants. It allows AI agents to deeply understand code structure and logic and automate tasks such as debugging, testing, and code generation by building a knowledge graph of the code base. Users can use simple...
General Introduction Extrovert is an AI-based LinkedIn relationship management tool focused on helping enterprise sales teams efficiently build and maintain business relationships. It analyzes a prospect's LinkedIn dynamics through AI to provide personalized comments, likes and private message building...
Comprehensive Introduction Vexa is an open source real-time meeting transcription and knowledge management platform designed to provide efficient meeting recording and intelligent knowledge extraction services for enterprises and individuals. It automatically joins platforms such as Google Meet, Zoom, etc. through API-driven meeting robots...
Comprehensive Introduction RooFlow is an open source AI-assisted programming tool with the core functionality of saving code, decisions and task progress during development through project logging. It is based on Roo Code extension and integrates five modes: architecture, coding, testing, debugging and Q&A. These modes inter...
General Introduction Zev is an easy-to-use command line interface (CLI) tool that helps users quickly query and generate terminal commands in natural language. Instead of memorizing complex command syntax, Zev generates terminal commands by describing your needs in everyday language. Based on Ope...
General Introduction Open Deep Research is a deep research tool developed and open-sourced by the Together AI team, hosted on GitHub. It simulates the human research process through a multi-agent AI workflow, generating detailed research reports...
Comprehensive Introduction LLManager is an open source intelligent approval management tool, developed based on LangChain's LangGraph framework, focused on automating the processing of approval requests while optimizing decision making with human review. It does this through semantic search, sample less learning and...
General Introduction openai-fm is an open source project hosted on GitHub dedicated to demonstrating the capabilities of the OpenAI Text-to-Speech (TTS) API. The project works through an interactive web application...
General Introduction Fellou is the world's first AI-enabled action-oriented browser from Fellou AI. Fellou is the world's first AI-enabled mobile browser, which not only provides the web browsing functionality of a traditional browser, but also automates tasks and enables deep information search through AI technology.
General Introduction Find My Kids is an open source project hosted on GitHub and created by developer Tomer Klein. It combines DeepFace face recognition technology and the WhatsApp Green API...
General Introduction OpenUtau is a free open source song synthesis and editing platform designed to modernize the editing experience for the UTAU community. It is the successor to the UTAU software and solves the compatibility and complexity issues of the original software.OpenUtau supports Wind...
A Comprehensive Introduction NodeRAG is an open source Retrieval Augmented Generation (RAG) system hosted on GitHub and developed by Terry-Xu-666. It optimizes information retrieval and generation through heterogeneous graph structures, significantly improving retrieval accuracy and contextual relevance.Nod...
Comprehensive Introduction SkyReels-V2 is an open source video generation model developed by SkyworkAI. It supports the generation of videos of unlimited length through advanced Diffusion Forcing technology, and is suitable for text-to-video (T2V) and graph...
General Introduction Sidekick CLI is an open source command line tool designed to simplify the project development and deployment process for developers with AI assistance. It is inspired by Claude Code, Copilot, and Cursor, and provides similar functionality...
General Introduction Bake Fonts is an online tool focused on 3D typographic font design and generation by Bake AI, aiming to provide unique and personalized font solutions for designers, creators and brands. The platform allows users to explore diverse font styles...
General Introduction Plandex is an open source end-to-end AI coding assistant designed for large and complex software projects. It can plan and execute multi-step tasks, handle contexts of up to 2 million tokens, and support more than 30 programming languages.Plandex offers...
Comprehensive Introduction BiliNote is an open source AI video note-taking tool that supports extracting content from BiliNote and YouTube video links to automatically generate clearly structured notes in Markdown format. It utilizes native audio transcription and a variety of large models (such as ...
GPT4All General Introduction GPT-4All is an open source project developed by Nomic to allow users to run Large Language Models (LLMs) on local devices. The project emphasizes privacy protection and does not require an Internet connection to use, and is suitable for both personal and business users...
General Introduction OpenAI Codex CLI is an open source terminal coding tool, developed by OpenAI, designed for developers accustomed to terminals. It generates code, edits files, executes commands, and integrates Git version control through natural language commands.Cod...
General Introduction Mailgo is an AI-based cold email marketing platform focused on helping businesses and individuals boost sales and customer conversions through efficient email marketing. It automates email content generation, prospecting and optimizes email deliverability through AI technology to reduce marketing costs...
General Introduction Boxo is a platform that helps mobile apps quickly transform into super apps. With a single SDK integration, developers can embed a wide range of services in their apps, such as e-commerce, travel booking, bill payment, eSIM, and insurance, etc. Boxo offers white-labeled mini-apps that support pin...
General Introduction Pippit AI is a smart authoring tool from CapCut focused on streamlining the process of producing marketing content. Users only need to enter product links or upload materials, the platform can quickly generate videos, images and AI avatars, suitable for social media and e-commerce platforms to make...
Comprehensive Introduction Eden AI is a full-stack AI platform for developers, connecting over 100 AI models covering text, image, speech and video processing functions through a single API interface. Users can quickly call different models and build AI applications without managing multiple vendor accounts...
General Introduction Logent AI is an online tool that utilizes artificial intelligence to quickly generate brand logos. It generates a wide range of professional Logo designs in seconds by analyzing the product name, tagline or reference image entered by the user. The platform supports graphic Logos and monogram Logos, suitable...
General Description Artbreeder Splicer2 is an innovative AI image creation tool with core features of tree-based image blending and editing capabilities. Users can upload images, blend multiple images or adjust features to generate unique portraits, landscapes or other...
Comprehensive Introduction Gemini Balance is an OpenAI API proxy service developed based on FastAPI framework, aiming to provide efficient multi-API Key management and optimization functions. The project supports Gemini model calls, and the main features include multi-API...
General Introduction OneLine is an open source hot event timeline generation tool hosted on GitHub and developed by user chengtx809. It quickly generates a timeline of events by keywords entered by the user, showing the time, title, description and related people of the event...
General Introduction AiPy is an open source Python command-line tool developed by the Knownsec team. It combines the Large Language Model (LLM) and the Python runtime environment to allow users to automatically generate and run Pytho...
General Introduction realtime-transcription-fastrtc is an open source project focused on converting speech to text in real time. It uses FastRTC technology to process low-latency audio streams , combined with the local Whisper model to achieve efficient ...
General Introduction DroidRun is an open source tool that lets AI operate an Android phone like a human. It helps AI automate tasks such as opening apps, sending messages, or browsing the web by extracting interactive elements such as on-screen buttons, input boxes, etc.DroidRun combines...
Comprehensive Introduction AI Humanize is an online tool that focuses on transforming AI-generated text into natural, near-human-written language. It adjusts the syntax, vocabulary, and tone of AI text to make the content more realistic and fluent through advanced natural language processing techniques. The tool supports more than 50...