Muyan-TTS: Personalized Podcast Speech Training and Synthesis
Synthesis Muyan-TTS is an open source text-to-speech (TTS) model designed for podcasting scenarios. It is pre-trained with over 100,000 hours of podcast audio data and supports zero-sample speech synthesis to generate high-quality natural speech. The model is based on Llama-3.2-3...
CAD-MCP: MCP services for controlling CAD software through natural language commands
General Introduction CAD-MCP is an open source project that allows users to control CAD software drawing operations through natural language commands. It combines natural language processing and CAD automation technology , so that users do not need to manually operate the CAD interface , just enter simple text commands that ...
Cotrans: one-stop manga picture translator (open source and free)
Comprehensive introduction manga-image-translator (Cotrans Translator open source version) for translating manga or pictures in the text . Provides command-line interaction and online demo , with batch conversion mode , web server mode and other diverse options for use ...
GraphGen: Fine-tuning Language Models Using Knowledge Graphs to Generate Synthetic Data
Comprehensive Introduction GraphGen is an open source framework developed by OpenScienceLab, an AI lab in Shanghai, hosted on GitHub, focused on optimizing supervised fine-tuning of Large Language Models (LLMs) by guiding synthetic data generation through knowledge graphs. It was developed from ...
ACI.DEV: Integration of 600+ tools for AI intelligences via MCP server
General Introduction ACI.dev is an open source infrastructure platform designed to provide AI intelligences with rapid integration to over 600 tools. It ensures secure access to tools such as Google Calendar, S...
llm.pdf: experimental project to run a large-scale language model in a PDF file
General Introduction llm.pdf is an open source project that allows users to run Large Language Models (LLMs) directly in PDF files. Developed by EvanZhouDev and hosted on GitHub, this project demonstrates an innovative approach: by Em...
Abogen: a tool for converting multiple text formats to audiobooks
General Introduction Abogen is an open source tool designed to quickly convert ePub, PDF or plain text files to high quality audio. It uses the Kokoro-82M model to generate natural and smooth speech, and supports synchronized subtitle generation, which is suitable for producing audiobooks...
Local Deep Research: a locally run tool for generating in-depth research reports
General Introduction Local Deep Research is an open source AI research assistant designed to help users conduct deep research and generate detailed reports for complex problems. It supports local operation, allowing users to accomplish research tasks without relying on cloud services. The tool ...
DeepWiki: Automatically Generate GitHub Repository Documentation and Talk to It with AI
General Introduction DeepWiki is a free tool from Cognition AI focused on generating structured, Wikipedia-like documentation for GitHub repositories. It analyzes code, README files, and configuration files to automatically create detailed...
Trackers: open source tool library for video object tracking
General Introduction Trackers is an open source Python tool library focused on multi-object tracking in video. It integrates several leading tracking algorithms, such as SORT and DeepSORT, and allows users to combine different object detection models (such as YOLO...