AI开源项目 - page 27

Sorting

release update Views Like

Fish Speech: Fast and Highly Accurate Cloning of English and Chinese Speech Using Few Samples

综合介绍 Fish Speech是由Fish Audio开发的一款开源文本到语音（TTS）合成工具。该工具基于VQ-GAN、Llama和VITS等前沿AI技术，能够将文本转换成逼真的语音。Fish S...

4mos ago

01.5K

IMS Toucan: Fast and Controllable Multilingual (7000+ languages supported) Text-to-Speech Tool

General Introduction IMS Toucan is a state-of-the-art text-to-speech (TTS) toolkit developed by the Institute for Natural Language Processing (IMS) at the University of Stuttgart, Germany. The toolkit supports more than 7000 languages and is characterized by fast, controllable and low computational resource requirements.IMS...

Latest AI tools # AI Java Open Source Projecct # AI text-to-speech

4mos ago

0725

ChatTTS: a speech generation model that mimics the voice of a real person speaking (ChatTTS one-click acceleration package)

General Introduction ChatTTS is a generative speech model designed for conversational scenarios. It generates natural and expressive speech, supports multiple languages and multiple speakers, and is suitable for interactive conversations. The model does this by predicting and controlling fine-grained prosodic features such as laughter, pauses and interjections, sup...

Latest AI tools # AI Java Open Source Projecct # AI text-to-speech

4mos ago

01.5K

zChunk: a generic semantic chunking strategy based on Llama-70B

综合介绍 zChunk是由ZeroEntropy开发的一种新型分块策略，旨在为通用语义分块提供解决方案。该策略基于Llama-70B模型，通过提示生成分块，优化了文档的分块过程，确保在信息检索时保持高...

Latest AI tools # AI Java Open Source Projecct # Document Extraction and Cleaning

4mos ago

0663

Chonkie: a lightweight RAG text chunking library

综合介绍 Chonkie 是一个轻量级且高效的 RAG（Retrieval-Augmented Generation）文本切块库，旨在帮助开发者快速、简便地对文本进行分块处理。该库支持多种分块方法，包...

Latest AI tools # AI Java Open Source Projecct # Document Extraction and Cleaning

4mos ago

01.4K

Qwen4Mac: Use Qwen's big models in the Mac menu bar to have conversations on the go!

综合介绍 Qwen4Mac是一个开源项目，旨在将Qwen大语言模型（LLM）集成到Mac的菜单栏中，方便用户随时调用和使用。该项目由andreaturchet开发和维护，提供了一种简便的方式，让用户能...

Latest AI tools # AI Java Open Source Projecct

4mos ago

0682

口袋AI：手机中运行的离线AI助手，适配 DeepSeek-R1 (5.37GB)

Pocket AI: offline AI assistant running in your phone, adapted for DeepSeek-R1 (5.37GB)

综合介绍口袋AI（PocketPal AI 中文版）是一款强大的离线AI助手，旨在让用户随时随地与AI进行对话。该项目基于小型语言模型（SLMs），无需联网即可在手机上运行，特别适配中文用户体验。口...

Latest AI tools # AI Java Open Source Projecct # AI Localized Chat Application

4mos ago

0979

Kokoro WebGPU: A Text-to-Speech Service for Offline Operation in Browsers

General Introduction Kokoro WebGPU is a WebGPU version of the Kokoro text-to-speech (TTS) model, provided by WebML Community on the Hugging Face platform. The project utilizes WebGPU technology to enable users to...

Latest AI tools # AI Java Open Source Projecct # AI text-to-speech

4mos ago

0974

Unsloth: an open source tool for efficiently fine-tuning and training large language models

综合介绍 Unsloth 是一个开源项目，旨在提供高效的微调和训练大语言模型（LLMs）的工具。该项目支持多种知名模型，包括 Llama、Mistral、Phi 和 Gemma 等。Unsloth 的...

Latest AI tools # AI Java Open Source Projecct # Large model fine-tuning

4mos ago

01.1K

Video Analyzer: analyzes video content and generates detailed descriptions

Comprehensive Introduction Video Analyzer (Video Analyzer) is a comprehensive video analysis tool that combines computer vision, audio transcription and natural language processing techniques to generate detailed video content descriptions. The tool transcribes audio content by extracting key frames in the video...

Latest AI tools # AI Java Open Source Projecct # Visual Target Detection

4mos ago

01.4K

CogVLM2: Open Source Multimodal Modeling with Support for Video Comprehension and Multi-Round Dialogue

Comprehensive Introduction CogVLM2 is an open source multimodal model developed by the Tsinghua University Data Mining Research Group (THUDM), based on the Llama3-8B architecture, and designed to provide performance comparable to or even better than GPT-4V. The model supports image understanding, multi-round dialogs, and visual ...

Latest AI tools # AI Java Open Source Projecct # Visual Target Detection

4mos ago

0784