AI speech to text - page 2

Whisper Input: a free and high-speed voice-to-text transcription service using Groq

General Description: Whisper Input is an open source voice transcription tool. Users hold down the Option key to start recording and release it to stop. The tool calls Groq Whisper Large V3 Turbo ...

4mos ago

LiberSonora: Audiobook Subtitle Extraction and Multilingual Translation, Audiobook Transcription into Multiple Languages

General Introduction: LiberSonora, which means "free sound", is an AI-powered open source audiobook toolset. It supports intelligent subtitle extraction, AI title generation, and multi-language translation, and can run batch processing offline with GPU acceleration. LiberSo...

Latest AI tools # AI Java Open Source Project # AI Translation # AI Speech to Text

4mos ago

AudioNotes: Quickly Extract Audio and Video Content and Generate Structured Notes

Comprehensive Introduction: AudioNotes is a system built on FunASR and Qwen2 that turns audio and video into structured notes. It extracts the audio/video content and calls a large language model to organize it into structured Markdown notes, which is convenient for...

Latest AI tools # AI Java Open Source Project # AI Speech to Text

4mos ago

Orate: A Unified API for Integrating Well-Known Speech Generation, Speech Transcription and Voice Change Models

Comprehensive Introduction: Orate is an AI toolkit focused on speech generation and transcription. It provides a unified API that integrates with leading AI providers such as OpenAI, ElevenLabs, and AssemblyAI to help users create forced...

Latest AI tools # AI Java Open Source Project # AI text-to-speech # AI Speech to Text

4mos ago

PengChengStarling: A Smaller, Faster Multilingual Speech-to-Text Tool than Whisper-Large v3

Comprehensive Introduction: PengChengStarling (PengCheng Labs) is a multilingual automatic speech recognition (ASR) tool that converts speech in different languages into the corresponding text. The toolkit is built on the icefall project and provides a complete speech recognition process...

Latest AI tools # AI Java Open Source Project # AI Speech to Text

4mos ago

RealtimeSTT: Real-Time Speech-to-Text Tool for Low-Latency Streaming Speech Recognition Based on Whisper

General Introduction: RealtimeSTT is an efficient, low-latency real-time speech-to-text library with advanced voice activity detection and wake word activation. It was developed by Kolja Beigel to support applications that require fast and accurate speech-to-text...

AI News # AI Java Open Source Project # AI Speech to Text

5mos ago

Sherpa-ONNX: Offline Speech Recognition and Synthesis with ONNXRuntime

General Introduction: sherpa-onnx is an open source project developed by the Next-gen Kaldi team that provides efficient offline speech recognition and speech synthesis. It supports multiple platforms including Android, iOS, Raspber...

Latest AI tools # AI Java Open Source Project # AI text-to-speech # AI Speech to Text

5mos ago

Acoust: Online AI Speech Generation and Text-to-Speech (TTS) Services Platform

General Introduction: Acoust is an online AI speech generation and text-to-speech (TTS) platform that generates realistic speech. The platform also provides video editing tools that let users complete video production without switching between multiple applications. Acou...

Latest AI tools # AI text-to-speech # AI Speech to Text

5mos ago

Notta: AI meeting recording and audio transcription tool to automatically transcribe meetings, interviews or recordings

General Description: Notta is an AI meeting recording and audio transcription tool that automatically converts meetings, interviews or audio recordings into searchable text. With Notta, users can transcribe, edit, summarize and collaborate to boost productivity. Notta supports...

Latest AI tools # AI Text and Audio/Video Summarization Tool # AI Speech to Text

5mos ago

AI no jimaku gumi: Automatic generation and translation of multilingual subtitles for videos with the help of AI

General Introduction: AI no jimaku gumi ("AI subtitle group") is a command line video subtitle processing tool that automates subtitle extraction, transcription and translation. The tool integrates advanced AI technologies, including Whisper speech...

Latest AI tools # AI Java Open Source Project # AI Translation # AI Speech to Text

5mos ago

FunClip: Intelligent editing of video content into short clips, easy to realize accurate video clip extraction/cropping

Comprehensive Introduction: FunClip is a fully open source, locally run automated video editing tool developed by the Tongyi Speech Lab of Alibaba DAMO Academy. The tool integrates the industrial-grade Paraformer-Large speech recognition model, which can accurately recognize the speech in the video...

Latest AI tools # AI Java Open Source Project # AI Speech to Text # AI audio/video editor

5mos ago

BetterWhisperX: Automatic speech recognition with speaker diarization, providing highly accurate word-level timestamps

Comprehensive Introduction: BetterWhisperX is an optimized fork of the WhisperX project focused on efficient and accurate automatic speech recognition (ASR). The project was developed by Federico ...

Latest AI tools # AI Java Open Source Project # AI Speech to Text

5mos ago

Freed: AI medical transcription assistant that accurately transcribes doctor-patient conversations and reduces visit documentation paperwork

General Description: Freed is an AI medical transcription assistant designed for healthcare professionals. It helps doctors and other practitioners automate the recording of patient visits, reduce paperwork, and work more efficiently. Freed's AI transcription...

Latest AI tools # AI Speech to Text

5mos ago

Voicenotes: AI voice notes, record and transcribe voice, intelligently manage meeting content

General Introduction: Voicenotes is a smart voice notes app designed to help users record and manage voice notes and meetings. The app supports voice transcription in more than 100 languages: users simply speak an idea and Voicenotes automatically transcribes it into text...

Latest AI tools # AI Notes # AI Speech to Text

5mos ago

Voice-Pro: Open source multifunctional video translation tool with voice transcription, translation into multiple languages, and one-click installation on Windows

General Introduction: Voice-Pro is a multifunctional tool based on Gradio WebUI that supports speech-to-text, text-to-speech, real-time translation, YouTube video downloads and vocal separation. It integrates Whisper, Faster-Wh...

Latest AI tools # AI Java Open Source Project # AI Translation # AI Speech to Text

6mos ago

Zamzar: Multifunctional online file format conversion tool for video, audio, image and document conversion

General Introduction: Zamzar is an online file conversion tool that supports over 1200 file formats. Whether the file is a document, image, video, audio file or eBook, Zamzar converts it quickly and efficiently. Users don't need to download any software...

Latest AI tools # AI Open Services # AI Speech to Text

7mos ago

AI Hear: Real-Time Speech Transcription and Translation Software That Runs Locally and Offline

General Description: If you're using a MacBook, try AI Hear: it records audio, transcribes speech to text locally in real time, translates it, and exports subtitles. It can help you follow international meetings and English audiobooks. AI Hear is locally running software that provides one-click real-time...

Latest AI tools # AI Translation # AI Speech to Text

7mos ago

SoniTranslate: Open source video translation and dubbing solution with multi-speaker dubbing, adjustable speech rate and imitation of the original voice

General Description: SoniTranslate is a user-friendly multilingual video dubbing tool that provides video translation with synchronized audio. It uses speech recognition and machine translation to translate video content into multiple languages while keeping the audio synchronized. The program ...

Latest AI tools # AI text-to-speech # AI Translation # AI Speech to Text

7mos ago

FunASR: Open Source Speech Recognition Toolkit, Speaker Separation / Multi-Person Conversation Speech Recognition

Comprehensive Introduction: FunASR is an open source speech recognition toolkit developed by Alibaba's DAMO Academy to bridge academic research and industrial applications. It supports a wide range of features, including automatic speech recognition (ASR), voice activity detection (VAD), punctuation restoration, language modeling, speaking...

Latest AI tools # AI Java Open Source Project # AI Speech to Text

8mos ago
