Comment by synesthesiam

synesthesiam May 21, 2021 parent

Author here. Thanks to everyone for checking out voice2json!

The TLDR of this project is: a unified command-line interface to different offline speech recognition projects, with the ability to train your own grammar/intent recognizer in one step.

My apologies for the broken packages; I'll get those fixed shortly. My focus lately has been on Rhasspy (https://github.com/rhasspy/rhasspy), which has a lot of the same ideas but a larger scope (full voice assistant).

Questions, comments, and suggestions are welcomed and appreciated!

tootie May 22, 2021

Is the primary use case for NLP interfaces? I'm looking for a good tool for automated transcriptions of long-form (10-60 minutes) of audio.

synesthesiam OP May 22, 2021

I'd recommend using Vosk directly for that: https://alphacephei.com/vosk/

voice2json is better suited for limited domain speech, where each sentence is a specific voice command (think home automation).

Jeaye May 22, 2021

Have you seen anyone using this for vim? Do you have any example of how that might look, or insight into whether it would work?

synesthesiam OP May 22, 2021

I haven't seen this yet, but I imagine it would involve running at least "voice2json record-command | voice2json transcribe-wav | jq .text". This will record a single command (until silence), and output the text transcription.

markjgx May 22, 2021

Hey there, this looks great. I was wondering why Deepspeech 0.6? Why not the latest version DeepSpeech 0.9?

synesthesiam OP May 22, 2021

I need to cycle back and update voice2json. Rhasspy (the full voice assistant) supports DeepSpeech 0.9.3.

markjgx May 22, 2021

Awesome, thanks.

This item has no comments currently.

Preferences

Keyboard Shortcuts

Story Lists

Navigation

Miscellaneous