Preferences

Author here. Thanks to everyone for checking out voice2json!

The TLDR of this project is: a unified command-line interface to different offline speech recognition projects, with the ability to train your own grammar/intent recognizer in one step.

My apologies for the broken packages; I'll get those fixed shortly. My focus lately has been on Rhasspy (https://github.com/rhasspy/rhasspy), which has a lot of the same ideas but a larger scope (full voice assistant).

Questions, comments, and suggestions are welcomed and appreciated!


Is the primary use case for NLP interfaces? I'm looking for a good tool for automated transcriptions of long-form (10-60 minutes) of audio.
I'd recommend using Vosk directly for that: https://alphacephei.com/vosk/

voice2json is better suited for limited domain speech, where each sentence is a specific voice command (think home automation).

Have you seen anyone using this for vim? Do you have any example of how that might look, or insight into whether it would work?
I haven't seen this yet, but I imagine it would involve running at least "voice2json record-command | voice2json transcribe-wav | jq .text". This will record a single command (until silence), and output the text transcription.
Hey there, this looks great. I was wondering why Deepspeech 0.6? Why not the latest version DeepSpeech 0.9?
I need to cycle back and update voice2json. Rhasspy (the full voice assistant) supports DeepSpeech 0.9.3.
Awesome, thanks.

This item has no comments currently.

Keyboard Shortcuts

Story Lists

j
Next story
k
Previous story
Shift+j
Last story
Shift+k
First story
o Enter
Go to story URL
c
Go to comments
u
Go to author

Navigation

Shift+t
Go to top stories
Shift+n
Go to new stories
Shift+b
Go to best stories
Shift+a
Go to Ask HN
Shift+s
Go to Show HN

Miscellaneous

?
Show this modal