Transcribing audio with Speech-to-Text

Use the built-in STT engine to turn spoken audio into text.

Written by Grout

Last updated 1 day ago

Alongside Text-to-Speech (TTS), Grout Film includes a Speech-to-Text (STT) engine that converts spoken audio into text.

How to transcribe

  1. Open the Speech-to-Text tool.
  2. Provide the audio you want transcribed.
  3. Generate the transcription.

[SCREENSHOT: The Speech-to-Text tool transcribing audio]

Running STT locally

Like the TTS engine, the Speech-to-Text engine can be downloaded and run on your own machine.