Speech Strategy News: VideoDubber provides automated dubbing of video using TTS

December 13, 2017

Speech Strategy News_logo

Content in one language, can, for example, be dubbed in another

Dubbing is a post-production process used in filmmaking and video production, in which additional or supplementary recordings are mixed with original production sound to create the finished soundtrack. The feature is used, for example, to translate a talk in one language into another while it is being shown.

VideoDubber provides an automated video dubbing cloud platform that dubs video content using computer-generated text-to-speech voices that use the voice signature of professional dubbing talents (examples). The company says that the resulting audio track sounds like a natural voice-over artist recorded in a professional recording studio, but since it is digitally produced, it’s perfectly clear, with no hisses or background noises. The software can take a two-hour piece of content (represented as text in the target language) and dub it into another language in less than five minutes, the company indicated.

VideoDubber announced the integration of a voice portfolio of more than 80 distinct premium voices powered by Nuance Communications and enabled through VideoDubber’s cloud platform. Daniel Faulkner, senior vice president of Nuance’s Mobile division noted, “By integrating Nuance’s text-to- speech technology into its platform, VideoDubber is able to bring video content to a broad international audience more quickly and easily.” VideoDubber also announced voices that use Acapela Group’s TTS. (Cavena provides a similar TTS-based dubbing service called Audio Description (AD), which serves the needs of visually impaired people by explaining what is displayed on the screen, p. 35.) Microsoft’s Speech Application Programming Interface (SAPI) can be used to access those (and any) TTS voices. SAPI was originally used widely in the Interactive Voice Response (IVR) market to enable voice interactions.

Boaz Rossano, CEO of VideoDubber commented, “This technology previously used solely in one market has now significantly impacted the broadcast market. Text to Speech can now enable an automated dubbing service that is available 24/7 and can dub masses of video hours to different languages simultaneously, in a small fraction of the time it traditionally took. This is something that will revolutionize the TV market.”

Ifat Grinblat-Yanai, Business Development Manager & ISV Group Lead at Microsoft, explained Microsoft’s support beyond SAPI: “Microsoft has a long-term commitment to support young promising technology startups, and offer them its state of the art technologies and infrastructure, at no cost, as part of the Microsoft BizSpark program. We’re happy to support VideoDubber, our technology partner and a BizSpark graduate, on their product launch, and look forward to see their impact on the future of the TV broadcast industry in the years to come.” Microsoft’s BizSpark program provides startups access to Microsoft software development tools, connects them with key industry players including investors, and provides marketing visibility to help entrepreneurs starting a business.

To the original publication: Speech Strategy News – October 2014, p. 24, by William Meisel, PhD.