how to detect language spoken in google cloud plat

Is there an option to automatically detect the spoken language using Google Cloud Platform Machine Learning's Speech API?

https://cloud.google.com/speech/docs/languages indicates the list of the languages supported and user needs to be manually set this parameter to perform speech-to-text.

Thanks Mahesh

标签： machine-learning google-cloud-platform speech-to-text

2条回答

叼着烟拽天下

2楼-- · 2019-01-25 14:11

Requests to Google Cloud Speech API require the following configuration parameters: encoding, sampleRateHertz and languageCode. https://cloud.google.com/speech/reference/rest/v1/RecognitionConfig

Thus, it is not possible for the Google Cloud Speech API service to automatically detect the language used. The service will be configured by this parameter (languageCode) to start recognizing speech in that specific language.

If you had in mind a parallel with Google Cloud Translation API, where the input language is automatically detected, please consider that automatically detecting the language used in an audio file requires much more bandwidth, storage space and processing power than in a text file. Also, Google Cloud Speech API offers Streaming Speech Recognition, a real-time speech-to-text service, where the languageCode parameter is especially required.

0人赞添加讨论(0) 举报

Root（大扎）

3楼-- · 2019-01-25 14:16

As of last month, Google added support for detection of spoken languages into its speech-to-text API. Google Cloud Speech v1p1beta1

It’s a bit limited though - you have to provide a list of probable language codes, up to 3 of them only, and it’s said to be supported only for voice command and voice search modes. It’s useful if you have a clue what other languages may be in your audio.

From their docs:

alternative_language_codes[]: string

Optional A list of up to 3 additional BCP-47 language tags, listing possible alternative languages of the supplied audio. See Language Support for a list of the currently supported language codes. If alternative languages are listed, recognition result will contain recognition in the most likely language detected including the main language_code. The recognition result will include the language tag of the language detected in the audio. NOTE: This feature is only supported for Voice Command and Voice Search use cases and performance may vary for other use cases (e.g., phone call transcription).”

0人赞添加讨论(0) 举报

how to detect language spoken in google cloud plat

采纳回答

编辑标签

举报内容

检举类型

检举原因

检举说明(必填)

打开微信“扫一扫”，打开网页后点击屏幕右上角分享按钮

付费偷看金额在0.1-10元之间