audio file isn't being parsed with Google Spee

This question is a follow-up to a previous question.

The snippet of code below almost works: it runs without error, yet it gives back a None value for results_list. I think this means it is accessing the file but just can't extract anything from it.

I have a file, sample.wav, living publicly here: https://storage.googleapis.com/speech_proj_files/sample.wav

I am trying to access it by specifying source_uri='gs://speech_proj_files/sample.wav'.
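For reference, the public URL and the gs:// URI above name the same bucket and object. Here is a minimal sketch of that mapping, assuming the path-style storage.googleapis.com URL form shown above. The helper name public_url_to_gs_uri is hypothetical and not part of any Google library, and it does not decode URL-escaped object names.

```python
from urllib.parse import urlparse


def public_url_to_gs_uri(url):
    """Map https://storage.googleapis.com/<bucket>/<object> to gs://<bucket>/<object>.

    Hypothetical helper for illustration; only handles the path-style URL form.
    """
    parsed = urlparse(url)
    if parsed.netloc != 'storage.googleapis.com':
        raise ValueError('not a path-style storage.googleapis.com URL: %s' % url)
    # The first path segment is the bucket, the remainder is the object name.
    bucket, _, object_name = parsed.path.lstrip('/').partition('/')
    if not bucket or not object_name:
        raise ValueError('URL must contain both a bucket and an object name')
    return 'gs://%s/%s' % (bucket, object_name)


print(public_url_to_gs_uri(
    'https://storage.googleapis.com/speech_proj_files/sample.wav'))
```

This only confirms that the two strings point at the same object; it says nothing about whether the service can read it.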

I don't understand why this isn't working. I don't think it's a permissions problem, and my session is instantiated fine. The code chugs for a second, yet always comes up with no result. How can I debug this? Any advice is much appreciated.

from google.cloud import speech
speech_client = speech.Client()

audio_sample = speech_client.sample(
    content=None,
    source_uri='gs://speech_proj_files/sample.wav',
    encoding='LINEAR16',
    sample_rate_hertz=44100)
results_list = audio_sample.async_recognize(language_code='en-US')

Tags: audio google-cloud-platform google-speech-api

2 Answers

女痞

Reply #2 · 2019-07-31 14:15

Ah, that's my fault from the last question. You're calling the async_recognize command, not the sync_recognize command.

That library has three recognize commands. sync_recognize reads the whole file and returns the results directly. That's probably the one you want. Remove the letter "a" from the method name and try again.

Here's an example Python program that does this: https://github.com/GoogleCloudPlatform/python-docs-samples/blob/master/speech/cloud-client/transcribe.py

FYI, here's a summary of the other types:

async_recognize starts a long-running, server-side operation to transcribe the whole file and returns immediately with an operation object. You can call the operation.poll() method to ask the server whether it has finished and, once the operation is complete, get the results via operation.results.

The third type is streaming_recognize, which sends you results continually as they are processed. This can be useful for long files where you want some results immediately, or if you're continuously uploading live audio.
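The poll-until-complete flow described for async_recognize can be sketched as a small helper. This is only an illustration under stated assumptions: wait_for_operation and FakeOperation are hypothetical names, and FakeOperation is a stand-in that mimics the complete, poll() and results members mentioned above so the loop can run without calling the real service.

```python
import time


def wait_for_operation(operation, retries=100, delay_seconds=10, sleep=time.sleep):
    """Poll until operation.complete is true or retries run out.

    Returns True if the operation completed, False if we gave up.
    """
    while retries > 0 and not operation.complete:
        retries -= 1
        sleep(delay_seconds)
        operation.poll()  # with a real operation, this would be an API call
    return operation.complete


class FakeOperation:
    """Hypothetical stand-in that completes after a fixed number of polls."""

    def __init__(self, polls_needed):
        self.polls_needed = polls_needed
        self.complete = False
        self.results = None  # stays None until the operation completes

    def poll(self):
        self.polls_needed -= 1
        if self.polls_needed <= 0:
            self.complete = True
            self.results = ['stand-in result']


op = FakeOperation(polls_needed=3)
finished = wait_for_operation(op, retries=5, delay_seconds=0)
print(finished, op.results)
```

Checking the helper's return value before reading the results avoids treating an unfinished operation as an empty transcript.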


男人必须洒脱

Reply #3 · 2019-07-31 14:18

I finally got something to work:

import time

from google.cloud import speech

speech_client = speech.Client()

# Pass everything by keyword. The question's code spells the rate parameter
# sample_rate_hertz, so use whichever name your installed version accepts.
sample = speech_client.sample(
    content=None,
    source_uri='gs://speech_proj_files/sample.wav',
    encoding='LINEAR16',
    sample_rate=44100)

retry_count = 100

# Start the long-running operation; the language code is passed here.
operation = sample.async_recognize(language_code='en-US')

# Poll every 10 seconds until the operation completes or the retries run out.
while retry_count > 0 and not operation.complete:
    retry_count -= 1
    time.sleep(10)
    operation.poll()  # API call

print(operation.complete)

print(operation.results[0].transcript)

print(operation.results[0].confidence)

Then something like

for op in operation.results:
    print(op.transcript)
