I'm building an apache beam streaming pipeline whose source is Pubsub and sink is BigQuery. I've gotten the error messsage:
"Workflow failed. Causes: Unknown message code."
As cryptic as this message is I now believe it to be the case that BigQuery is not supported as a sink for streaming pipelines, it says this here: Streaming from Pub/Sub to BigQuery
Am I certainly correct that this is what's causing the problem? Or if not is it still not supported in any case?
Can anyone hint at when this feature will be released? It's a shame, I was pretty excited to get using this.
Python Streaming pipelines are experimentally available since Beam 2.5.0 as documented in beam docs here
Therefore you will need to install apache-beam 2.5.0 and apache-beam[gcp]
I ran this command:
Using the code below, and it works alright:
This code uses the publicly available topic "--topic projects/pubsub-public-data/topics/taxirides-realtime" and BQ table that I have created with the right schema.
If you use this example be careful not leaving it running or you will incur into costs as you will receive a lot messages coming from this PubSub topic.