It's all in the title. I'd like to run batches off the top of my streaming jobs, and being able to see the watermark as an indicator of when to start would be wonderful.
Answer 1:
You might be able to accomplish this by using Pub/Sub to publish a signal that triggers whatever external processing you want.
To control the frequency of that signal, you could use a ParDo to filter your records down based on some criterion, which might take the event timestamps into account.
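As a rough illustration of such a criterion, here is a minimal sketch in plain Python (no Beam dependency, so it runs anywhere). It keeps only records whose event time falls just after an interval boundary. The record shape, the function name `boundary_records`, and the `interval_s` / `slack_s` values are all assumptions made up for this example; in a real pipeline the loop body would live in the `process` method of a DoFn applied with ParDo.

```python
from typing import Iterable, Iterator, Tuple

# Hypothetical record shape: (event timestamp in seconds, payload).
Record = Tuple[float, str]


def boundary_records(records: Iterable[Record],
                     interval_s: float = 3600.0,
                     slack_s: float = 5.0) -> Iterator[Record]:
    """Yield only records whose event time is within slack_s after an interval boundary."""
    for ts, payload in records:
        # Seconds elapsed since the most recent interval boundary.
        if ts % interval_s < slack_s:
            yield ts, payload


sample = [(3599.0, "a"), (3601.5, "b"), (5400.0, "c"), (7200.0, "d")]
signals = list(boundary_records(sample))
print(signals)
```

This filter is stateless, so several records near the same boundary would all pass. If you need exactly one signal per interval, you would have to deduplicate downstream.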
If you explicitly want to use the watermark, you could try using windowing and triggers to produce records after the watermark passes some interval.
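To make the "fire after the watermark passes" idea concrete, here is a toy simulation in plain Python. It is not Beam or Dataflow code and does not reflect how the runner actually computes its watermark; it assumes a simple heuristic watermark (newest event time seen minus a fixed lateness allowance). The function name `fire_windows` and both parameters are hypothetical.

```python
from typing import Iterable, Iterator, Tuple


def fire_windows(event_times: Iterable[float],
                 window_s: float = 60.0,
                 max_lateness_s: float = 10.0) -> Iterator[Tuple[float, float]]:
    """Yield (window_start, window_end) once the estimated watermark passes window_end."""
    watermark = float("-inf")
    next_end = None  # end of the earliest window that has not fired yet
    for t in event_times:
        # Assumed heuristic: watermark trails the newest event time by max_lateness_s.
        watermark = max(watermark, t - max_lateness_s)
        if next_end is None:
            # Align the first window to a multiple of window_s.
            next_end = (t // window_s + 1) * window_s
        # Fire every window whose end the watermark has now reached.
        while watermark >= next_end:
            yield next_end - window_s, next_end
            next_end += window_s


fired = list(fire_windows([5, 30, 65, 72, 130, 200]))
print(fired)
```

Each yielded pair is the point at which you would emit the signal record for that window. Unlike a real windowed pipeline, this toy version also fires for windows that received no events.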
I don't think there is any explicit way to access the watermark.