- Distinct values are cached with every streamed batch of data.
- How do I build up the cache by adding the distinct values from the next batch to the already cached RDD?
You cannot directly append data to an RDD because RDDs are immutable. Instead, use union to combine the already cached RDD with the distinct values of each new batch, then cache the resulting RDD.
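A minimal sketch of this idea, assuming a socket text stream of `String` values; the object name, host/port, and batch interval are placeholders and not from the original question:

```scala
import org.apache.spark.SparkConf
import org.apache.spark.rdd.RDD
import org.apache.spark.streaming.{Seconds, StreamingContext}

object DistinctCacheSketch {
  def main(args: Array[String]): Unit = {
    val conf = new SparkConf().setAppName("distinct-cache").setMaster("local[2]")
    val ssc = new StreamingContext(conf, Seconds(10))

    // Accumulated RDD of distinct values seen so far; starts empty.
    var cachedDistinct: RDD[String] = ssc.sparkContext.emptyRDD[String]
    cachedDistinct.cache()

    // Hypothetical source: a socket text stream.
    val lines = ssc.socketTextStream("localhost", 9999)

    lines.foreachRDD { batch =>
      // Union the previous cache with this batch, re-deduplicate, and cache the new RDD.
      val updated = cachedDistinct.union(batch).distinct().cache()
      updated.count()              // force materialization of the new cache
      cachedDistinct.unpersist()   // release the old cached RDD
      cachedDistinct = updated     // foreachRDD runs on the driver, so reassigning the var is safe
    }

    ssc.start()
    ssc.awaitTermination()
  }
}
```

Unpersisting the old RDD keeps only one copy of the accumulated distinct values in memory. Because each union extends the lineage of the accumulated RDD, periodically checkpointing it (e.g. `updated.checkpoint()` after setting a checkpoint directory) is worth considering for long-running streams.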