Difference between Spark dataset write

2020-05-06 12:52发布

I'm using Spark-Java.

I need to know If there is any diffrence (performance etc) between the following write to Hadoop methods:

ds.write().mode(mode).format("orc").save(path);

Or

ds.write().mode(mode).orc(path);

Thanks.

1条回答
▲ chillily
2楼-- · 2020-05-06 13:46

There is no difference

orc(path) is simply a shortcut method for format("orc").save(path)

Same applies for .json(path) and csv(path) with the default write(path) with no format being Parquet

查看更多
登录 后发表回答