How to validate one csv data compare with another

2019-08-17 01:25发布

I have two csv file . In one file i have 10 rows and in another list of data . What i want to do is , check the data of one filed of first csv and compare it with another csv file . So how can i achieve this ? Any help would be great .

标签： pentaho kettle pentaho-spoon pentaho-data-integration

1条回答

\"骚年 ilove

2楼-- · 2019-08-17 02:23

The step you are looking for is named the a Stream Lookup step.`

Read you CSV and the reference files, and drop the two flows in a Stream Lookup and set it up as follow: a) Lookup step = the step that reads the reference b) Keys / field = the name of field of the CSV that contains any field able to identify the row in the reference file. c) Keys / Lookup field = the name of the field in the reference file. d) Field to retrieve = the name of the field in the reference to return (may be the identifier or any other field you need) e) Field to retrieve / Type = Do not forget !

Like that, you will add a column from the reference file to the 10 rows of the CSV file. You may then filter out the rows which the Lookup did not found by testing if the value of the new column is not null.

As in the PDI all the above setup are guided with drop down lists, it should take you 2 minutes.

0人赞添加讨论(0) 举报

How to validate one csv data compare with another

采纳回答

编辑标签

举报内容

检举类型

检举原因

检举说明(必填)

打开微信“扫一扫”，打开网页后点击屏幕右上角分享按钮

付费偷看金额在0.1-10元之间