I'm using Pentaho Data Integration to create a transformation from XLSX files to MySQL, but I can't import data from large files with the Excel 2007 XLSX (Apache POI Streaming) option. It gives me out-of-memory errors.
Did you try this option?
Advanced settings -> Generation mode -> Less memory consumed for large excel (Event mode)
(You need to check "Read excel2007 file format" first.)
I would recommend increasing the JVM memory allocation before running the transformation. By default, Pentaho Data Integration (aka Kettle) ships with a fairly low memory allocation, which causes problems when running ETLs that involve large files. You need to raise the
-Xmx
value so that it specifies a larger upper memory limit. If you run Spoon on Windows, edit the memory settings line in spoon.bat, as in the sketch below. If you use Kitchen or Pan, edit kitchen.bat or pan.bat accordingly. On Linux, change the corresponding .sh files instead.
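For illustration only, this is roughly what the line looks like in a recent spoon.bat; the exact variable name and flags vary by PDI and Java version, and the -Xmx4096m value here is just an example, not a prescribed setting:

    REM spoon.bat -- raise -Xmx to give the JVM a larger heap for big XLSX files
    if "%PENTAHO_DI_JAVA_OPTIONS%"=="" set PENTAHO_DI_JAVA_OPTIONS="-Xms1024m" "-Xmx4096m" "-XX:MaxPermSize=256m"

spoon.sh, kitchen.sh and pan.sh have an equivalent PENTAHO_DI_JAVA_OPTIONS assignment on Linux; older PDI releases set the same -Xmx flag in an OPT variable instead, so look for whichever of the two your version uses.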