String UTF-8 encoding with cyrillic in H2O

2019-07-14 05:42发布

I load csv file of utf-8 encoding with cyrillic strings. After parsing in Flow interface - i see not cyrillic, but not readable symbols like "пїўпѕЂпѕ™пїђпѕ" How can i use utf-8 cyrillic strings in H2O?

2条回答
Summer. ? 凉城
2楼-- · 2019-07-14 06:11

check your csv file in text and binary presentation to find how Cyrillic text is encoded, if it is UTF-8 it should look like this:

Привет

for the word

Привет

查看更多
干净又极端
3楼-- · 2019-07-14 06:12

This appears to be a bug in the Flow interface, but only in the setupParse command. If you continue through and do the import, the data gets imported correctly.

I've reported the bug, with test data and screenshots (taken in Firefox) here:

https://0xdata.atlassian.net/browse/PUBDEV-4640

So if you have additional information, or the bug is behaving differently for you, it'd be good to add it to that bug report.

查看更多
登录 后发表回答