How do I remove unicode characters from a bunch of text files on the terminal? I've tried this but it didn't work:
sed 'g/\u'U+200E'//' -i *.txt
I need to remove these Unicode characters from the text files:
U+0091 - sort of weird "control" space
U+0092 - same sort of weird "control" space
U+00A0 - non-breaking space
U+200E - left to right mark
If you want to remove ONLY particular characters and you have python, you can:
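A minimal sketch of that approach, assuming the files are UTF-8 encoded and that the four code points from the question are the only ones to drop. The names `UNWANTED`, `strip_unwanted` and `clean_file` are made up for this example:

```python
import sys

# Code points from the question, mapped to None so str.translate() deletes them.
UNWANTED = dict.fromkeys([0x0091, 0x0092, 0x00A0, 0x200E])


def strip_unwanted(text):
    """Return text with the unwanted code points removed."""
    return text.translate(UNWANTED)


def clean_file(path, encoding="utf-8"):
    """Rewrite one file in place (assumption: it decodes as `encoding`)."""
    # newline="" keeps the file's existing line endings untouched.
    with open(path, encoding=encoding, newline="") as f:
        original = f.read()
    cleaned = strip_unwanted(original)
    if cleaned != original:
        with open(path, "w", encoding=encoding, newline="") as f:
            f.write(cleaned)


if __name__ == "__main__":
    # Every command-line argument is treated as a file to clean.
    for name in sys.argv[1:]:
        clean_file(name)
```

Saved under a hypothetical name such as strip_chars.py, it could be run as `python3 strip_chars.py *.txt`. It overwrites the files, so try it on copies first.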
Use iconv with transliteration enabled, for example: iconv -f utf-8 -t ascii//TRANSLIT file.txt
This will translate characters like "Š" into "S" (most similar looking ones).
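If iconv is not available, a roughly similar effect can be approximated in Python. This is a different technique from iconv's transliteration tables, not a reimplementation of them: the sketch uses Unicode NFKD decomposition and then drops whatever is still non-ASCII. The function name `to_ascii` is invented for the example.

```python
import unicodedata


def to_ascii(text):
    """Approximate transliteration: decompose, then drop non-ASCII."""
    # NFKD splits "Š" into "S" plus a combining caron (U+030C),
    # and turns U+00A0 (no-break space) into a plain space.
    decomposed = unicodedata.normalize("NFKD", text)
    # Encoding with errors="ignore" discards the combining marks and any
    # character that has no ASCII decomposition (e.g. U+200E).
    return decomposed.encode("ascii", "ignore").decode("ascii")
```

Characters with no decomposition are silently removed here, so the result can differ from what iconv produces for the same input.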
Convert Swift files from utf-8 to ascii:
swift auto completion not working in Xcode6-Beta
Clear all non-ASCII characters of file.txt:
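One way to sketch this in Python is to work on raw bytes: in UTF-8 (and in single-byte encodings such as Latin-1) every non-ASCII character is made up only of bytes with a value of 0x80 or higher, so those bytes can simply be filtered out. `strip_non_ascii` is a name chosen for this example, and the approach is not suitable for UTF-16 or UTF-32 files.

```python
def strip_non_ascii(data):
    """Drop every byte outside the 7-bit ASCII range."""
    return bytes(b for b in data if b < 0x80)


def clean_file(path):
    """Rewrite one file in place, keeping only its ASCII bytes."""
    with open(path, "rb") as f:
        data = f.read()
    with open(path, "wb") as f:
        f.write(strip_non_ascii(data))
```

Calling `clean_file("file.txt")` overwrites the file, so keep a backup if the removed characters might matter.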
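For the UTF-8 encoding of Unicode, you can use a sed regular expression that matches each character's encoded bytes, for example \xc2\xa0 for U+00A0 and \xe2\x80\x8e for U+200E.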
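The byte-level idea carries over to Python as well. A minimal sketch, assuming the input is valid UTF-8; `PATTERN` and `strip_bytes` are names made up for the example:

```python
import re

# UTF-8 byte sequences of the code points from the question:
#   U+0091 -> C2 91    U+0092 -> C2 92
#   U+00A0 -> C2 A0    U+200E -> E2 80 8E
PATTERN = re.compile(rb"\xc2[\x91\x92\xa0]|\xe2\x80\x8e")


def strip_bytes(data):
    """Remove the encoded forms of the four unwanted characters."""
    return PATTERN.sub(b"", data)
```

Because C2 and E2 only ever appear as lead bytes in valid UTF-8, the pattern cannot match across the boundary of two other characters.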