What command can I use to identify and remove certain strange characters that form "words" such as:
í‰äó_
퀌¢í‰ä‰åí‰ä‹¢
it퀌¢í‰ä‰åí‰ä‹¢
í‰äóìgo
from a series of files? Those are some examples... I want to remove such occurrences.
Since you tagged shell and command-line, here you go:

How about a regex sub? Something like:
Add to the regex any other allowed char.
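The regex substitution suggested above might look roughly like the sketch below. The allowed character set (ASCII letters, digits, whitespace and basic punctuation) and the function name `strip_strange_chars` are assumptions for illustration, not the original answer's code; extend the character class with anything else that should be kept.

```python
import re

# Assumption: "allowed" means ASCII letters, digits, whitespace and
# some basic punctuation. Add any other allowed char to the class.
DISALLOWED = re.compile(r"[^A-Za-z0-9\s.,;:!?'\"()_-]+")


def strip_strange_chars(text: str) -> str:
    """Remove every run of characters outside the allowed set."""
    return DISALLOWED.sub("", text)


print(strip_strange_chars("it퀌¢í‰ä‰åí‰ä‹¢ works"))  # it works
```

Note that this removes only the strange characters themselves, so a token like "í‰äóìgo" is reduced to "go" rather than dropped entirely.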
Using the string module after you've gotten the data from the file:

Alternative one-liner:
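A possible sketch of both ideas, assuming the intent is to keep only the characters in `string.printable` (ASCII letters, digits, punctuation and whitespace). The names `keep_printable` and `sample` are hypothetical, and the one-liner is only a guess at the kind of expression the answer had in mind.

```python
import string

# Assumption: anything outside string.printable counts as "strange".
ALLOWED = set(string.printable)


def keep_printable(text: str) -> str:
    """Return text with every non-printable-ASCII character dropped."""
    return "".join(ch for ch in text if ch in ALLOWED)


sample = "í‰äóìgo home"
print(keep_printable(sample))  # go home

# The same idea as a one-liner, without the helper function:
print("".join(c for c in sample if c in string.printable))  # go home
```

The text would be read from each file first (for example with `open(path, encoding="utf-8").read()`) and the cleaned result written back.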