I am looking for a way to ensure that the end-of-line style of a file is maintained in python program while reading, editing and writing.
Python has universal file ending support, which can convert all line endings to \n
when the file is read, and then convert them all to the system default when the file is written. In my case I would like to still do the initial conversion, but then write the file with the original EOL style rather than the system default.
Is there a standard way to do this kind of thing? If not, is there a standard way to detect the EOL style of a file?
Assuming that there is no standard way to do this, a possible work flow would be:
- Read in a file in binary mode.
- Decode into utf-8 (or whatever encoding is required).
- Detect EOL style.
Convert all line endings to
\n
.Do stuff with the file.
Convert all line endings to original style.
- Encode file.
- Write file in binary mode.
In this work flow, what is the best way to do step 2?
To preserve original line endings, use
newline=''
to read or write line endings untranslated.Note that if the text manipulation itself deals with line endings, additional or alternative logic may be needed to detect and match original line endings.
The
'U'
mode also works, but is deprecated.Python Documentation: open
Use python's universal newline support:
newlines
contains the file's delimiter or a tuple of delimiters if the file uses a mix of delimiters.