Determine what line ending is used in a text file

Posted 2019-03-25 18:57

What's the best way in C# to determine the line endings used in a text file (Unix, Windows, Mac)?

7 answers
戒情不戒烟
#2 · 2019-03-25 19:01

When reading most text formats, I usually look for \n and then Trim() the resulting string (whitespace at the beginning and end is often redundant anyway), which also strips a leftover \r.

何必那么认真
#3 · 2019-03-25 19:02

Note that text files may have inconsistent line endings, and your program should not choke on that. Calling ReadLine on a StreamReader (or a similar method) handles any of the possible line endings automatically.
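A minimal sketch of the ReadLine approach. The helper name `ReadAllLines` is hypothetical, and the usage below feeds it a StringReader instead of a file so the example is self-contained; a StreamReader opened on a file can be passed in the same way, since both derive from TextReader.

```csharp
using System.Collections.Generic;
using System.IO;

public static class ReadLineDemo
{
    // Hypothetical helper: collects every line from the reader.
    // ReadLine treats "\n", "\r", and "\r\n" as line terminators
    // and returns null at the end of the input.
    public static List<string> ReadAllLines(TextReader reader)
    {
        var lines = new List<string>();
        string line;
        while ((line = reader.ReadLine()) != null)
        {
            lines.Add(line);
        }
        return lines;
    }
}
```

For example, `ReadLineDemo.ReadAllLines(new StringReader("a\r\nb\nc\rd"))` splits the mixed-ending input into the four lines a, b, c, and d.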

If you manually read lines from a file, make sure to accept any line endings, even if inconsistent. In practice, this is quite easy using the following algorithm:

  • Scan ahead until you find either CR or LF.
  • If you read CR, peek ahead at the next character.
  • If that next character is LF, consume it (otherwise, put it back).
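The steps above can be sketched roughly as follows. The class and method names are hypothetical. The sketch uses `TextReader.Peek`, which looks at the next character without consuming it, so no explicit "put back" step is needed. One caveat is that `Peek` on a StreamReader over a non-seekable stream may return -1 before the real end of the data, so treat this as an illustration rather than a hardened reader.

```csharp
using System.Collections.Generic;
using System.IO;
using System.Text;

public static class TolerantLineReader
{
    // Hypothetical helper: yields lines, accepting CR, LF, or CRLF,
    // even when they are mixed within one input.
    public static IEnumerable<string> ReadLines(TextReader reader)
    {
        var line = new StringBuilder();
        int c;
        while ((c = reader.Read()) != -1)
        {
            if (c == '\r')
            {
                // CR found: peek at the next character and
                // consume it only if it is LF (CRLF pair).
                if (reader.Peek() == '\n')
                {
                    reader.Read();
                }
                yield return line.ToString();
                line.Clear();
            }
            else if (c == '\n')
            {
                yield return line.ToString();
                line.Clear();
            }
            else
            {
                line.Append((char)c);
            }
        }
        // Emit a final line that has no terminator.
        if (line.Length > 0)
        {
            yield return line.ToString();
        }
    }
}
```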
Bombasti
#4 · 2019-03-25 19:08

I would imagine you can't know for sure; the user would have to set this in the editor. You could use some AI, though. The algorithm would be:

  1. Search for each type of line ending, i.e. for those specific characters.
  2. Measure the distances between them.
  3. If one type tends to repeat, assume that's the type. Count the repeats and use some measure of dispersion.

So, for example, if you had repeats of CRLF at 38, 40, 45, and that was within tolerance, you'd default to assuming the line ending was CRLF.

Fickle 薄情
#5 · 2019-03-25 19:11

Here is some advanced guesswork: read the file and count the CRs and LFs.

if (CR > LF*2) then "Mac" 
else if (LF > CR*2) then "Unix"
else "Windows"
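In C#, that counting heuristic might look something like the sketch below. The class and method names are hypothetical, and the thresholds are taken directly from the pseudocode. The "Unknown" result for input with no CR or LF at all is an added assumption; the pseudocode as written would report "Windows" in that case.

```csharp
using System.IO;

public static class LineEndingGuess
{
    // Hypothetical helper: guesses the convention by counting CR and LF.
    public static string Guess(TextReader reader)
    {
        long cr = 0;
        long lf = 0;
        int c;
        while ((c = reader.Read()) != -1)
        {
            if (c == '\r') cr++;
            else if (c == '\n') lf++;
        }

        // Assumption not in the original pseudocode: no terminators at all.
        if (cr == 0 && lf == 0) return "Unknown";

        if (cr > lf * 2) return "Mac";   // mostly bare CR
        if (lf > cr * 2) return "Unix";  // mostly bare LF
        return "Windows";                // roughly equal counts, i.e. CRLF pairs
    }
}
```

Usage would be along the lines of `LineEndingGuess.Guess(new StreamReader(path))`, where `path` is whatever file you want to inspect.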

Also note that newer Macs (Mac OS X) use Unix line endings.

Deceive 欺骗
#6 · 2019-03-25 19:21

I'd just search the file for the first \r or \n. If it is a \r, I'd look at the next character to see if it's a \n; if so, the line ending is \r\n, otherwise it's whichever character was found.
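A rough sketch of this first-terminator check, with hypothetical class and method names. It assumes the text is already in memory as a string. When the first match is a \r, it inspects the character that follows; a \r sitting directly before the first match cannot occur, because that \r would itself have been the first match.

```csharp
public static class FirstLineEnding
{
    // Hypothetical helper: returns "\r\n", "\r", or "\n" based on the
    // first terminator found, or null when the text contains none.
    public static string Detect(string text)
    {
        int i = text.IndexOfAny(new[] { '\r', '\n' });
        if (i < 0)
        {
            return null;
        }
        if (text[i] == '\n')
        {
            return "\n";
        }
        // First match is CR: it is CRLF only if LF follows immediately.
        bool followedByLf = i + 1 < text.Length && text[i + 1] == '\n';
        return followedByLf ? "\r\n" : "\r";
    }
}
```

Because only the first terminator is examined, a file with inconsistent endings is reported according to whichever ending happens to come first.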

聊天终结者
#7 · 2019-03-25 19:27

If it were me, I'd just read the file one char at a time until I came across the first \r or \n. This assumes you have sensible input.
