Why is HTTP designed as a plain-text protocol?

Posted 2019-01-18 02:11

Yesterday I had a discussion with my colleagues about HTTP, and the question came up of why HTTP was designed as a plain-text protocol. It could surely have been designed as a binary protocol, like TCP, using flags to represent the different methods (POST, GET) and variables (HTTP headers). So why was HTTP designed this way? Are there technical or historical reasons?

10 answers
Ridiculous、
#2 · 2019-01-18 02:27

With HTTP, the content of a request is almost always orders of magnitude larger than the protocol overhead. Converting the protocol into a binary one would save very little bandwidth, and the easy debuggability that a text protocol offers outweighs the minor bandwidth savings of a binary protocol.

Rolldiameter
#3 · 2019-01-18 02:28

A reason that's both technical and historical is that text protocols are almost always preferred in the Unix world.

Well, this is not really a reason but a pattern. The rationale behind it is that text protocols allow you to see what's going on on the network just by dumping everything that goes through. You don't need a specialized analyzer as you do for TCP/IP. This makes them easier to debug and easier to maintain.
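To illustrate the "just dump it" point, here is a minimal sketch that renders the same request in two encodings the way a naive dump would. The binary layout and the `METHOD_CODES` values are made up for this example and do not correspond to any real protocol; `dump` is a hypothetical helper.

```python
import struct

# Text form: the bytes an HTTP/1.1 client puts on the wire.
text_request = b"GET /index.html HTTP/1.1\r\nHost: example.com\r\n\r\n"

# Hypothetical binary form (NOT a real protocol): a 1-byte method code,
# a 2-byte big-endian path length, then the path bytes.
METHOD_CODES = {"GET": 1, "POST": 2}  # made-up flag values
path = b"/index.html"
binary_request = struct.pack("!BH", METHOD_CODES["GET"], len(path)) + path


def dump(data: bytes) -> str:
    """Show printable ASCII as-is and every other byte as '.'."""
    return "".join(chr(b) if 32 <= b < 127 else "." for b in data)


print(dump(text_request))    # method, path, version and Host are all readable
print(dump(binary_request))  # the method code and length are just dots
```

In the text dump you can read the whole request. In the binary dump the path survives only because it happens to be ASCII, while the method and length need a decoder that knows the layout.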

Not only HTTP, but many protocols are text based (e.g., FTP, POP3, SMTP, IMAP).

You might want to take a look at The Art of Unix Programming for a much more detailed explanation of this Unix thing.

何必那么认真
#4 · 2019-01-18 02:34

Historically, it all starts with RFC 822 (Standard for the Format of ARPA Internet Text Messages), whose latest version is RFC 5322 (Internet Message Format). SMTP (RFC 821) was one of the most popular protocols built around RFC 822 messages. And HTTP was born out of the same tradition as SMTP (your mail protocol), reusing the RFC 822 style of text headers.

做自己的国王
#5 · 2019-01-18 02:37

Per RFC 2616 section 3.7.1 (HTTP/1.1), the delimiter that ends a command line or header is the text line break, CRLF. Text-based application protocols make it easier to carry out a conversation (for troubleshooting) purely with a Telnet client. They also make it easier to program with ReadLine() calls and string matching.

The CRLF delimiter also allows nearly unlimited arbitrary header extensions, unlike fixed-size TCP or IP headers, where fields are hard-coded at bit offsets.
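As a rough sketch of the line-oriented parsing described above, the following splits a request head on CRLF and reads each header as a `name: value` line. `parse_request` and the `X-Made-Up-Extension` header are hypothetical names for this example, and the parser deliberately ignores details a real server must handle (repeated headers, size limits, malformed input).

```python
def parse_request(raw: bytes):
    """Split an HTTP/1.x request head into (method, target, version, headers)."""
    # The head ends at the first empty line, i.e. CRLF CRLF.
    head, _, _body = raw.partition(b"\r\n\r\n")
    lines = head.decode("iso-8859-1").split("\r\n")

    # Request line: three space-separated fields.
    method, target, version = lines[0].split(" ", 2)

    # Every further line is one header; names are case-insensitive.
    headers = {}
    for line in lines[1:]:
        name, _, value = line.partition(":")
        headers[name.strip().lower()] = value.strip()
    return method, target, version, headers


raw = (
    b"GET /index.html HTTP/1.1\r\n"
    b"Host: example.com\r\n"
    b"X-Made-Up-Extension: anything\r\n"  # hypothetical header
    b"\r\n"
)
method, target, version, headers = parse_request(raw)
print(method, target, version)
print(headers)
```

The made-up header is picked up with no change to the parser, which is the extensibility point: a new header is just another line, whereas a fixed-offset binary header would need a new layout or a reserved options area.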

戒情不戒烟
#6 · 2019-01-18 02:38

So it's easier to "read" the traffic or create a client or server?

You can debate whether it actually makes it easier, but surely that was the intent.

该账号已被封号
#7 · 2019-01-18 02:40

HTTP stands for "Hypertext Transfer Protocol".

It was initially devised as a way to serve text documents, hence the text-based protocol.

What we do with HTTP now is far beyond its original intent.
