Is there a lightweight multipart/form-data parser

2020-05-14 08:25发布

I'm looking at integrating multipart form-data parsing in a web server module so that I can relieve backend web applications (often written in dynamic languages) from parsing the multipart data themselves. The multipart grammar (RFC 2046) looks non-trivial and if I implement it by hand a lot of things can go wrong. Is there already a good, lightweight multipart/form-data parser written in C or C++? I'm looking for one with no external dependencies other than the C or C++ standard library. I don't need email attachment handling or buffered I/O classes or a portability runtime or whatever, just multipart/form-data parsing.

Things that I've considered:

  • GMime - depends on glib, so no go.
  • libapreq - too large, depends on APR, badly documented, no unit tests.

I've also looked at writing a parser with Ragel, but I can't figure out how to do it because the grammar is not static: the boundary can change arbitrarily.

5条回答
一夜七次
2楼-- · 2020-05-14 08:26

cgicc supports it... But is written quite badly and relays on entry buffer in memory.

查看更多
对你真心纯属浪费
3楼-- · 2020-05-14 08:36

mimetic claims to support it. I think GNU cgicc may also support it.

查看更多
霸刀☆藐视天下
4楼-- · 2020-05-14 08:44

Yes, there one. No secret it is my own. Feel free to use it. The link is: MPFDParser. It has no dependencies at all.

查看更多
迷人小祖宗
5楼-- · 2020-05-14 08:47

I know this question is a couple of years old now, but I needed the same and ended up using this:

https://github.com/iafonov/multipart-parser-c

查看更多
做个烂人
6楼-- · 2020-05-14 08:48

this may not answer your question directly, but did you consider hiphop for php from facebook?

it converts your php code to c++ then compiles with g++.

might save you time in writing something on your own.

查看更多
登录 后发表回答