I'm a new Go programmer, coming from the world of web application and service development. Apologies is this is a herp de-derp question, but my googling around for an answer hasn't found anything. Also, this is borderline Server Fault territory, but since I'm more interested in the APIs/programmatic interfaces I'm asking here.
I've written a small go program using the net/http
package's built-in web server. I'm getting ready to deploy to production, but I'm a little unclear on the process of model go Go's web-server and how I should deploy.
Specifically -- in the environments I'm used to (PHP, Ruby, Python) we have a web server (Apache, Nginx, etc.) sitting in front of our application, and we configure these web servers to use a certain number of worker processes/threads, and configure how many individual HTTP(S) connections each thread should process.
I haven't been able to find information on how Go's web-server handles this, or practical information on how to scale/plan-for-scale for a Go web server.
i.e. -- if I have a simple program running, ready to handle an HTTP request
func main() {
http.HandleFunc("/", processRequest)
http.ListenAndServe(":8000", nil)
}
how many connections will HandleFunc
try to handle at once? Or will it start blocking when a connection opens, and only serve the next connection once the connection closes?
Or should I just not worry about this and jam everything into a go routine? But if I do that how do I prevent the system from becoming bogged down by too many threads of execution?
I'm basically trying to
- Understand the process mode of the go web server
- Find the built-in features of go for tweaking this, and/or whatever standard package folks are using
Like I said, I'm very new to go, so if I'm completely missing the plot on this please let me know!
Tweaking / configuring the HTTP server
The type that implements the HTTP server is
http.Server
. If you don't create anhttp.Server
yourself e.g. because you call thehttp.ListenAndServe()
function, that creates anhttp.Server
under the hood for you:So if you want to tweek / customize the HTTP server, then create one yourself and call its
Server.ListenAndServe()
method yourself.http.Server
is a struct, its zero value is a valid configuration. See its doc what fields it has and so what you can tweak / configure.The "Process Management" of the HTTP server is documented at
Server.Serve()
:So each incoming HTTP request is handled in its new goroutine, meaning they are served concurrently. Unfortunately the API does not document any way to jump in and change how this works.
And looking at the current implementation (Go 1.6.2), there is also no undocumented way to do that.
server.go
, currently line #2107-2139:As you can see in line #2137, the connection is served unconditionally on a new goroutine, so there's nothing you can do about that.
Limiting the "worker" goroutines
If you want to limit the number of goroutines serving requests, you can still do it.
You may limit them on multiple levels. For limiting on the listener level, see Darigaaz's answer. To limit on the Handler level, read on.
For example you could insert a code to each of your
http.Handler
or handler functions (http.HandlerFunc
) which only proceeds if the number of concurrent request serving goroutines are less than a specified limit.There are numerous constructs to such limiting-synchronization code. One example could be: create a buffered channel with capacity being your desired limit. Each handler should first send a value on this channel, and then do the work. When the handler returns, it must receive a value from the channel: so it's best done in a deferred function (not to forget to "clean" itself).
If the buffer is full, a new request attempting to send on the channel will block: wait until a request finishes its work.
Note that you don't have to inject this limiting code to all your handlers, you may use a "middleware" pattern, a new handler type which wraps your handlers, does this limiting-synchronization job, and calls the wrapped handler in the middle of it.
The advantage of limiting in the handler (as opposed to limiting in Listeners) is that in the handler we know what the handler does, so we can do selective limiting (e.g. we may choose to limit certain requests such as database operations, and not to limit others like serving static resources) or we can create multiple, distinct limit groups arbitrarily to our needs (e.g. limit concurrent db requests to 10 max, limit static requests to 100 max, limit heavy computational requests to 3 max) etc. We can also easily realize limitations like unlimited (or high limit) for logged-in / paying users, and low limit for anonymous / non-paying users.
Also note that you can even do the rate-limiting in a single place, without using middlewares. Create a "main handler", and pass that to
http.ListenAndServe()
(orServer.ListenAndServe()
). In this main handler do the rate limiting (e.g. using a buffered channel as mentioned above), and simply forward the call to thehttp.ServeMux
you're using.Here's a simple example which uses
http.ListenAndServe()
and the default multiplexer of thehttp
package (http.DefaultServeMux
) for demonstration. It limits concurrent requests to 2:Deployment
Web applications written in Go do not require external servers to control processes, as the Go webserver itself handles requests concurrently.
So you may start your webserver written in Go as-is: the Go webserver is production ready.
You may of course use other servers for additional tasks if you wish so (e.g. HTTPS handling, authentication / authorization, routing, load balancing between multiple servers).
ListenAndServe
starts an HTTP server with a given address and handler. The handler is usually nil, which means to useDefaultServeMux
. Handle andHandleFunc
add handlers toDefaultServeMux
.Look at http.Server, alot of fields are optional and works fine with default values.
Now lets look at http.ListenAndServe, is not hard at all
so the default server is super simple to create.
It Listen on "addr" and Accept every connection, then it spawns a goroutine to handle each connection independently. (HTTP/2.0 is a little bit different, but is the same in general).
If you want to control connections you have 2 options:
Create custom server (its 3 lines of code) with server.ConnState callback and control client connections from there. (but they will be Accepted by kernel anyway)
Create custom server with your own implementation of
net.Listener
(likeLimitedListener
) and control connections from there, this way you will have ultimate power over connections.Since default
http.Server
has no way to be stopped the second way is the only way to gracefully terminate listener. You can combine two methods to implement different strategies, well its been done already.