In Google app engine only one instance handling mo

2019-03-03 01:29发布

We have 4 instances in google app engine and only one instance is handling most of the requests. How can we scale such that all the instances can handle equal number of requests?

标签： google-app-engine

2条回答

虎瘦雄心在

2楼-- · 2019-03-03 02:13

I would also ask the question is if you're using resident instances vs dynamic instances.

For example if you have configured scaling overrides within your application yaml file you may see some instances just "sitting there". Resident instances can handle peak / overflow traffic and are always on but may not ultimately always be serving traffic.

EG:

 automatic_scaling:
      min_idle_instances: 6

0人赞添加讨论(0) 举报

啃猪蹄的小仙女

3楼-- · 2019-03-03 02:31

Evenly balancing the load across the running instances doesn't actually mean scaling. As long as one instance is capable of handling the incoming requests with acceptable performance you're not looking at a scaling issue.

If you're using automatic or basic scaling (which you should, if you're concerned with scalability) the uneven load spread across the running instances can actually be essential for controlling the automatic instance on-demand spinup (when load exceeds a certain threshold) and shutdown (when instances are idling).

For example if a load that could be easily handled by 1-2 instances would be evenly distributed across 4 running instances then none of the 4 instances would be idle long enough to be shut down.

Having a single instance as the "preferred" one to run traffic on and the others just picking op "overflowing"/peak load makes the algorithm for controlling instance spinup/shutdown a lot simpler (and I think more precise as well) - the threshold comparison logic only needs to be applied on one (or just a few) running instances, not on all of them.

0人赞添加讨论(0) 举报

In Google app engine only one instance handling mo

采纳回答

编辑标签

举报内容

检举类型

检举原因

检举说明(必填)

打开微信“扫一扫”，打开网页后点击屏幕右上角分享按钮

付费偷看金额在0.1-10元之间