CDP:Scheduled Scale Out pattern

From AWS-CloudDesignPattern
Revision as of 00:42, 15 November 2012 by Cdpadmin (Talk | contribs)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to: navigation, search

Increasing or Decreasing the Number of Servers Following a Schedule


Problems to Be Solved

The Scale-Out Pattern is effective when handling high levels of traffic in a web service that is structured in a cloud environment. However, when monitoring the load status to manually add virtual servers, or automatically adding instances depending on the load statuses of the virtual servers, the launching of instances may not be able to keep up when there is a sudden increase in accesses (cases wherein traffic doubles in less than five minutes).

Explanation of the Cloud Solution/Pattern

When the timing with which there will be an instantaneous increase in accesses is understood, then scale-out through scheduling is effective. While the fundamental structure is similar to that of the Scale-Out Pattern, the key distinctive feature is that of performing the scale-out through specifying the timing with which to do so. Completing the scale-out in advance makes it possible to handle a rapid increase in traffic with a robust system, and performing the scaling immediately prior to the rapid increase in traffic can minimize wasted costs.


Auto Scaling in AWS has a function for specifying the time at which settings are to be changed. This function can be used to configure scheduled scale-out. Scale-in is also possible through specifying a time band wherein the traffic is anticipated to settle down.

  • Reference the Scale-Out Pattern to set up Auto Scaling (including scale-out triggers and scale-in triggers).
  • Specify the timing with which to increase the number of EC2 instances and change, to the number of instances to be provided, the setting for "minimum number of instances (– – – min-size)." * At the specified time, new EC2 instances will be launched up to the minimum number of instances specified.
  • If the minimum number of instances again is reduced with the timing with which the load is to settle down, then scale-in will follow the trigger that has been set.




  • This makes it possible to increase the number of EC2 instances following the schedule with which the traffic volume is anticipated to increase.
  • This reduces costs because the number of EC2 instances is reduced when there is little traffic.
  • When compared to scale-up, the limit on processing capability is extremely high because the required number of EC2 instances can be provided in parallel, under the control of the ELB.


  • The specified time is in terms of UTC.
  • When there is to be a sudden increase in traffic, the ELB must be scaled-out in addition to the EC2 instances. In this case, pre-warming is requested.


See the Scale-out Pattern.

Personal tools