A framework for controlling multi-environment multi-component terraform-managed AWS infrastructure
Terraform scaffold consists of a terraform wrapper bash script, a bootstrap script and a set of directories providing the locations to store terraform code and variables.
Thing | Things about the Thing |
---|---|
bin/terraform.sh | The terraformscaffold script |
bootstrap/ | The bootstrap terraform code used for creating the terraformscaffold S3 bucket |
components/ | The location for terraform "components". Terraform code intended to be run directly as a root module. |
etc/ | The location for environment-specific terraform variables files:env_{region}_{environment}.tfvars versions_{region}_{environment}.tfvars |
lib/ | Optional useful libraries, such as Jenkins pipeline groovy script |
modules/ | The optional location for terraform modules called by components |
plugin-cache/ | The default directory used for caching plugin downloads |
src/ | The optional location for source files, e.g. source for lambda functions zipped up into artefacts inside components |
The Scaffold is built around the concept that a logical "environment" may consist of any number of independent components across independent AWS accounts. What provides consistency across these components, and therefore defines the environment, are the variables shared between the components. For example, the CIDR block defining a primary VPC for a production environment is needed by the component that creates the VPC, it may also be needed by components in other VPCs or accounts. All components in a production enviroment are likely to share the "envirornment" variable value "production".
Scaffold achieves this by maintaining variables specific to an environment in the file etc/env{region}{environment}.tfvars, and then providing those variables as inputs to all components run for that environment. Any variables not required by a component are safely ignored, but all components have visibility of all variables for an environment.
Important Note: Variables in the Env and Versions variables files are not merged. You cannot, for example, define the same map variable in both files and have the keys in each definition merged into a resulting set. When a variable is defined more than once, the last one terraform evaluates takes precedence and overrides any previously encountered definition. To prevent unexpected consequences, never define variable values in more than one place.
Scaffold uses AWS S3 for storage of tfstate files. The bucket and location are specifically defined to be predictable to organise their storage and permit the use of terraform_remote_state resources. Any scaffold component can reliably refer to the S3 location of the state file for another component and depend upon the outputs provided. The naming convention is: s3://${project}-terraformscaffold-${aws_account_id}-${aws_region}/${project}/${aws_account_id}/${aws_region}/${environment}/${component}.tfstate. Each functional scaffold S3 bucket will only therefore contain content within the keyspace /${project}/${aws_account_id}/${aws_region} as these are unique to the bucket as well as all contents. The reason for the use of this keyspace is to permit the aggregation of state files from multiple bucket into a master bucket for backup or read-only review purposes. All scaffold buckets relevant to a person or organisation could be safely synchronised to a single bucket without fear of keyspace overlap.
Scaffold provides a logical separation of several types of environment variable:
- Global variables
- Region-scoped global variables
- Group variables
- Static environment variables
- Frequently-changing versions variables
- Dynamic (S3 stored) variables
- Secret (S3 stored, KMS encrypted variables - available but not recommended)
This seperation is purely logical, not functional. It makes no functional difference in which file a variable lives, or even whether a versions variables file exists; but it provides the capacity to seperate out mostly static variables that define the construction of the environment from variables that could change on each apply providing new AMI IDs, or dockerised application versions or database snapshot IDs when recreating development and testing databases.
Terraform Scaffold does not provide any mechanism for running terraform across multiple AWS accounts simultaneously, storing state files in a different account to the account being modified or any other functionality that would require Scaffold to intelligently manage AWS credentials. After extensive research and development it has become apparant that, despite some features available in terraform to handle more than one AWS account and the use of IAM roles, the features are not sufficiently mature or flexible to allow their application in a generic form.
Therefore, to ensure widest possible reach and capability of Scaffold, it requires that a specific set of AWS credentials be provided to it at invocation. These credentials must have the necessary access to read and write to the bootstrapped S3 state file bucket, and to create/modify/destroy the AWS resources being controlled via terraform.
Credentials can be provided in any of the standard mechanisms provided by the Boto search path, for example, EC2 Instance Profiles, an ~/.aws/credentials file or AWS_ACCESS_KEY_ID and AWS_SECRET_ACCESS_KEY environment variables.
If you want to make use of instance profiles, MFA tokens, AWS STS, Cross Account Roles or other fantastic IAM trickery, the recommended practice is to use a static access key or instance profile to call AWS STS using the AWS CLI tools, and then assign the temporary credentials that are generated to the AWS credential environment variables so that terraform can make use of them. This is done externally to Scaffold and would normally be integrated into a Jenkins job as a step to perform to prepare the environment before calling Scaffold.
Although as yet somewhat unrefined, Scaffold provides the capacity to incorporate additional scripted actions to take prior to and after running terraform on a given component. If there is a file called "pre_apply.sh" present in the top level of the component you are working with, then it will be executed as a bash script prior to any terraform action. If a file called post_apply.sh is present it will be executed immediately following any terraform action. This capability clearly could do with some improvement to support complex deployments with script dependencies, but as yet I have none to play with.
This is an experimental feature that is not necessarily an appropriate solution for secrets management in production systems due to the way state files are stored with all terraform variables included. In general a resource that requires secret information should look up that information itself once it has been created using role based credentials. If however you are simply looking for a solution to move secrets out of your Git repository and into S3 which can be locked down, then this might be useful.
On invocation, Scaffold checks for a file at s3://${bucket}/${project}/secrets/secret${region}${environment}.tfvars.enc. If it finds one, it attempts to use KMS to decrypt the contents into an array and then process each line as a key=value set of variables. Each one is then provided to terraform as an input variable in the form: -var 'key=value'. This ensures that the secret is never stored unencrypted on disk even temporarily during terraform apply. However.. the completely unencrypted format of the terraform tfplan and tfstate files means these secrets will still make it to disk and be stored in the clear, hence the above caveats. This is not a complete secrets management solution, but it is a useful quick fix to get your secrets out of Git while you work on a better long term plan.
Since 0.10 terraform has split its providers out into plugins which are downloaded separately. This has caused some issues in automation where you can be downloading the same provider endlessly. A long term solution to this has not yet been decided upon as there are many ways to implement a solution, but none really suit all likely scenarios. Ideally I would like to see management of this handled by kamatama41/tfenv. For the moment, terraformscaffold will instruct terraform to cache plugins in the plugin-cache/ top level directory by default. This can be overridden by exporting the TF_PLUGIN_CACHE_DIR variable with an appropriate value. This at least means that within one code checkout, regardless of swapping around components to plan or apply, you will only need to download providers once. If you have a local artifact repository or some other preference for keeping copies of providers locally you can use it with this variable.
Before using Scaffold, a bootstrapping stage is required. Scaffold is responsible for creating and maintaining the S3 buckets it uses to store component state files and even keeps the state file that defines the scaffold bucket in the same bucket. This is done with a special bootstrap mode within the script, invoked with the '--bootstrap' parameter. When used with the "apply" action, this will cause the script to create a bootstrap bucket and then configure the bucket as a remote state location for itself. nd upload the tfstate used for managing the bucket to the bucket. Once created, the bucket can then be used for any terraform apply for the specific combination of project, region and AWS account.
It is not recommended to modify the bootstrap code after creation as it risks the integrity of the state files stored in the bucket that manage other deployments; however this can be mitigated by configuring synchronisation with a master backup bucket external to Scaffold management.
Bootstrapping usage:
bin/terraform.sh \
-p/--project `project` \
-b/--bucket-prefix `bucket_prefix` \
-r/--region `region` \
--bootstrap \
-a/--action plan
bin/terraform.sh \
-p/--project `project` \
-b/--bucket-prefix `bucket_prefix` \
-r/--region `region` \
--bootstrap \
-a/--action apply
Where:
project
: the name of the project to have a terraform bootstrap appliedbucket_prefix
(optional - use only with caution): Defaults to:${project}-terraformscaffold"
region
(optional): Defaults to value of the AWS_DEFAULT_REGION environment variable
The terraformscaffold script is invoked as bin/terraform.sh. Once a state bucket has been bootstrapped, bin/terraform.sh can be run to apply terraform code. Its usage as of 25/01/2017 is:
bin/terraform.sh \
-a/--action `action` \
-b/--bucket-prefix `bucket_prefix` \
-c/--component `component_name` \
-e/--environment `environment` \
-g/--group `group` (optional) \
-i/--build-id `build_id` (optional) \
-p/--project `project` \
-r/--region `region` \
-d/--detailed-exitcode (optional) \
-n/--no-color (optional) \
-w/--compact-warnings (optional) \
-- \
<additional arguments to forward to the terraform binary call>
Where:
action
: Terraform action (or pseudo-action) to take, e.g. plan, apply, plan-destroy (runs plan with the -destroy flag), destroy, showbucket_prefix
(optional): Defaults to:${project_name}-terraformscaffold
- Only for use where a different bucket prefix has been bootstrappedbuild_id
(optional): Used in conjunction with the plan and apply actions,build_id
causes the creation and consumption of terraform plan files (.tfplan)- When
build_id
is omitted:- "Plan" provides normal plan output without generating a plan file
- "Apply" directly applies the component based on the code and state it is given.
- When
build_id
is provided:- "Plan" creates a plan file with
build_id
as part of the file name, and uploads the plan to the S3 state bucket under a key called "plans/" alongside the corresponding state file - "Apply" looks for and downloads the corresponding plan file generated by a plan job, and applies the changes in the plan file
- "Plan" creates a plan file with
- It is usual to provide, for example, the Jenkins $BUILD_ID parameter to Plan jobs, and then manually reference that particular Job ID when running a corresponding apply job.
- When
component_name
: The name of the terraform component in the components directory to run theaction
against.environment
: The name of the environment the component is to be actioned against, therefore implying the variables file(s) to be includedgroup
(optional): The name of the group to which the environment belongs, permitting the use of a group tfvars file as a "meta-environment" shared by more than one environmentproject
: The name of the project being deployed, as per the default bucket-prefix and state file keyspaceregion
(optional): The AWS region name unique to all components and terraform processes. Defaults to the value of the AWS_DEFAULT_REGION environment variable.detailed-exitcode
(optional): Passes detailed exit code flag to terraform.no-color
(optional): Passes no-color flag to terraform.compact-warnings
(optional): Passes compact-warnings flag to terraform.additional arguments
: Any arguments provided after "--" will be passed directly to terraform as its own arguments, e.g. allowing the provision of a 'target=value' parameter.