FaaS benchmarking suite for serverless functions with automatic build, deployment, and measurements.
SeBS is a diverse suite of FaaS benchmarks that enables automatic performance analysis of commercial and open-source serverless platforms. We provide a suite of benchmark applications and experiments, and use them to test and evaluate different components of FaaS systems. See the installation instructions to learn how to configure SeBS to use selected cloud services, and the usage instructions to automatically launch experiments in the cloud!
SeBS provides support for automatic deployment and invocation of benchmarks on AWS Lambda, Azure Functions, Google Cloud Functions, and a custom, Docker-based local evaluation platform. See the documentation on cloud providers to learn how to provide SeBS with cloud credentials. The documentation describes the design and implementation of our tool in detail; see the modularity section to learn how SeBS can be extended with new platforms, benchmarks, and experiments.
SeBS can be used with our Docker image `spcleth/serverless-benchmarks:latest`, or the tool can be installed locally.
When using SeBS, please cite our Middleware '21 paper. An extended version of the paper is available on arXiv, and you can find more details about this research in the paper summary. You can cite our software repository as well, using the citation button on the right.
@inproceedings{copik2021sebs,
  author = {Marcin Copik and Grzegorz Kwasniewski and Maciej Besta and Michal Podstawski and Torsten Hoefler},
  title = {SeBS: A Serverless Benchmark Suite for Function-as-a-Service Computing},
  year = {2021},
  publisher = {Association for Computing Machinery},
  url = {https://doi.org/10.1145/3464298.3476133},
  doi = {10.1145/3464298.3476133},
  booktitle = {Proceedings of the 22nd International Middleware Conference},
  series = {Middleware '21}
}
For details on benchmark selection and their characterization, please refer to our paper.
Type | Benchmark | Languages | Description |
---|---|---|---|
Webapps | 110.dynamic-html | Python, Node.js | Generate dynamic HTML from a template. |
Webapps | 120.uploader | Python, Node.js | Upload a file from a provided URL to cloud storage. |
Multimedia | 210.thumbnailer | Python, Node.js | Generate a thumbnail of an image. |
Multimedia | 220.video-processing | Python | Add a watermark to a video file and generate a GIF from it. |
Utilities | 311.compression | Python | Create a .zip archive of a group of files in storage and return it to the user for download. |
Utilities | 504.dna-visualization | Python | Create visualization data for a DNA sequence. |
Inference | 411.image-recognition | Python | Image recognition with ResNet and PyTorch. |
Scientific | 501.graph-pagerank | Python | PageRank implementation with igraph. |
Scientific | 502.graph-mst | Python | Minimum spanning tree (MST) implementation with igraph. |
Scientific | 503.graph-bfs | Python | Breadth-first search (BFS) implementation with igraph. |
Requirements:
- Docker (at least 19)
- Python 3.6+ with:
- pip
- venv
- `libcurl` and its headers must be available on your system to install `pycurl`
- Standard Linux tools and `zip` installed
... and that should be all.
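If you want to sanity-check these prerequisites before installing, a short script like the one below can help. This is only an illustrative sketch (not part of SeBS); it checks for `curl-config` as a proxy for the libcurl headers.

```python
import shutil
import sys

# Illustrative prerequisite check -- not part of SeBS itself.
assert sys.version_info >= (3, 6), "SeBS requires Python 3.6 or newer"

# curl-config ships with libcurl's development headers.
for tool in ("docker", "zip", "curl-config"):
    path = shutil.which(tool)
    print(f"{tool}: {path if path else 'MISSING'}")
```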
To install the benchmarks with support for all platforms, use:
./install.py --aws --azure --gcp --local
It will create a virtual environment in `python-virtualenv` and install the necessary Python dependencies and third-party dependencies. Then activate the new Python virtual environment, e.g., with `source python-virtualenv/bin/activate`. Now you can deploy serverless experiments :-)
Make sure that your Docker daemon is running and your user has sufficient permissions to use it. Otherwise you might see a lot of "Connection refused" and "Permission denied" errors when using SeBS.
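One quick way to confirm that the daemon is reachable with your current permissions is sketched below; it assumes the `docker` SDK for Python is available in your environment and is not a SeBS command.

```python
import docker  # assumption: the docker SDK for Python is installed

try:
    client = docker.from_env()
    client.ping()  # raises if the daemon is unreachable or access is denied
    print("Docker daemon is reachable.")
except Exception as exc:
    print(f"Docker is not usable: {exc}")
    print("Check that the daemon is running and your user is in the 'docker' group.")
```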
To verify the correctness of installation, you can use our regression testing.
SeBS has three basic commands: `benchmark`, `experiment`, and `local`.
For each command, you can pass the `--verbose` flag to increase the verbosity of the output.
By default, all scripts will create a cache in the directory `cache` to store code with dependencies and information on allocated cloud resources. Benchmarks will be rebuilt after a change in the source code is detected. To enforce redeployment of code and benchmark input, use the flags `--update-code` and `--update-storage`, respectively.
Note: the cache does not support updating the cloud region. If you want to deploy benchmarks to a new cloud region, use a new cache directory.
This command is used to build, deploy, and execute a serverless benchmark in the cloud.
The example below invokes the benchmark `110.dynamic-html` on AWS via the standard HTTP trigger.
./sebs.py benchmark invoke 110.dynamic-html test --config config/example.json --deployment aws --verbose
To configure your benchmark, change the settings in the config file or use command-line options. The full list is available by running `./sebs.py benchmark invoke --help`.
Additionally, we provide a regression option to execute all benchmarks on a given platform.
The example below demonstrates how to run the regression suite with the `test` input size on AWS.
./sebs.py benchmark regression test --config config/example.json --deployment aws
The regression can be executed on a single benchmark as well:
./sebs.py benchmark regression test --config config/example.json --deployment aws --benchmark-name 120.uploader
This command is used to execute the experiments described in the paper. The example below runs the experiment `perf-cost`:
./sebs.py experiment invoke perf-cost --config config/example.json --deployment aws
The configuration below specifies that the benchmark `110.dynamic-html` is executed 50 times with 50 concurrent invocations, for memory sizes of 128 MB and 256 MB, and that both cold and warm invocations are recorded.
"perf-cost": {
"benchmark": "110.dynamic-html",
"experiments": ["cold", "warm"],
"input-size": "test",
"repetitions": 50,
"concurrent-invocations": 50,
"memory-sizes": [128, 256]
}
To download cloud metrics and process the invocations into a .csv file with data, run the `process` construct:
./sebs.py experiment process perf-cost --config config/example.json --deployment aws
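Once the .csv file has been generated, it can be inspected with standard tooling. The pandas sketch below is only an example; the file name is a placeholder, so point it at wherever the processing step wrote its results.

```python
import pandas as pd

# Placeholder path -- replace with the .csv produced by the process step.
df = pd.read_csv("perf-cost-results.csv")

print(df.columns.tolist())  # which metrics were recorded
print(df.describe())        # summary statistics for the numeric columns
```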
In addition to the cloud deployment, we support launching benchmarks locally with the help of Minio storage.
To launch Docker containers serving a selected benchmark, use the following command:
./sebs.py local start 110.dynamic-html {input_size} out.json --config config/example.json --deployments 1
The output file `out.json` will contain information on the deployed containers and the endpoints that can be used to invoke functions:
{
"functions": [
{
"benchmark": "110.dynamic-html",
"hash": "5ff0657337d17b0cf6156f712f697610",
"instance_id": "e4797ae01c52ac54bfc22aece1e413130806165eea58c544b2a15c740ec7d75f",
"name": "110.dynamic-html-python-128",
"port": 9000,
"triggers": [],
"url": "172.17.0.3:9000"
}
],
"inputs": [
{
"random_len": 10,
"username": "testname"
}
]
}
In our example, we can use `curl` to invoke the function with the provided input:
curl 172.17.0.3:9000 --request POST --data '{"random_len": 10,"username": "testname"}' --header 'Content-Type: application/json'
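The invocation can also be scripted. The sketch below reads the endpoint and input from `out.json` and sends the same request with the `requests` library; `requests` is an extra dependency of this example, not a SeBS requirement.

```python
import json

import requests  # used only for this example

with open("out.json") as f:
    deployment = json.load(f)

url = "http://" + deployment["functions"][0]["url"]  # e.g., 172.17.0.3:9000
payload = deployment["inputs"][0]                    # e.g., {"random_len": 10, "username": "testname"}

response = requests.post(url, json=payload, timeout=30)
print(response.status_code, response.text)
```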
To stop containers, you can use the following command:
./sebs.py local stop out.json
The stopped containers won't be automatically removed unless the option `--remove-containers` was passed to the `start` command.
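If containers are left behind, they can be cleaned up manually. The sketch below removes the containers recorded in `out.json` using their `instance_id` values and the `docker` SDK for Python; this is a manual cleanup helper, not a SeBS command.

```python
import json

import docker  # assumption: the docker SDK for Python is installed

client = docker.from_env()
with open("out.json") as f:
    deployment = json.load(f)

# Remove every container that the local deployment recorded in out.json.
for function in deployment["functions"]:
    container = client.containers.get(function["instance_id"])
    container.remove(force=True)  # force=True stops the container if it is still running
```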
For details on experiments and methodology, please refer to our paper.
The `perf-cost` experiment invokes the given benchmark a selected number of times, measuring the time and cost of invocations. It supports `cold` and `warm` invocations with a selected number of concurrent invocations. In addition, to accurately measure the overheads of Azure Function Apps, we offer `burst` and `sequential` invocation types that do not distinguish between cold and warm startups.
Measures the distribution of network latency between the benchmark driver and the function instance.
The experiment performs a clock drift synchronization protocol to accurately measure the startup time of a function by comparing benchmark driver and function timestamps.
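To illustrate the idea behind comparing driver and function timestamps (this is a simplified sketch, not SeBS's actual implementation), an NTP-style offset estimate can be computed as below; `invoke_remote` is a hypothetical helper that returns the timestamp taken inside the function while handling the request.

```python
import time

def estimate_clock_offset(invoke_remote, samples=10):
    """Estimate the remote clock offset (NTP-style); a simplified sketch."""
    offsets = []
    for _ in range(samples):
        t0 = time.time()      # driver: request sent
        t1 = invoke_remote()  # function: request handled (remote clock)
        t2 = time.time()      # driver: response received
        # Assuming symmetric network delay, the remote clock is ahead by roughly:
        offsets.append(((t1 - t0) + (t1 - t2)) / 2.0)
    return sum(offsets) / len(offsets)
```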
Executes test functions multiple times, with varying size, memory, and runtime configurations, to determine how long function instances stay alive. The results help to estimate analytical models describing cold startups. Currently, this experiment is supported only on AWS.
- Marcin Copik (ETH Zurich) - main author.
- Michał Podstawski (Future Processing SA) - contributed graph and DNA benchmarks, and worked on Google Cloud support.
- Nico Graf (ETH Zurich) - contributed implementation of regression tests, bugfixes, and helped with testing and documentation.
- Kacper Janda, Mateusz Knapik, JmmCz (AGH University of Science and Technology) - jointly contributed Google Cloud support.
- Grzegorz Kwaśniewski (ETH Zurich) - worked on the modeling experiments.