Name	Name	Last commit message	Last commit date
parent directory ..
benches	benches
ruby	ruby
src	src
static	static
Cargo.toml	Cargo.toml
README.md	README.md

foolsgold

foolsgold crate is an integration test for the nemesis and mruby crates. foolsgold uses nemesis to serve a Ruby Rack application that is partially implemented in Rust and backed by an embedded mruby interpreter.

Nemesis is similar to Thin in a traditional Ruby web stack.

Components

foolsgold consists of three parts: a Rust implementation of the foolsgold Ruby gem, a collection of static assets, and a launcher for a Nemesis server.

The nemesis launcher pulls all of these components together. A simplified version of the launcher is:

Builder::default()
    .add_mount(
        Mount::from_rackup("foolsgold", foolsgold::RACKUP, "/fools-gold")
            .with_init(Box::new(|interp| {
                foolsgold::init(interp)?;
                interp.eval("require 'foolsgold'")?;
                Ok(())
            })),
    )
    .add_static_assets(Assets::all()?)
    .serve()

Execution Modes

Unlike Thin, Nemesis allows:

Mounting multiple Rack apps at unique route prefixes.
Declaring interpreter isolation level per Rack app.

foolsgold mounts two copies of the Rack app:

http://127.0.0.1:8000/fools-gold/shared-nothing creates a single use interpreter for each request.
http://127.0.0.1:8000/fools-gold/prefork creates an interpreter for each HTTP worker. Nemesis recycles the interpreter after it has served 150 requests.

App

The FoolsGold::Middleware::Request Rack adapter injects a RequestContext instance into the Rack environment. The RequestContext is implemented in pure Rust. The context closes over a static AtomicI64 request counter, making global state available across mruby interpreter instances. The counter is readable and writable via these APIs:

# retrieve
env[FoolsGold::CONTEXT].metrics.total_requests.get
# increment
env[FoolsGold::CONTEXT].metrics.total_requests.inc

Performance

The pieces of FoolsGold that are implemented in Rust vs Ruby are not performance critical—a mutex and a call to /dev/urandom—so Rust is not expected to provide any speedup for the Ruby sources.

When the webservers are configured similarly:

The shared nothing Rust implementation is about twice as slow as the Ruby implementation.
The prefork Rust implementation performs similarly to the Ruby implementation with better tail latency likely due to Tokio being better than EventMachine.

The application code run in the rackup file is a close to trivial HTML template. With more complex application logic, the startup costs of the shared nothing interpreter will not dominate response time as much.

Setup

Thin uses Ruby 2.6.3. Nemesis uses mruby 2.0.1 @ c078758.

Benches compare Thin with threads vs. Nemesis with Rocket and per worker interpreters.

Benchmarks compare the webservers with concurrency levels of 1, 16, and 64 requests. Each bench iteration makes exactly concurrency level simultaneous requests and waits for them all to complete successfully.

Run

To run benches, first launch foolsgold with Nemesis:

cargo run --release --bin foolsgold

Also launch foolsgold with Thin:

./scripts/foolsgold-thin.sh

Then run the bench for a single concurrency level with:

RUST_BACKTRACE=1 cargo bench --benches -p foolsgold -- "concurrency(64)"

NOTE: It is important to filter tests by concurrency level because each test may consume all of the ephmeral ports pointing to the server backend bind address. On macOS, you should wait at least 15 seconds between bench runs to clear all sockets in TIME_WAIT state.

Results

1 Concurrent Request

nemesis prefork with concurrency(1)
                        time:   [1.3336 ms 1.3717 ms 1.4199 ms]
                        change: [+0.2427% +3.9400% +8.1666%] (p = 0.07 > 0.05)
                        No change in performance detected.
Found 10 outliers among 100 measurements (10.00%)
  3 (3.00%) high mild
  7 (7.00%) high severe

thin threaded with concurrency(1)
                        time:   [904.58 us 914.94 us 926.14 us]
                        change: [-3.4620% -0.8968% +1.4850%] (p = 0.50 > 0.05)
                        No change in performance detected.
Found 4 outliers among 100 measurements (4.00%)
  3 (3.00%) high mild
  1 (1.00%) high severe

16 Concurrent Requests

nemesis prefork with concurrency(16)
                        time:   [5.7928 ms 5.9293 ms 6.0884 ms]
                        change: [-4.3119% -0.5461% +3.2678%] (p = 0.78 > 0.05)
                        No change in performance detected.
Found 6 outliers among 40 measurements (15.00%)
  1 (2.50%) low mild
  3 (7.50%) high mild
  2 (5.00%) high severe

thin threaded with concurrency(16)
                        time:   [5.4166 ms 5.6005 ms 5.8231 ms]
                        change: [-2.9197% +1.2616% +5.7665%] (p = 0.59 > 0.05)
                        No change in performance detected.
Found 3 outliers among 40 measurements (7.50%)
  2 (5.00%) high mild
  1 (2.50%) high severe

64 Concurrent Requests

nemesis prefork with concurrency(64)
                        time:   [18.297 ms 19.179 ms 20.120 ms]
                        change: [-88.892% -72.777% +1.7811%] (p = 0.36 > 0.05)
                        No change in performance detected.
Found 3 outliers among 20 measurements (15.00%)
  2 (10.00%) high mild
  1 (5.00%) high severe

thin threaded with concurrency(64)
                        time:   [20.329 ms 21.134 ms 22.039 ms]
                        change: [-6.7412% -1.2447% +4.1031%] (p = 0.68 > 0.05)
                        No change in performance detected.
Found 1 outliers among 20 measurements (5.00%)
  1 (5.00%) high mild

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

foolsgold

foolsgold

README.md

foolsgold

Components

Execution Modes

App

Performance

Setup

Run

Results

1 Concurrent Request

16 Concurrent Requests

64 Concurrent Requests

Files

foolsgold

Directory actions

More options

Directory actions

More options

Latest commit

History

foolsgold

Folders and files

parent directory

README.md

foolsgold

Components

Execution Modes

App

Performance

Setup

Run

Results

1 Concurrent Request

16 Concurrent Requests

64 Concurrent Requests