StreamBench

StreamBench is a project to measure the performance of popular streaming engines using Yahoo Streaming Benchmark.

Overview

We compare the performance of an efficient stream processing engine designed for single servers, SABER, with that achieved by popular distributed stream processing systems, Apache Spark and Apache Flink. We also compare the results to that by StreamBox, another recently proposed single-server design that emphases out-of-order processing of data. Based on our results, we argue that a single multicore server can provide better throughput than a multi-node cluster for many streaming applications. This opens an opportunity to cut down system complexity and operational costs by replacing cluster-based stream processing systems with (potentially replicated) single server deployments.

This repository contains code for running the Yahoo Streaming Benchmark in SABER, Spark Streaming, Apache Flink and StreamBox. For Spark and Flink, we follow the approach from previous blogposts by Databricks and DataArtisans. We provide a script for each of these engines to setup and run the benchmark on a single node. The code can be configured to run on a distributed deployment as well.

Benchmark Outline

The Yahoo Streaming Benchmark was designed to emulate an advertisement streaming application. It has a streaming query with four operators: filter, project, join (with relational data) and aggregate (a windowed count).

How to run the code

For every engine, the script provided installs, builds and runs the engines as well as the streaming query.

Credits

StreamBench is brought to you by George Theodorakis, Panagiotis Garefalakis, Alexandros Koliousis, Holger Pirk, Peter Pietzuch

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
flink		flink
saber		saber
spark		spark
streambox		streambox
yahoo-streaming-benchmark		yahoo-streaming-benchmark
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

StreamBench

Overview

Benchmark Outline

How to run the code

Credits

About

Releases

Packages

Contributors 2

Languages

License

lsds/StreamBench

Folders and files

Latest commit

History

Repository files navigation

StreamBench

Overview

Benchmark Outline

How to run the code

Credits

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages