H2O API

High-performance HTTP API for H2O-Really.

Build

mvn package

Run

Set environment variables PORT and DATABASE_URL, e.g.:

export DATABASE_URL=postgresql://you:yourpass@localhost:5432/h2o_really
export PORT=5000

Jetty, Simple, Spark, and Undertow are supported.

Jetty:

java -cp target/classes:"target/dependency/*" JettyServer

Simple:

java -cp target/classes:"target/dependency/*" SimpleServer

Spark:

java -cp target/classes:"target/dependency/*" SparkServer

Undertow:

java -cp target/classes:"target/dependency/*" UndertowServer

Browse

For example:

http://127.0.0.1:5000/?compact=true&page_size=2&page=1

Fast?

I have done some basic benchmarking with JMeter to see how we perform against the Python version.

For Python, I used:

Django 1.6 beta-2
Bjoern 1.3.4
Gunicorn 17.5

Java:

Jetty 9.0.5.v20130815
Simple Framework 5.1.5
Spark 1.1 (internally using Jetty 9.0.2.v20130417)
Undertow 1.0.0.Beta1

The workload was 5 threads with a 1 second ramp-up and a loop count of 200 (giving an even 1000 requests for each server -- see benchmark/basic_test_plan.jmx for full details). I had the Servers configured as follows (scripts for running Bjoern and Gunicorn are provided in benchmark):

Django dev server on port 8000 (baseline)
Bjoern (8 workers) on 7777
Gunicorn (8 workers) on 8888
Simple on 5000
Jetty on 5050
Spark on 5555
Undertow on 5550

each tested in isolation.

For small responses, there isn't a huge difference (~30 req/s for Python vs ~80 req/s for Java). However, for larger responses (e.g. page_size=5000), Python totally falls over and starts raising errors, but Java remains steady.

For example, even at page_size=5000 on my laptop all Java implementations can sustain ~50 req/s. Under those workloads, both Gunicorn and Bjoern dropped down to ~1.5 req/s with an error rate of ~25%.

At the extreme, a full export of all ~130k records via the API (in compact form; 16MB or so) takes ~1s with Java, and nearly a full minute in Python.

In all tests, all Java implementations showed similar numbers, suggesting that the performance bottleneck is elsewhere. Whatever the limiting factor, it's not much of a limit because they can each serve 5000 records with an average response time of < 100ms and get through > 50 requests per second.

TODO

~~API response meta (total, previous / next, etc)~~
~~database connection pooling~~
support for full measurement details
~~benchmarks~~

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
benchmark		benchmark
src/main/java		src/main/java
.gitignore		.gitignore
LICENSE		LICENSE
Procfile		Procfile
README.md		README.md
pom.xml		pom.xml
system.properties		system.properties

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

H2O API

Build

Run

Browse

Fast?

TODO

About

Releases

Packages

Contributors 2

Languages

License

openwater/h2o-really-api

Folders and files

Latest commit

History

Repository files navigation

H2O API

Build

Run

Browse

Fast?

TODO

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages