The jmerkle_sequential library offers an extremely simple api to build, compare, and inspect Merkle tree data structures. In addition to these operations, jmerkle_sequential offers capability to marshal and unmarshal binary representations that can be consumed (or produced) in a technology agnostic manner (assuming consumers and producers understand the established binary protocol). Sequential is emphasized in the library's title because there are obvious opportunities for utilizing concurrency when altering as well as navigating Merkle tree structures (or any tree structure, for that matter). Those opportunities shall be explored at some point in the future and perhaps implemented in languages other than Java.
While the virtues of the Merkle tree data structure itself are beyond the scope of this readme, the reader is encouraged to partake in research of their own.
For exploratory purposes, here is a very brief list of Merkle tree data structure usage (please contact me if I'm egregiously missing something):
The (English) wikipedia entry:
Other interesting uses are limited only to one's imagination.
The jmerkle_sequential library currently has no dependencies, but makes use of apache maven to produce its jar.
To build from source, assuming maven has been installed, from the directory where jmerkle_sequential's pom.xml is located, simply invoke
mvn package
The build, compare, and inspect operations are each made available through public static methods on the JMerkle class:
- Building and altering a merkle tree is done via
JMerkle.alter(JMerkle t1, List<JMerkleAlterable> alterations);
In the case of creating a new structure, the t1 parameter must be null. The alter method produces either an abstract JMerkle object or null (in the case that there are no leaves).
- Inspecting a structure's contents is done via
JMerkle.allkeys(JMerkle t1);
The allkeys method produces a List<String> of t1's keys.
- Comparing two merkle tree structures is done via
JMerkle.diff(JMerkle t1, JMerkle t2);
The diff function produces a List<String> of keys representing the values (leaves) that are different between the two trees.
In addition to the above operations, the jmerkle_sequential library offers the capability to marshal and unmarshal JMerkle structures to and from the Java space. Marshaling results in a byte[] that can be utilized by other languages so long as their implementation understands the protocol (for example, merkle_parser provides an Erlang example that performs allkeys and diff operations on the raw binary).
The marshal and unmarshal operations are each made available through public static methods on the JMerkleMarshaler class:
- Marshaling a JMerkle is done via
JMerkleMarshaler.marshal(JMerkle t1);
The marshal method produces a byte[].
- Unmarshaling a marshaled value (originally from JMerkleMarshaler or elsewhere, assuming proper implementation) is done via
JMerkleMarshaler.unmarshal(byte[] treeBytes);
The unmarshal method produces an abstract JMerkle object.
Here's a rudimentary example of using Merkle tree comparisons to reconcile data on a mobile device to that of a central server:
import java.io.FileOutputStream;
import java.io.Serializable;
import java.util.ArrayList;
import java.util.List;
import jmerkle.sequential.JMerkle;
import jmerkle.sequential.JMerkleAlterable;
import jmerkle.sequential.JMerkleMarshaler;
/**
* Shows a simple example of how a mobile device might sync its
* data with a central datastore through the use of Merkle tree
* comparisons.
*/
public class HelloJMerkle {
private static final String TMP_DIR = System.getProperty("java.io.tmpdir");
public static void main(String[] args) throws Exception {
//create the initial JMerkle structure, representative of an inventory:
List<JMerkleAlterable> initialInventoryAlterations = initialInventory();
JMerkle initialInventoryTree = JMerkle.alter(null, initialInventoryAlterations);
//mimic storage of the initialInventoryTree by marshaling it:
byte[] marshaledInitialInventory = JMerkleMarshaler.marshal(initialInventoryTree);
////////////////////////////////////////////////////////////////////
//uncomment below to store the marshaled "initial inventory" tree to
//the java.io.tmpdir (to try merkle_parser, etc):
//FileOutputStream fos1 = new FileOutputStream(TMP_DIR + "/InitialInventoryTree.out");
//fos1.write(marshaledInitialInventory);
////////////////////////////////////////////////////////////////////
//mimic retrieval of the initialInventoryTree by unmarshaling the stored value:
JMerkle unmarshaledInitialInventory = JMerkleMarshaler.unmarshal(marshaledInitialInventory);
//client requests a bootstrap sync with the central data server; this is done
//either by getting all the keys on the current inventory or by diff'ing an empty
//(null) value to the current inventory. we'll use the allkeys method:
List<String> allkeysBootstrap = JMerkle.allkeys(unmarshaledInitialInventory);
System.out.println("mobile device needs bootstrap data for the following inventory Widgets:");
System.out.println(allkeysBootstrap);
//time passes, the mobile device goes offline... meanwhile the inventory changes w/ three updates:
//a brand new Widget:
Widget brandNewWidget = new Widget("brand new!", 99);
//sold 1 widget2:
Widget widget2 = new Widget("widget2", 2);
//and all widget4's were sold:
Widget widget4 = new Widget("widget4", null);
//batch up the alterations:
List<JMerkleAlterable> alterations = new ArrayList<JMerkleAlterable>(3);
alterations.add(brandNewWidget);
alterations.add(widget2);
alterations.add(widget4);
//and merge them into the initialInventoryTree:
JMerkle alteredInventoryTree = JMerkle.alter(initialInventoryTree, alterations);
////////////////////////////////////////////////////////////////////
//uncomment below to store the marshaled "altered inventory" tree to
//the java.io.tmpdir (to try merkle_parser, etc):
//FileOutputStream fos2 = new FileOutputStream(TMP_DIR + "/AlteredInventoryTree.out");
//fos2.write(JMerkleMarshaler.marshal(alteredInventoryTree));
////////////////////////////////////////////////////////////////////
System.out.println("after selling all of the widget4's, 1 widget2, and adding 99 \"brand new!\" widgets, the current inventory consists of:");
System.out.println(JMerkle.allkeys(alteredInventoryTree));
//the mobile device comes back online, requests an update to sync its data... this likely wouldn't involve
//passing the entire JMerkle structure, only a minimal amount of data to identify the tree used when the
//mobile device was bootstrapped:
List<String> diff = JMerkle.diff(unmarshaledInitialInventory, alteredInventoryTree);
System.out.println("mobile device needs to sync only the following values to be up to date on ALL widgets and their counts:");
System.out.println(diff); //notice, the device will be instructed that widget4 is no longer in stock.
//etc., etc.
}
private static List<JMerkleAlterable> initialInventory(){
List<JMerkleAlterable> initialInventory = new ArrayList<JMerkleAlterable>(5);
for(int i=0; i<5; i++) {
initialInventory.add(new Widget("widget"+i, i+1));
}
return initialInventory;
}
/**
* Barebones JMerkleAlterable implementation.
*/
private static class Widget implements JMerkleAlterable {
private String productId;
private Integer count;
public Widget(String productId, Integer count) {
this.productId = productId;
this.count = count;
}
@Override
public String getKey() {
return productId;
}
@Override
public Serializable getValue() {
return count;
}
}
}
The output from the System.out.println calls:
mobile device needs bootstrap data for the following inventory Widgets:
[widget0, widget3, widget1, widget2, widget4]
after selling all of the widget4's, 1 widget2, and adding 99 "brand new!" widgets, the current inventory consists of:
[widget0, widget3, widget1, widget2, brand new!]
mobile device needs to sync only the following values to be up to date on ALL widgets and their counts:
[widget2, widget4, brand new!]