Skip to content

Getting Started

okram edited this page Feb 15, 2011 · 77 revisions

Welcome to Gremlin. Gremlin is a domain specific programming language for graphs. Graphs are data structures where there exists vertices (i.e. dots, nodes) and edges (i.e. lines, arcs). Gremlin was designed to work with a type of graph called a property graph. Property graphs are defined, in detail, in the Defining a Property Graph section of this documentation. Gremlin’s syntax allows for the representation of complex paths.

This section will discuss the basic architecture of Gremlin, how to install Gremlin, and how to issue some simple statements into the Gremlin console.

  1. Basic Gremlin Architecture
  2. Installing Gremlin
  3. Using the Gremlin Console

Basic Gremlin Architecture

There are many graph databases, frameworks, and packages out there. To make it easy to connect these to Gremlin, Gremlin relies on Blueprints and Pipes. Blueprints provides a set of interfaces for a fairly generic graph data model (see Defining a Property Graph) and has implementations of these interfaces for different graph management systems. With Blueprints, its possible to add new implementations and thus, use Gremlin over these new implementations. Pipes is a data flow framework that has support for graph processing. Finally, there are three ways for someone to use Gremlin — through the Gremlin console, through its Java JSR 223 interface (see Using Gremlin through Java), or through some other language that has bindings to the JVM.

Installing Gremlin

Here are the steps to getting Gremlin running on your computer. You can either git clone from http://github.com/tinkerpop/gremlin.git and build the latest version of Gremlin (see Building Gremlin from Source) or you can download a pre-built version. For the latter, follow the steps below.

  1. Download the latest distribution of Gremlin from downloads.
  2. Unzip the downloaded gremlin-xx.zip and cd to the gremlin-xx/ directory it creates.
  3. Run gremlin.sh (unix) or gremin.bat (windows) to start the Gremlin console.

Using the Gremlin Console

marko$ ./gremlin.sh 
         \,,,/
         (o o)
-----oOOo-(_)-oOOo-----
gremlin>

Thats it. The Gremlin interpreter is loaded and ready for commands. Gremlin uses Groovy as its host language. Please review Groovy for help on Groovy-related constructs. In short, Groovy is a superset of Java. What works in Java, works in Groovy. However, Groovy provides many neat shorthands to make it easier to interact with the Java API. Finally, Gremlin provides many neat shorthands to make it easier to express paths through a property graph.

gremlin> i = 'goodbye'
==>goodbye
gremlin> j = 'self'
==>self
gremlin> i + " " + j
==>goodbye self
gremlin> "${i} ${j}"
==>goodbye self

We will use a simple, 6 vertex/6 edge, graph that is provided with Gremlin and is fully diagrammed in Defining a Property Graph.

gremlin> g = TinkerGraphFactory.createTinkerGraph()
==>tinkergraph[vertices:6 edges:6]
gremlin> g.class
==>class com.tinkerpop.blueprints.pgm.impls.tg.TinkerGraph
gremlin> // lets look at all the vertices
gremlin> g.V
==>v[3]
==>v[2]
==>v[1]
==>v[6]
==>v[5]
==>v[4]
gremlin> // lets look at all the edges
gremlin> g.E
==>e[10][4-created->5]
==>e[7][1-knows->2]
==>e[9][1-created->3]
==>e[8][1-knows->4]
==>e[11][4-created->3]
==>e[12][6-created->3]
gremlin> v = g.v(1)
==>v[1]
gremlin> v.keys()
==>age
==>name
gremlin> v.values()
==>29
==>marko
gremlin> v.name + ' is ' + v.age + ' years old.'
==>marko is 29 years old.
gremlin> // lets do a traversal from the '1' marko vertex to its outgoing edges.
gremlin> v.outE
==>e[7][1-knows->2]
==>e[9][1-created->3]
==>e[8][1-knows->4]
gremlin> // what are the weight values on those edges
gremlin> v.outE.weight
==>0.5
==>0.4
==>1.0
gremlin> // lets only traverse to the head vertices of those edges
gremlin> // that have a weight that is less than 1.0
gremlin> v.outE{it.weight < 1.0}.inV
==>v[2]
==>v[3]
gremlin> // lets make these vertices a list
gremlin> list = []                             
gremlin> v.outE{it.weight < 1.0}.inV >> list
==>v[2]
==>v[3]
gremlin> list
==>v[2]
==>v[3]
gremlin> // lets get the property maps of these vertices.
gremlin> list.collect{ it.map() }
==>{name=vadas, age=27}
==>{name=lop, lang=java}
gremlin> // lets do it another way using spread notation
gremlin> list*.map()
==>{name=vadas, age=27}
==>{name=lop, lang=java}
gremlin> // lets get their incoming edge sets
gremlin> list.inE()       
==>e[7][1-knows->2]
==>e[9][1-created->3]
==>e[11][4-created->3]
==>e[12][6-created->3]
gremlin> list.inE{it.label=='knows'}     
==>e[7][1-knows->2]
gremlin> list.inE()[[label:'knows']]
==>e[7][1-knows->2]
gremlin> list.inE('knows').outV.name
==>marko
gremlin> // toString() the pipeline to see what its composed of
gremlin> println list.inE()[[label:'knows']].outV.name
[VertexEdgePipe<IN_EDGES>, LabelFilterPipe<NOT_EQUAL,knows>, EdgeVertexPipe<OUT_VERTEX>, PropertyPipe<name>]
==>null

What was presented is some of the basic functionality in Gremlin. The remainder of this documentation will discuss path construction and their use in performing graph algorithms on property graph structures.