-
Notifications
You must be signed in to change notification settings - Fork 239
Getting Started
Welcome to Gremlin. Gremlin is a domain specific programming language for graphs. Graphs are data structures where there exists vertices (i.e. dots, nodes) and edges (i.e. lines, arcs). Gremlin was designed to work with a type of graph called a property graph. Property graphs are defined, in detail, in the Defining a Property Graph section of this documentation. Gremlin’s syntax allows for the representation of complex paths.
This section will discuss the basic architecture of Gremlin, how to install Gremlin, and how to issue some simple statements into the Gremlin console.
There are many graph databases, frameworks, and packages out there. To make it easy to connect these to Gremlin, Gremlin relies on Blueprints and Pipes. Blueprints provides a set of interfaces for a fairly generic graph data model (see Defining a Property Graph) and has implementations of these interfaces for different graph management systems. With Blueprints, its possible to add new implementations and thus, use Gremlin over these new implementations. Pipes is a data flow framework that has support for graph processing. Finally, there are three ways for someone to use Gremlin — through the Gremlin console, through its Java JSR 223 interface (see Using Gremlin through Java), or through some other language that has bindings to the JVM.
Here are the steps to getting Gremlin running on your computer. You can either git clone
from http://github.com/tinkerpop/gremlin.git
and build the latest version of Gremlin (see Building Gremlin from Source) or you can download a pre-built version. For the latter, follow the steps below.
- Download the latest distribution of Gremlin from downloads.
- Unzip the downloaded
gremlin-xx.zip
andcd
to thegremlin-xx/
directory it creates. - Run
gremlin.sh
(unix) orgremin.bat
(windows) to start the Gremlin console.
marko$ ./gremlin.sh
\,,,/
(o o)
-----oOOo-(_)-oOOo-----
gremlin>
Thats it. The Gremlin interpreter is loaded and ready for commands. Gremlin uses Groovy as its host language. Please review Groovy for help on Groovy-related constructs. In short, Groovy is a superset of Java. What works in Java, works in Groovy. However, Groovy provides many neat shorthands to make it easier to interact with the Java API. Finally, Gremlin provides many neat shorthands to make it easier to express paths through a property graph.
gremlin> i = 'goodbye'
==>goodbye
gremlin> j = 'self'
==>self
gremlin> i + " " + j
==>goodbye self
gremlin> "${i} ${j}"
==>goodbye self
We will use a simple, 6 vertex/6 edge, graph that is provided with Gremlin and is fully diagrammed in Defining a Property Graph.
gremlin> g = TinkerGraphFactory.createTinkerGraph()
==>tinkergraph[vertices:6 edges:6]
gremlin> g.class
==>class com.tinkerpop.blueprints.pgm.impls.tg.TinkerGraph
gremlin> // lets look at all the vertices
gremlin> g.V
==>v[3]
==>v[2]
==>v[1]
==>v[6]
==>v[5]
==>v[4]
gremlin> // lets look at all the edges
gremlin> g.E
==>e[10][4-created->5]
==>e[7][1-knows->2]
==>e[9][1-created->3]
==>e[8][1-knows->4]
==>e[11][4-created->3]
==>e[12][6-created->3]
gremlin> v = g.v(1)
==>v[1]
gremlin> v.keys()
==>age
==>name
gremlin> v.values()
==>29
==>marko
gremlin> v.name + ' is ' + v.age + ' years old.'
==>marko is 29 years old.
gremlin> // lets do a traversal from the '1' marko vertex to its outgoing edges.
gremlin> v.outE
==>e[7][1-knows->2]
==>e[9][1-created->3]
==>e[8][1-knows->4]
gremlin> // what are the weight values on those edges
gremlin> v.outE.weight
==>0.5
==>0.4
==>1.0
gremlin> // lets only traverse to the head vertices of those edges
gremlin> // that have a weight that is less than 1.0
gremlin> v.outE{it.weight < 1.0}.inV
==>v[2]
==>v[3]
gremlin> // lets make these vertices a list
gremlin> list = []
gremlin> v.outE{it.weight < 1.0}.inV >> list
==>v[2]
==>v[3]
gremlin> list
==>v[2]
==>v[3]
gremlin> // lets get the property maps of these vertices.
gremlin> list.collect{ it.map() }
==>{name=vadas, age=27}
==>{name=lop, lang=java}
gremlin> // lets do it another way using spread notation
gremlin> list*.map()
==>{name=vadas, age=27}
==>{name=lop, lang=java}
gremlin> // lets get their incoming edge sets
gremlin> list.inE()
==>e[7][1-knows->2]
==>e[9][1-created->3]
==>e[11][4-created->3]
==>e[12][6-created->3]
gremlin> list.inE{it.label=='knows'}
==>e[7][1-knows->2]
gremlin> list.inE()[[label:'knows']]
==>e[7][1-knows->2]
gremlin> list.inE('knows').outV.name
==>marko
gremlin> // toString() the pipeline to see what its composed of
gremlin> println list.inE()[[label:'knows']].outV.name
[VertexEdgePipe<IN_EDGES>, LabelFilterPipe<NOT_EQUAL,knows>, EdgeVertexPipe<OUT_VERTEX>, PropertyPipe<name>]
==>null
What was presented is some of the basic functionality in Gremlin. The remainder of this documentation will discuss path construction and their use in performing graph algorithms on property graph structures.