9

I want to use Java API to manipulate graph on a remote server, the server actually hosts in localhost. The code I use to connect server is:

JanusGraphFactory.Builder b = JanusGraphFactory.build();
b.set("hosts", "[localhost]");
JanusGraph graph = b.open();

But after I run the program, it throws exception like this:

Exception in thread "main" java.lang.IllegalStateException: Need to set configuration value: root.storage.backend

So how can I connect to a remote JanusGraph server using Java API?

Misha Brukman
  • 10,866
  • 4
  • 54
  • 71
Gao
  • 690
  • 5
  • 14

6 Answers6

9

The documentation I've found suggests to create an EmtpyGraph and get a remote traversal from that one:

EmptyGraph.instance().traversal().withRemote(config);

where config is your a configuration object with the remote properties, e.g:

    config.setProperty("clusterConfiguration.hosts", HOST);
    config.setProperty("clusterConfiguration.port", PORT);
    config.setProperty("clusterConfiguration.serializer.className", "org.apache.tinkerpop.gremlin.driver.ser.GryoMessageSerializerV1d0");
    config.setProperty("clusterConfiguration.serializer.config.ioRegistries", ioRegistries); // (e.g. [ org.janusgraph.graphdb.tinkerpop.JanusGraphIoRegistry) ]
    config.setProperty("gremlin.remote.remoteConnectionClass", "org.apache.tinkerpop.gremlin.driver.remote.DriverRemoteConnection");
    config.setProperty("gremlin.remote.driver.sourceName", "g");

However, I've run into issues using JanusGraph specific features (for example committing a transaction) with this since the graph instance is the EmptyGraph itself, not a JanusGraph. So, try:

GraphTraversalSource g = JanusGraphFactory.open("inmemory").traversal().withRemote(config);

This will give you a traversal source to the remote gremlin-server, and you can do things like g.addV("vertexLabel")....; , g.tx().commit(); and so on.

Niklas
  • 274
  • 1
  • 6
  • 1
    Can you give an example on how to pass the values for `ioRegistries`? – drum Mar 14 '19 at 15:44
  • 1
    basically there is a bug and ioRegistries value doesn't work if you provide it with a List of only 1 String, I'm currently working with a workarround with Arrays.asList("org.janusgraph.graphdb.tinkerpop.JanusGraphIoRegistry","org.janusgraph.graphdb.tinkerpop.JanusGraphIoRegistry"), but i did not tested fully this. – Samy Elaiassi Mar 26 '19 at 17:16
  • how can I connect to particular graph, which is already available in server? – Sagar Vaghela Mar 13 '20 at 18:22
4

Here is a simple way:

graph = JanusGraphFactory.open("conf/janusgraph-cassandra-solr.properties") juno = graph.addVertex() //Automatically opens a new transaction juno.property("name", "juno") graph.tx().commit() //Commits transaction

Janus User Doc

Janus uses an existing storage solution like cassandra, hbase, berkelydb to store data.

You can connect by 2 ways: 1 - Connect to the remote gremlin server and execute traversal/queries remotely. This is by using the Cluster and EmptyGraph of tinkerpop gremlin 2 - Connect directly from your application using the technique i suggested int his post above.

Pros/Cons of connecting to the remote gremlin server

Pros

  • The server has much more control and all the queries are centralized.
  • Since every one is running traversal/queries via the remote gremlin server, all are transactionally protected. The remote gremlin server runs your traversal/queries by default in a transaction.
  • Central strategy management
  • Central schema management

Cons

  • Tough to do a manual transaction management
  • You have to use groovy script as string and send it to remove (Cluster submit) for transactional execution of your code.

BY connecting directly from your client code (avoiding the remote connection) you get much more control.

Also, you can use the JanusGraph instance directly in your code, which is still gremlin complaint, to take full advantage of the JanusGraph API's.

  • 1
    How would one go about doing "manual transaction management" in a "remote gremlin server and execute traversal/queries remotely way"? I can't seem to find any example anywhere. Would you please kindly show an example? thanks – Tin Ng Jul 23 '18 at 02:26
  • Link is dead (404) – Ben Fulton Apr 03 '20 at 12:49
  • Hello, can you provide an example of how you make transaction with remote JanusGraph if you have, please, can't make work properly rollback on remote – Serhii Zadorozhnyi Jul 01 '20 at 05:55
2

Try this:

JanusGraphFactory.Builder builder = JanusGraphFactory.build().
            set("storage.hostname", "localhost").
            set('storage.backend', 'cassandra') //or whatever you are using as backend
builder.open();
Mansha Chuttani
  • 351
  • 3
  • 7
  • How can I connect to it using gremlin's standard API? – Gao Aug 14 '17 at 13:47
  • For that you need to configure your remote.yaml file and provide the following: `hosts: [server-address] port: [port] serializer: { className: org.apache.tinkerpop.gremlin.driver.ser.GryoMessageSerializerV1d0, config: { serializeResultToString: true }} ` Then you can use following code to connect: `import org.apache.tinkerpop.gremlin.driver.Client; import org.apache.tinkerpop.gremlin.driver.Cluster; Cluster cluster = Cluster.open("conf/remote.yaml"); Client client = cluster.connect(); client.submit("graph.addVertex(T.label,'x','name','tom')");` – Mansha Chuttani Aug 16 '17 at 06:13
  • Giving below error. Gremlin Server must be configured to use the JanusGraphManager. – Sagar Vaghela Mar 13 '20 at 03:45
1

Check RemoteGraph in janusgraph examples.

You can find under Class RemoteGraphApp how you can connect to remote janusgraph server (Cluster).

 conf = new PropertiesConfiguration(propFileName);

    // using the remote driver for schema
    try {
        cluster = Cluster.open(conf.getString("gremlin.remote.driver.clusterFile"));
        client = cluster.connect();
    } catch (Exception e) {
        throw new ConfigurationException(e);
    }

    // using the remote graph for queries
    graph = EmptyGraph.instance();
    g = graph.traversal().withRemote(conf);

where the cluster config file contains:

 hosts: [127.0.0.1]
 port: 8182
 serializer: {
   className: org.apache.tinkerpop.gremlin.driver.ser.GryoMessageSerializerV3d0,
   config: {
    ioRegistries: [org.janusgraph.graphdb.tinkerpop.JanusGraphIoRegistry]
   } 
 }
Ali Aboud
  • 179
  • 9
1

Most of the answers above are outdated. right now , to connect to janusgraph or any tinkerpop complaint graph database over remote we need to make cluster object. Using this cluster object, we can get graphTraversalSource. Both these objects need to be closed when the programs ends to release the connection pool.

    private static Cluster cluster;
    private static GraphTraversalSource gts;

    private static void init() {
        cluster = Cluster.build()
                .addContactPoint(uri)
                .port(port)
                .serializer(Serializers.GRYO_V3D0)
                .maxInProcessPerConnection(32)
                .maxSimultaneousUsagePerConnection(32)
                .maxContentLength(10000000)
                .maxWaitForConnection(10)
                .minConnectionPoolSize(poolSize)
                .maxConnectionPoolSize(poolSize+5)
                .create();

        gts = AnonymousTraversalSource
                .traversal()
                .withRemote(DriverRemoteConnection.using(cluster));
    }


    public GraphTraversalSource getConnection() {
        return gts;
    }

    public static void finalise() throws Exception {
        gts.close();
        cluster.close();
    }

GraphTraversalSource is a thread safe object and should ideally be a singelton. Each graphTraversalSource object keeps its connection pool within its context.

Nischal Kumar
  • 362
  • 4
  • 10
0

Use the following code:

        JanusGraphFactory.Builder config = JanusGraphFactory.build();
        config.set("storage.backend", "cassandrathrift");
        config.set("storage.cassandra.keyspace", "graph1");
        config.set("storage.hostname", "127.0.0.1");

        JanusGraph graph = config.open();
KayV
  • 9,560
  • 8
  • 68
  • 111