Questions tagged [tdb]

TDB is an open source RDF database developed and maintained by the Apache Jena project

TDB is an open source RDF database developed and maintained by the Apache Jena project. It is a Java based embedded database that may be exposed over HTTP using the Fuseki Server also from the Apache Jena project.

Features

It's features includes the following:

Persisting RDF Triples or Quads
Full SPARQL execution
Write ahead logging to provide serializable transactions and fault tolerance

Scalability and Performance

TDB typically scales up to datasets of a few hundred million triples or quads.

Compared to commercial alternatives TDB is often less scalable compared primarily because while it is persisted to disk it is fundamentally designed as an in-memory database. It relays heavily on RAM caches and memory mapped files so scalability tends to be limited by machine RAM.

Also TDB does not have any option of a clustered mode so cannot be scaled horizontally without additional technologies (e.g. manually created replicas and load balancers)

TDB is typically included in the Berlin SPARQL Benchmark Results for those interested in comparative performance data.

127 questions

vote

0 answers

How to configure fuseki jena to use a reasoner and TDB + tdb:UnionDefaultGraph

Context i am new to fuseki, i am currently using it in a project, the main idea is: start the server with a pre-defined ontology (turtle file) + a reasoner I process files with python, and for each file, I generate an rdf graph that I put in a…

jena fuseki tdb

asked Nov 03 '20 at 15:35

kam13

vote

1 answer

Does anyone know how to get the tdb2.dump command to actually do anything

I'm trying to dump a jena database as triples. There seems to be a command that sounds perfectly suited to the task: tdb2.dump jena@debian-clean:~$ ./apache-jena-3.8.0/bin/tdb2.tdbdump --help tdbdump : Write a dataset to stdout (defaults to…

jena tdb

asked Sep 08 '20 at 10:17

Ben Hillier

2,051
1
8
14

vote

1 answer

How to Build a Fuseki TDB over a POST HTTP Request with Assembler.ttl? (How to send File over POST request)

https://jena.apache.org/documentation/fuseki2/fuseki-server-protocol.html Reading the documentation, we see that we are able to send a POST request containing assembler .ttl definitions to Fuseki Endpoint. Although, when trying that, my application…

http post jena fuseki tdb

asked Feb 11 '20 at 15:09

Lucas Resende

vote

0 answers

JENA TDB load progression using Java API

I am wondering if it is possible to get load speed information when using the Java API. The code I have to load "large" files (few gb) is this: try (InputStream in = new FileInputStream(arguments.input)) { RDFParser.create() …

java jena tdb

asked Dec 04 '19 at 08:11

Jasper

vote

1 answer

What does Apache Jena's tdb2.tdbcompact do?

I have read the description of this command, but still don't know what it compresses, and why I should use it? BTW: the subdirectory Data-NNN is for previous and current versions of the databases, which means it can only have 1000 versions of the…

jena tdb knowledge-graph

asked Apr 19 '19 at 03:27

Gao

vote

1 answer

What indices does jena TDB2 use?

I am trying to find out what indices TDB2 builds. I found out by the code that it uses B+ trees to store them on disc but I didn't get what they contain and how they are used. So my detailed questions are: For which collation order of RDF triples…

jena indices tdb

asked Apr 10 '19 at 11:54

A. Bigerl

vote

1 answer

Is jena's fuseki not compactible with tdb2.tdbloader?

I have a requirement that incremental update tdb files daily. So I'm using tdb2.tdbloader to do the job with a generated N-Triples file. But when the job is done, the data directory which contains tdb data have a new directory called "data-0001" or…

rdf jena triplestore tdb tdbloader

asked Jun 07 '18 at 07:55

Gao

vote

1 answer

Jena TDB Dataset begin() fails

I want to use Jena TDB in a project. This is what I added in my POM: org.apache.jena apache-jena-libs 3.7.0 pom These are my…

maven jena semantic-web tdb

asked Jun 07 '18 at 07:14

Janothan

vote

1 answer

SPARQL : OFFSET whithout ORDER BY to get all results of a query?

I have a large TDB dataset (cf. this post Fuseki config for 2 datasets + text index : how to use turtle files? ) and I need to extract data in order to make a "subgraph" and import it in fuseki. I found that OFFSET could be a solution to get all…

sparql jena tdb

asked Jan 29 '18 at 11:39

vvffl

vote

1 answer

Fuseki config for 2 datasets + text index : how to use turtle files?

I'm new to fuseki and want to use 2 TDB datasets for our project : a small one for our own data, and a large one (168 M triples, imported data from http://data.bnf.fr). We need to index the data because SPARQL queries using "FILTER(CONTAINS())"…

lucene jena fuseki tdb easyrdf

asked Jan 18 '18 at 10:24

vvffl

vote

2 answers

Java OutOfMemoryError in apache Jena using TDB

Hi I've been using Jena for a project and now I am trying to query a Graph for storage in plain files for batch processing with Hadoop. I open a TDB Dataset and then I query by pages with LIMIT and OFFSET. I output files with 100000 triplets per…

java graph rdf jena tdb

asked Sep 01 '17 at 01:29

Nord

vote

0 answers

Fuseki 2 low performance

I have stored 80GB triples using FUSEKI 2 (latest version), which is actually a DbPedia dump. My problem happens when I run the following query. It is taking more than 5 minutes to return whereas at Snorql it takes less than one second. SELECT …

performance sparql dbpedia fuseki tdb

asked Mar 27 '17 at 09:54

user7806

vote

1 answer

Can I use Apache Jena and persist the ontology using Apache Solr

We have a cloud base Java application that uses Oracle DB and Apache Solr for document indexing/searching. I need to implement an ontology and I intend to use Apache Jena. It's an uncharted territory for me. According to the docs, seems that using…

solr apache-jena tdb

asked Feb 24 '17 at 02:25

Jimo

vote

1 answer

Does Jena TDB load all data into memory every time?

I am a newbie of Jena. I try to deal with the Yoga dataset using TDB. The dataset is about 200M and everytime I run the same query, it will have to take about 5 minutes to load the data then give out the results. I am wondering do I misunderstand…

sparql jena tdb

asked Feb 16 '17 at 01:15

Charlotte

vote

1 answer

command line tdbquery with text index

I trying to run a text search query with Jena via command line. tdbquery --desc textsearch.ttl --query search.rq The query return empty results with the messages: 17:23:46 WARN TextQueryPF :: Failed to find the text index : tried…

sparql jena apache-jena tdb

asked Dec 20 '16 at 16:29

Samur Araujo

Prev 1 2

…

8 9 Next