Questions tagged [distributed-database]

Anything related to distributed databases and the techniques and the tools used to manage them. A distributed database is a database whose information is not stored in a single physical location, but it is spread over various devices, often placed far apart.

Anything related to distributed databases and the techniques and the tools used to manage them. A distributed database is a database whose information is not stored in a single physical location, but it is spread over various devices, often placed far apart.

165 questions
190
votes
15 answers

What scalability problems have you encountered using a NoSQL data store?

NoSQL refers to non-relational data stores that break with the history of relational databases and ACID guarantees. Popular open source NoSQL data stores include: Cassandra (tabular, written in Java, used by Cisco, WebEx, Digg, Facebook, IBM,…
knorv
  • 45,461
  • 71
  • 205
  • 289
16
votes
10 answers

Need a distributed key-value lookup system

I need a way to do key-value lookups across (potentially) hundreds of GB of data. Ideally something based on a distributed hashtable, that works nicely with Java. It should be fault-tolerant, and open source. The store should be persistent, but…
sanity
  • 33,862
  • 40
  • 131
  • 218
11
votes
1 answer

Amazon DynamoDB Conditional Writes and Atomic Counters

The application im working on currently requires me to increment an attribute belonging to an item in DynamoDB many times in a 20 to 30 minute period. I've been doing some additional reading about DynamoDBs conditional writes and atomic…
john
  • 519
  • 3
  • 11
  • 20
7
votes
2 answers

How can MySQL Cluster 7.3 achieve 99,999% Availability? Antithesis to CAP Theorem

According to the "Guide to Scaling Web Databases with MySQL Cluster", MySQL Cluster 7.3 can acchieve 99,999% availability while using synchronous update replication. This would be a antithesis to the CAP Theorem since it states that perfect…
NorRen
  • 691
  • 2
  • 8
  • 22
4
votes
1 answer

Looking for a mature, scalable GraphDB with .NET or C++ binding

My basic requirements from a GraphDB: Mature (production-ready) Native .NET or C++ language binding Horizontal scalability: both Automated data redundancy and sharding Distributed graph algorithms / query execution Currently I disqualified the…
Lior Kogan
  • 18,061
  • 4
  • 49
  • 79
4
votes
1 answer

Aerospike: How Primary & Secondary Index works internally

We are using Aerospike DB and was going through the documentation. I could not find good explanation of algorithm explaining how Primary & Secondary index works. The documentation says it uses some sort of distributed hash + B Tree. Could someone…
Tarun
  • 2,917
  • 2
  • 24
  • 40
4
votes
3 answers

Data allocation in the distributed databases

How to optimize a data allocation in the distributed database? Are there any software products for solving this problem? For example: There are some number of connected servers for the distributed database. Each server simultaneously is a client of…
4
votes
2 answers

Looking for distributed, in-memory Graph DB

Is there a database which will satisfy all (or at least most) of this requirements? Graph oriented - optimized for storing graphs and traversal(e.g HyperGraphDB, Neo4j) Running in memory, but having a persisted storage (e.g. Redis) Distributed…
Insomniac
  • 3,084
  • 2
  • 20
  • 23
4
votes
0 answers

How to shard using OrientDB

How to achieve sharding on OrientDB? Suppose I have three nodes viz. node1, node2 and node3. I have two clusers viz. zip_india, zip_usa. Now I tried to set the servers up such that zip_india will be on node1 and node2 and zip_usa will be on node3.…
Abhilash Panigrahi
  • 1,195
  • 1
  • 10
  • 28
4
votes
2 answers

What node does Cassandra store data on?

Is there a command or any way at all to know what data is stored on what nodes of Cassandra? Im pretty new to Cassandra and haven't had much luck googling this question. Thanks!
user3376961
  • 747
  • 1
  • 10
  • 17
4
votes
1 answer

Distributed Key/Value store in C/C++ With pluggable storage

I am looking for a C/C++ based distributed key/value store which has a clean enough design so I can plug in my own in-memory storage engine. It is OK even if I have to do code changes to be able to do that. Does anyone have a recommendation? Or a…
user1461001
  • 613
  • 1
  • 6
  • 16
3
votes
1 answer

Combine multiple Rocksdb databases

There is a use case for which I have to read huge Parquet file and convert into Rocksdb binary, So I decided to use spark (because everybody is familiar with it in my team). And from Rocksdb side I know it's not distributed and you can not…
Kaushal
  • 2,981
  • 3
  • 27
  • 44
3
votes
1 answer

How does YugaBytes performance compare between Redis client and Postgres client for simple Key-Value schema?

Given that YugaByte offers client drivers for Redis and Postgres, I was wondering about performance differences between the two if used in similar fashion. For example, assume a Postgres table with 2 columns, 1 VARCHAR as the primary key and 1 TEXT…
3
votes
0 answers

Can I use bigchainDB server with django instead of using sqlite?

I am creating degree verification process using blockchain approach which contain six main entities. By entities I mean to say consensus mechanism will evolve around these six entities, so for this I need to build a distributed database. Two…
3
votes
2 answers

Cassandra Error: "Unable to complete request: one or more nodes were unavailable."

I am a complete newbie at Cassandra and am just setting it up and playing around with it and testing different scenarios using cqlsh. I currently have 4 nodes in 2 datacenters something like this (with proper IPs of…
user3376961
  • 747
  • 1
  • 10
  • 17
1
2 3
10 11