Hi, we are planning to deploy an object graph with 5 billion nodes and 20 billion edges. We still have a few open questions to settle before making a decision. Here are our concerns about Dgraph:
- The largest active production deployment we can find is Factset's, with 160 million nodes and 2 billion edges, but we are looking at an even bigger deployment. Are there any customers running clusters with larger specs?
- What is the size of the metadata associated with each node and edge? A rough estimate as a function of node size would be enough.
- Suppose I deploy a 3-node cluster with no replicas for each shard. If one node goes down permanently, will Dgraph Zero initiate recovery by resharding onto the remaining 2 nodes, or do we need to manually bring up a third node from backup data?
- Is the storage layer customizable or pluggable?
It would be really helpful if you could answer the questions above.