Hello Everyone,
I am new for DGraph. I read documents but I didn’t find exact answer.
We use HGraphDB(a tinkerpop implemention with Hbase) and we have some problems so we want to migrate another graph db. But I have some question;
We have petabytes level of linked data with many different vertex and edge types. However the big portion of edges consists of one edge type. For example, for 1PB, 800TB of edge data has same type, other 200TB differs.. I read some document that mentions DGraph stores same edges in same server!!. I don’t know it true or not.
Can anyone explain this issue or share related documents? If yes how can I handle it?
Yes and no. Each edge will be stored in groups. A group can have one or more edges. And each group will be stored in a single Alpha. But if you have replication, this data will be replicated* across the cluster accordingly to your cluster config.
In the future we gonna work in even more sharding, at the pred/edge level. But this will be something to 2021 Q4 I guess - dunno exact.
Thanks Michel,
So you mean, DGraph stores same edge types in a single Alpha. In my case, 800TB of data that consists same edge type will be stored in a single Alpha.