Does Dgraph have scalability problems with graphs that have a single heavy predicate?

harshil_goel · December 9, 2024, 11:40am

Yes, your understanding is correct. We currently don’t split shards, so if you have one big predicate, it has to reside on one disk. Querying it would still be fast, though. Badger supports a value log (vlog), so big values are stored in the log and we read them directly through a pointer into the vlog. We would face problems if you query the entire data at once, because we won’t be able to read it all into memory. So things like traversals and indexes could face issues.
We are thinking that we can split a predicate across multiple groups once it passes a certain size to avoid this issue. One option is to shard on the basis of nodes, but we are researching other avenues as well.
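
To make the difference concrete, here is a rough Python sketch comparing the two placement strategies. It is only an illustration, not Dgraph code: the hash-based group assignment, the `NUM_GROUPS` value, and the function names `group_by_predicate` and `group_by_node` are all assumptions made up for this example, and node-based splitting is still just an idea under consideration.

```python
import hashlib
from collections import defaultdict

NUM_GROUPS = 3  # hypothetical number of groups in the cluster


def stable_hash(key: str) -> int:
    """Deterministic hash so that placement is identical across runs."""
    return int.from_bytes(hashlib.sha256(key.encode()).digest()[:8], "big")


def group_by_predicate(predicate: str) -> int:
    """Whole predicate goes to one group (simplified model of no shard splitting)."""
    return stable_hash(predicate) % NUM_GROUPS


def group_by_node(predicate: str, subject_uid: int) -> int:
    """Hypothetical split: the subject node also takes part in choosing the group."""
    return stable_hash(f"{predicate}:{subject_uid}") % NUM_GROUPS


def placement(triples, assign):
    """Count how many (subject, predicate) entries land in each group."""
    counts = defaultdict(int)
    for subject_uid, predicate in triples:
        counts[assign(subject_uid, predicate)] += 1
    return dict(counts)


# One heavy predicate ("follows") and one light predicate ("name").
triples = [(uid, "follows") for uid in range(10_000)]
triples += [(uid, "name") for uid in range(100)]

by_predicate = placement(triples, lambda uid, pred: group_by_predicate(pred))
by_node = placement(triples, lambda uid, pred: group_by_node(pred, uid))

print("per-predicate placement:", by_predicate)
print("per-node placement:", by_node)
```

With per-predicate placement, every `follows` entry ends up in the same group, which is the single-disk situation described above. With the hypothetical per-node split, the heavy predicate spreads across groups, but a traversal within that predicate would then have to touch several groups.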

We did it this way so that it’s faster to traverse within a predicate. Hence we want to try to maintain the performance of queries even with splits.

