Discussion: Wikipedia backed by DGraph

I have loaded DBpedia, an RDF version of Wikipedia (specifically the 2016-10 dump), into Dgraph. In contrast to Wikipedia, DBpedia is not very text-heavy (at most there is a long abstract, but not the entire article, iirc).

The goal was to get a large graph (500M triples) with a wide long-tail schema (230k predicates) and to query that data with simple single-step path queries, essentially as a benchmark dataset. The benchmarking is meant not to measure raw speed but how performance degrades with scale (constant, linear, polynomial). Loading that data was very painful with 20.03.3 (memory-wise), but discussion with the core devs gave me the impression that the next version is much more stable and performant.
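For reference, a single-step path query against this data looks roughly like the sketch below. The predicate IRIs and the index on the name predicate are assumptions about how the DBpedia RDF happens to be mapped during bulk load, so treat it as illustrative rather than an exact query from my benchmark.

```
{
  # Single-step path query: look up an entity by name, then follow one edge.
  # Predicate IRIs are illustrative; actual names depend on how the DBpedia
  # predicates were mapped at load time, and eq() needs an index (e.g. hash)
  # on the name predicate.
  q(func: eq(<http://xmlns.com/foaf/0.1/name>, "Albert Einstein")) {
    uid
    <http://dbpedia.org/ontology/birthPlace> {
      uid
      <http://xmlns.com/foaf/0.1/name>
    }
  }
}
```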

Query performance in my use case is satisfactory, except for some issues around pagination.
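To show what I mean by pagination, here is a minimal sketch of the first/offset access pattern I use; the predicate IRI is again illustrative, and this is only meant to show the shape of the query, not the specific issue.

```
{
  # Page through nodes that have a given edge, 100 at a time.
  # The predicate IRI is a placeholder for whatever was loaded from DBpedia.
  page(func: has(<http://dbpedia.org/ontology/birthPlace>), first: 100, offset: 1000) {
    uid
  }
}
```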

I think your problem statement needs a why: why would a graph database be beneficial here, and what is the use case or access pattern that Dgraph would improve?
