The query time is very long, 7 seconds on average. How to optimize it?

tiankonghewo · October 18, 2019, 6:59am

dgrapg version is 1.0.15
import stage:
rdf data size is 400G,
node memory size is 256GB
Originally I wanted to use the new version v1.1.0, but when importing data, the new version always reported an OOM error at map stage. So I had to use the old version 1.0.x

when run up, my cluster server configuration is:
3 nodes:16core64G
each data:170G,380G,440G

second, my query is very simple, use pydgraph==1.2.0,my dgraph version is 1.0.15
but the query time is very long, 7 seconds on average

query = “”“query all($a: string, $value: string) {
all(func: eq(type, $a))@filter(eq(value, $value)) {
uid
type
value
}
}”“”
variables = {‘$a’: ‘PERSON’, ‘$value’: value}
t0 = time.time()
res = client.txn(read_only=True).query(query, variables=variables)
ppl = json.loads(res.json)

Finally, I would like to ask, for TB data size,
What is the recommended cluster size and node configuration?
Is there any production case for reference?
If I need to add nodes, how many should I add? 3 or 6

system · November 17, 2019, 6:59am

This topic was automatically closed 30 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
8 process parallel query dgraph, in less than a minute, memory runs out Users	0	425	October 18, 2019
Slow performance on a single node with millions of documents Dgraph performance , area:performance	7	1899	August 24, 2020
How to plan the cluster size Dgraph kind:question , dgraph , area:operations	5	784	April 23, 2020
How to improve dgraph cluster performance？ Dgraph kind:question , dgraph , cluster	6	802	September 29, 2020
Using bulk to import 300G data leads to oom Dgraph dgraph	7	518	December 19, 2020

The query time is very long, 7 seconds on average. How to optimize it?

Related topics