Performance is not good when finding the paths between 2 nodes with k-shortest

peterZ · November 1, 2018, 4:41am

Hi there,
I am trying to find paths between two nodes. Currently only k-shortest path queries can be used for this, and the performance is poor when I query with numpaths larger than 10.
Are there any suggestions for this?
Why does the performance drop so much as numpaths gets larger?
Why does the performance change at depth 10?
Is there any way to improve it?

I used dgraph-ha.yaml to deploy Dgraph,
and here is the query I ran:
{
  path as shortest(from: 0x107db5, to: 0x138ac1, numpaths: 10) {
    _Concept
    ~_Concept
    _MainProduct
    ~_MainProduct
  }
  path(func: uid(path)) {
    Name
    StockCode
  }
}

Here are the results:
numpaths < 10: 100 to 300 ms
10 <= numpaths <= 20: 9.5 to 10 s
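
To pin down exactly where the jump happens, the query can be timed across a range of numpaths values. The following is only a rough sketch: `run_query` is a hypothetical callable standing in for whatever client call sends the query to Dgraph, and `build_query`, `sweep`, and `largest_jump` are illustrative helper names, not part of any Dgraph API.

```python
import time

# The query from this post, with numpaths left as a placeholder.
QUERY_TEMPLATE = """{
  path as shortest(from: 0x107db5, to: 0x138ac1, numpaths: %d) {
    _Concept
    ~_Concept
    _MainProduct
    ~_MainProduct
  }
  path(func: uid(path)) {
    Name
    StockCode
  }
}"""


def build_query(numpaths):
    """Fill the numpaths placeholder in the query text."""
    return QUERY_TEMPLATE % numpaths


def sweep(run_query, numpaths_values, repeats=3):
    """Return {numpaths: best wall-clock seconds over `repeats` runs}.

    `run_query` is an assumption: any callable that takes the query
    string and blocks until the response has arrived.
    """
    timings = {}
    for n in numpaths_values:
        query = build_query(n)
        best = float("inf")
        for _ in range(repeats):
            start = time.perf_counter()
            run_query(query)
            best = min(best, time.perf_counter() - start)
        timings[n] = best
    return timings


def largest_jump(timings):
    """Return (numpaths, ratio) for the biggest relative slowdown
    between two consecutive measured numpaths values."""
    keys = sorted(timings)
    ratios = [(timings[b] / timings[a], b) for a, b in zip(keys, keys[1:])]
    ratio, n = max(ratios)
    return n, ratio
```

Calling `sweep(run_query, range(1, 21))` and then `largest_jump` on the result would show whether the slowdown is a single step at one numpaths value or a gradual climb. Taking the best of several runs is meant to reduce the effect of cold caches, which may matter on an HDD.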

BTW, I used an HDD for this test.

Thanks

kortschak · November 1, 2018, 5:50am

k-shortest paths is not a particularly cheap calculation. Yen’s
algorithm (I don’t know if dgraph uses this, but it likely uses
something similar - it does look much more complicated than it needs to
be though) has a time complexity of O(k|V|(|E|+|V|log|V|)).
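
For reference, here is a minimal sketch of Yen's algorithm in Python. It is not Dgraph's implementation; the adjacency-dict graph format and the function names `dijkstra`, `path_cost`, and `yen_k_shortest` are assumptions made for illustration. It shows where the factor of k comes from: every accepted path triggers one more shortest-path search per node along it.

```python
import heapq


def dijkstra(graph, source, target,
             banned_nodes=frozenset(), banned_edges=frozenset()):
    """Return (cost, path) for the cheapest source->target path, or None."""
    heap = [(0, [source])]
    done = set()
    while heap:
        cost, path = heapq.heappop(heap)
        node = path[-1]
        if node == target:
            return cost, path
        if node in done:
            continue
        done.add(node)
        for nxt, weight in graph.get(node, {}).items():
            if nxt in done or nxt in banned_nodes:
                continue
            if (node, nxt) in banned_edges:
                continue
            heapq.heappush(heap, (cost + weight, path + [nxt]))
    return None


def path_cost(graph, path):
    """Sum the edge weights along a path."""
    return sum(graph[u][v] for u, v in zip(path, path[1:]))


def yen_k_shortest(graph, source, target, k):
    """Return up to k loopless shortest paths as (cost, path) tuples."""
    first = dijkstra(graph, source, target)
    if first is None:
        return []
    accepted = [first]
    candidates = []  # min-heap of (cost, path)
    while len(accepted) < k:
        last_path = accepted[-1][1]
        # Try deviating from the last accepted path at each node in turn.
        for i in range(len(last_path) - 1):
            spur_node = last_path[i]
            root = last_path[:i + 1]
            # Ban the next edge of every accepted path sharing this root,
            # so the spur search cannot rediscover an accepted path.
            banned_edges = {
                (p[i], p[i + 1])
                for _, p in accepted
                if len(p) > i + 1 and p[:i + 1] == root
            }
            # Ban root nodes (except the spur node) to keep paths loopless.
            banned_nodes = frozenset(root[:-1])
            spur = dijkstra(graph, spur_node, target,
                            banned_nodes, banned_edges)
            if spur is None:
                continue
            total = root[:-1] + spur[1]
            entry = (path_cost(graph, total), total)
            if entry not in candidates and entry not in accepted:
                heapq.heappush(candidates, entry)
        if not candidates:
            break
        accepted.append(heapq.heappop(candidates))
    return accepted


# Small directed example graph: {node: {neighbour: weight}}.
GRAPH = {
    "C": {"D": 3, "E": 2},
    "D": {"F": 4},
    "E": {"D": 1, "F": 2, "G": 3},
    "F": {"G": 2, "H": 1},
    "G": {"H": 2},
}

if __name__ == "__main__":
    for cost, path in yen_k_shortest(GRAPH, "C", "H", 3):
        print(cost, "-".join(path))
```

The sketch carries whole paths on the heap for brevity, which is wasteful on large graphs; a real implementation would track predecessors instead.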

peterZ · November 1, 2018, 3:29pm

Hi kortschak, thanks for your reply. Given that time complexity, the query should take longer as k grows. But in my testing, the time barely changed when I set k to 10 or larger, which looks very strange. Do you have any idea why?
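
To put rough numbers on the mismatch, the timings reported at the top of the thread can be compared with what a cost that grows linearly in k would suggest. This is only back-of-the-envelope arithmetic: the big-O expression is a worst-case bound rather than a prediction, and `predicted_ratio` is a made-up helper that assumes everything except k stays fixed.

```python
def predicted_ratio(k_from, k_to):
    """Cost ratio if running time scaled linearly in k, all else fixed."""
    return k_to / k_from


# Reported ranges from the first post, in milliseconds.
below_10_ms = (100, 300)       # numpaths < 10
from_10_ms = (9500, 10000)     # 10 <= numpaths <= 20

# Crossing from numpaths 9 to 10: linear scaling suggests about 1.11x,
# but even the slowest run below 10 against the fastest run at 10 or
# more gives roughly a 31.7x slowdown.
step_predicted = predicted_ratio(9, 10)
step_observed_min = from_10_ms[0] / below_10_ms[1]

# Going from numpaths 10 to 20: linear scaling suggests 2x, but the
# reported range allows at most about 1.05x.
plateau_predicted = predicted_ratio(10, 20)
plateau_observed_max = from_10_ms[1] / from_10_ms[0]

if __name__ == "__main__":
    print(f"9 -> 10:  predicted {step_predicted:.2f}x, "
          f"observed at least {step_observed_min:.1f}x")
    print(f"10 -> 20: predicted {plateau_predicted:.2f}x, "
          f"observed at most {plateau_observed_max:.2f}x")
```

So the measurements look like a one-off step at 10 followed by a plateau, which is not the shape that growth in k alone would produce.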

kortschak · November 1, 2018, 8:58pm

I’ve just had a closer look at the code, and despite its illegibility, it’s reasonably clear that they are not using Yen, though they are doing similar work less efficiently.

I can’t see any discontinuity at 10 unless it’s something in the runtime (grow map?, though that seems unlikely) or something to do with the backing store (more likely).

peterZ · November 2, 2018, 12:52am

Thanks for the info. I’ll try to find out whether there’s some limitation in the backing store.

