Very poor dateTime index/filtering performance when large numbers of nodes have the same date

ahctangU · August 14, 2020, 1:55am

This is really a domain problem, but my dataset was exported from a MySQL database which currently has large numbers of data nodes that have a date time of 2999-12-31T15:00:00Z. This represents data nodes that have no expiry for the foreseeable future.

However, I have noticed that executing something like @filter(le(min_sale_start_datetime, "2020-08-13T17:00:00Z") AND ge(max_sale_end_datetime, "2020-08-13T17:00:00Z")) is dreadfully slow, somewhere in the range of 500ms.

I realize, after reading the Dgraph Blog post "Datetime Indexes in Dgraph", that this is probably a result of having too many uids in the same date time index bucket. However, this is a fairly common thing to have in an SQL database, so I feel like it would make sense to support this better?
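
To show what I mean by too many uids in one bucket, here is a toy sketch of my mental model. This is not Dgraph's actual index code, just an assumption of how hour-granularity buckets behave, and the data is made up:

```python
from collections import defaultdict
from datetime import datetime, timezone


def hour_bucket(ts):
    """Truncate a timestamp to the hour, mimicking an hour-granularity index token."""
    return ts.replace(minute=0, second=0, microsecond=0)


def build_index(nodes):
    """Map each hour bucket to the list of uids whose timestamp falls in it."""
    index = defaultdict(list)
    for uid, ts in nodes:
        index[hour_bucket(ts)].append(uid)
    return index


# Made-up data: 3 nodes with real end dates, 1000 nodes with the sentinel.
sentinel = datetime(2999, 12, 31, 15, 0, tzinfo=timezone.utc)
nodes = [(i, datetime(2020, 8, 10 + i, 9, 30, tzinfo=timezone.utc)) for i in range(3)]
nodes += [(100 + i, sentinel) for i in range(1000)]

index = build_index(nodes)
largest = max(index.values(), key=len)
print(len(index), len(largest))  # 4 buckets, the largest holding 1000 uids
```

Any ge(...) filter with a date before the sentinel has to include that one huge bucket, which is where I suspect the time goes.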

Would the recommended workaround in this case be either to change the data into a boolean, or to find some way to distribute the dates better?
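
For the boolean option, I imagine something like the following at export time. It is only a rough sketch: the `no_expiry` predicate and the row layout (`id`, `start`, `end`) are names I made up for illustration, not anything from Dgraph or my real schema.

```python
from datetime import datetime, timezone

# Sentinel used in the MySQL export to mean "never expires".
NO_EXPIRY = datetime(2999, 12, 31, 15, 0, 0, tzinfo=timezone.utc)


def to_dgraph_node(row):
    """Turn one exported row (a dict) into a JSON-style node for loading.

    `no_expiry` is a hypothetical boolean predicate. Rows carrying the
    sentinel get the flag and no max_sale_end_datetime at all, so they
    never land in the datetime index.
    """
    node = {
        "uid": "_:" + str(row["id"]),
        "min_sale_start_datetime": row["start"].isoformat(),
    }
    if row["end"] == NO_EXPIRY:
        node["no_expiry"] = True
    else:
        node["no_expiry"] = False
        node["max_sale_end_datetime"] = row["end"].isoformat()
    return node
```

The filter would then presumably become le(min_sale_start_datetime, "2020-08-13T17:00:00Z") AND (eq(no_expiry, true) OR ge(max_sale_end_datetime, "2020-08-13T17:00:00Z")), with a bool index on no_expiry. I have not measured whether this is actually faster.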

poketulhu · December 17, 2020, 8:34am

I have the same problem.
I would be very glad if someone could suggest a solution.

Kuririn · July 5, 2021, 10:02am

Bump, similar problem
