Optimizing sum() performance: Dgraph takes 10s (16 vCPU, 128 GB RAM), PostgreSQL takes 1.12s (4 vCPU, 16 GB RAM)

Given this schema:

type Hotel {
	name
}

type Room {
	hotel
	name
}

type Ledger {
	hotel
	room
	createdTs
	amount
}

name: string @index(exact, term) .
hotel: uid @reverse .
room: uid @reverse .
createdTs: datetime @index(hour) .
amount: int @index(int) .

And there being:

  1. 100 Hotels,
  2. 1000 Rooms per Hotel, and
  3. 2000 Ledgers per Room (i.e. 100,000 Rooms and 200 million Ledgers in total);

this query took 10.09s:

{
  getAmountSumForAllLedgersOfHotel99(func: eq(name, "Hotel99")) {
    uid
    ~hotel @groupby(hotel) {
      sum(amount)
    }
  }
}

and this query timed out:

{
  getLedgerSumForHotel99WithTimeRange(func: eq(name, "Hotel99")) {
    ~hotel @filter(ge(createdTs, "2019-03-01") AND le(createdTs, "2019-03-31")) @groupby(hotel) {
      sum(amount)
    }
  }
}

10.09s is too long: PostgreSQL and MSSQL can both run the equivalent query in under 1.5s:

SELECT Ht.NAME, SUM(lgr.Amount) AS TotalAmount, COUNT(lgr.LedgerID) AS TotalBill
FROM Hotel Ht
JOIN Room Rm ON Ht.HotelID = Rm.HotelID
JOIN Ledger lgr ON Ht.HotelID = lgr.HotelID AND Rm.RoomID = lgr.RoomID
WHERE Ht.NAME = 'Hotel99'
GROUP BY Ht.NAME

I have 2 questions:

  1. How should I optimize my queries? With these perf numbers… my management will likely drop Dgraph from consideration. :frowning:
  2. Is hotel: uid @reverse @count . permitted? (See the sketch below for what I'd use it for.)
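
For context, what I'd use it for is a count-at-root query over the reverse edge. A rough sketch, assuming @reverse and @count can be combined on a uid predicate and that the count index covers the reverse edge (please correct me if not):

hotel: uid @reverse @count .

{
  # hypothetical query; gt(count(...)) at root needs the @count index
  busyHotels(func: gt(count(~hotel), 1000)) {
    name
  }
}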

Hi @geoyws, thanks for reaching out to us with the above numbers. We would like to look more closely at what is causing the slowdown. Could you somehow share your alpha's p directory with us, if it's testing data?

Hi @ashishgoswami, how should I share that directory? It's possibly about 400GB in size… Yes, it's just randomly generated test data, nothing sensitive, don't worry. Would you like SSH access to the server itself?

Hi @geoyws, would it be possible for you to share your data generation script? If yes, please send it to my email: ashish@dgraph.io.
If possible, please also share a CPU profile taken while Dgraph is running the above query: https://dgraph.io/docs/howto/#profiling-information
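
For reference, the profile can be grabbed with something like this (assuming the default alpha HTTP port of 8080; adjust for your setup):

go tool pprof http://localhost:8080/debug/pprof/profile

The profile endpoint samples for 30 seconds by default, so run the slow query while it is collecting.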

@ashishgoswami I've sent it to your email address. I'll message you directly about how to use it.

I’d suspect that groupby might be taking time. It happens serially.

Some things you could try to narrow that down (rough sketches follow the list):

  1. Try a query without groupby and with sum.
  2. Try a query with groupby and without sum.
  3. Try a query without either.

And see how the latencies vary. That would help identify which part is slow.
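
Roughly, the three variants could look like this. These are untested sketches against your schema above; the block names are arbitrary, and the first one assumes the value-variable aggregation pattern applies here:

1. Without groupby, with sum:

{
  var(func: eq(name, "Hotel99")) {
    ~hotel {
      # ~hotel reaches both Rooms and Ledgers; only Ledgers have amount
      amt as amount
    }
  }

  sumOnly() {
    totalAmount: sum(val(amt))
  }
}

2. With groupby, without sum:

{
  withGroupby(func: eq(name, "Hotel99")) {
    ~hotel @groupby(hotel) {
      count(uid)
    }
  }
}

3. Without either:

{
  neither(func: eq(name, "Hotel99")) {
    count(~hotel)
  }
}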

@mrjn We’re having a meeting with @dereksfoster99 at 9pm PST to figure out the best way to run our sum queries… we’ll try your suggestion (you’re welcome to join in!)

Created this GitHub issue: Optimizing sum() performance · Issue #5432 · dgraph-io/dgraph

@dmai Hi Daniel, this thread is also related to the script I provided you; I forgot to mention that it came from here. I appreciate y’all taking the time. Really hoping to use Dgraph extensively here.

Hey @geoyws. Thanks for sharing your test scripts. It’ll take time for us to dig deeper and optimize Dgraph to perform well for your specific queries. It’s not a simple task, so I’d expect at least three months before any update on this.

No worries man. Will be here rooting for you guys.

@geoyws, would you share your data generation script in this thread? Thanks.


@questiondgraph Apologies for the late reply. I updated its README.md with some instructions on how to use it.

The script was hastily put together, so there’s no multi-threading.

Thank you for sharing.