Cascade Directive with Pagination produces unexpected results

diggy · July 10, 2020, 1:53pm

Moved from GitHub dgraph/5930

Posted by amaster507:

What version of Dgraph are you using?

docker image dgraph :master v2.0.0-rc1-448-gd5892dc0c

Have you tried reproducing the issue with the latest release?

N/A, as @cascade is only available for GraphQL in 20.07+

What is the hardware spec (RAM, OS)?

AWS ec2.large (8gb RAM, Ubuntu)

Steps to reproduce the issue (command/config used to run Dgraph).

I have a query that returns around 15K nodes, and I want to get the first 5 that also match the @cascade directive. The query returns 0 results. However, if I remove the pagination limit, I get all the nodes that match cascade.

I believe the problem is that pagination happens before the cascade directive at the root of the query. Is there a way to reverse this order? I don’t see why someone would want to paginate first and then apply a cascade directive.

This will return all of the completed Tasks:

query {
  queryTask @cascade {
    name
    completed
  }
}

This should return the first 5 completed Tasks, but it instead returns an empty set:

query {
  queryTask(first: 5) @cascade {
    name
    completed
  }
}

Expected behaviour and actual result.

I expect pagination to work in conjunction with cascade, but that does not seem to be the case. I understand that, for performance, pagination reduces the load much faster than cascade does, but I do not believe that is the effect users expect.

I also tested with DQL in Ratel and got similar results, so this issue is not specific to the GraphQL endpoint.

diggy · July 17, 2020, 1:48pm

josh-mercarto commented :

Also having this issue

diggy · July 18, 2020, 6:25am

quotationmarks-jzj commented :

Also having this issue

MichelDiz · October 20, 2020, 10:53pm

@amaster507 I think this is GraphQL, right? It should be moved to GraphQL issues.

amaster507 · October 21, 2020, 1:14am

I believe it applies to both, I discovered it with GraphQL but also compared with DQL. I moved it now. It really applies to both.

maaft · December 16, 2020, 3:04pm

Any news on this?

I think this is possibly related to Filter on non-scalar fields or relations (Types) - #17 by maaft

I.e. when we have more powerful filters (filter on child-child-child… properties), we wouldn’t need @cascade in most cases.

amaster507 · December 16, 2020, 7:44pm

Knowing how cascade works suggests that this bug may not be fixable. The first argument is applied before cascade, because cascade runs near the end of the processing tree. This makes the query faster but produces “wrong” results. If cascade were applied first, the query would be less efficient but would return “correct” results. Some users may expect the “wrong” results, so fixing this might actually break things for them. I hope we get filtering on edges, which would largely remove the need for cascade. The has filter already helps quite a bit, but it is still not perfect.
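
To make the ordering concrete, here is a minimal Python sketch. It is only a toy model of the two orderings, not Dgraph's implementation: the in-memory tasks list, the cascade and first helpers, and the choice of which tasks have a completed value are all assumptions made up for illustration.

```python
# Toy data: 100 hypothetical Task nodes. Only every tenth task has a
# "completed" value; None stands in for "predicate missing on this node".
tasks = [
    {"name": f"task-{i}", "completed": True if i % 10 == 9 else None}
    for i in range(100)
]

FIELDS = ["name", "completed"]


def cascade(nodes, fields):
    """Keep only nodes on which every requested field is present."""
    return [n for n in nodes if all(n.get(f) is not None for f in fields)]


def first(nodes, k):
    """Keep the first k nodes (pagination)."""
    return nodes[:k]


# Ordering reported in this issue: paginate, then cascade.
# The first 5 nodes all lack "completed", so cascade removes every one.
paginate_then_cascade = cascade(first(tasks, 5), FIELDS)

# Ordering the reporter expected: cascade, then paginate.
cascade_then_paginate = first(cascade(tasks, FIELDS), 5)

print(len(paginate_then_cascade))                  # 0
print([n["name"] for n in cascade_then_paginate])  # 5 names
```

The second ordering has to run cascade over all 100 nodes before it can cut the list down to 5, which is the efficiency cost mentioned above.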

ahctangU · January 25, 2021, 11:54am

Is there some kind of fix for this? This is a pretty catastrophic issue for me.

abhimanyusinghgaur · January 25, 2021, 1:45pm

No, there is no fix for this issue as of now.

If it solves your problem, you may try using the has filter at every level of your query.
But note that doing so is not the same as @cascade. For example, consider these two DQL queries:

Query-1

query {
  queryPerson(func: type(Person)) @cascade(name, friends) {
    name
    friends {
      name
    }
  }
}

Query-2

query {
  queryPerson(func: type(Person)) @filter(has(name) AND has(friends)) {
    name
    friends @filter(has(name) AND has(friends)) {
      name
    }
  }
}

Given this DQL schema:

type Person {
  name
  friends
}
name: string .
friends: [uid] .

And the following data-set:

_:a <name> "Alice" .
_:b <name> "Bob" .
_:c <name> "Charlie" .

_:a <friends> _:b .
_:b <friends> _:c .

Then the result of the two queries will be like this:

Query-1 response

{
  "queryPerson": [
    {
      "name": "Alice",
      "friends": [
        {
          "name": "Bob"
        }
      ]
    }
  ]
}

Query-2 response

{
  "queryPerson": [
    {
      "name": "Alice",
      "friends": [
        {
          "name": "Bob"
        }
      ]
    },
    {
      "name": "Bob",
      "friends": []
    }
  ]
}

Note that the second query will work correctly with pagination, but it will not remove those parents for which a deep descendant was missing some data.

ahctangU · January 25, 2021, 3:18pm

Hello,

Thanks so much for the response! Dgraph is really amazing but these small issues (that seem to have big implementation ramifications) really hurt the usability.

Unfortunately, I was relying on the cascade to remove the parent after applying a bunch of filters to the parent-relationship.

I wish there was some kind of caveat in the documentation that this doesn’t work with pagination before I designed my data model! There is a lot of guidance on this discuss forum that points towards using cascade without any kind of warning.

Also, just out of curiosity, is this slated to be fixed in the foreseeable future? If it is, I can just wait for it.

pawan · February 4, 2021, 9:10am

Yes, we have started working on a fix, which should be available as part of the next release, i.e., 21.03.

ahctangU · February 5, 2021, 6:31am

Thank you so much!

minhaj · February 19, 2021, 1:09pm

This has been fixed in the master. Please see this PR for more details.

himanshu-msphere · January 1, 2025, 5:33pm

In Dgraph v24.0.5, using the @cascade directive in a DQL query causes an error. Is the @cascade directive only supported in GraphQL queries, and not in DQL queries?
Below is an example of such a query and the error response it produces.

query {
  queryPerson(func: type(Person)) @cascade(fields: ["name", "friends"]) {
    name
    friends {
      name
    }
  }
}

Error response
line 2 column 49: Unexpected item while parsing: :

In a GraphQL query, @cascade with fields works fine.

vnium · January 1, 2025, 6:27pm

The syntax is different in DQL; please review the docs.
