Filtering by Relation?

Emil · January 6, 2020, 6:32pm

Hi there!

I’m struggling a bit with a query I’m trying to write. The idea behind it seems very simple to me, yet after reading the docs and playing around with Dgraph for a bit, I’m still not able to accomplish the following:

With a schema something like

name: string .
relation: [uid] @reverse .

and this data:

{
    set {
        _:one <name> "One" .
        _:one <relation> _:ten (name="child") .
        _:two <name> "Two" .
        _:two <relation> _:ten (name="child") .
        _:ten <name> "Ten" .
        _:two <relation> _:eleven (name="child") .
        _:eleven <name> "Eleven" .
        _:three <name> "Three" .
        _:three <relation> _:ten (name="child") .

    }
}

(The relation name is stored on a facet since users should be able to define custom relations in my application)

How would I find nodes that have a certain relation to all of a list of uids?

The only thing I could come up with is

query {
  q(func: has(name)) @filter(uid_in(~relation, "0x2744") AND uid_in(~relation, "0x2741")) {
    uid
    name
  }
}

But I want something more dynamic where I can pass a list of uids as variable, instead of hardcoding a query like that.
So with my example data, I would expect the query

called with the uids of [one, two, three] to return ten
called with the uids of [one, two] to return ten
called with the uids of [two] to return ten and eleven

I hope my explanation makes sense.
Can I write such a query with Dgraph?

Thanks in advance

MichelDiz · January 6, 2020, 9:16pm

At first, using cascade is a bit tricky. But it is useful in many situations.

{
  q(func: has(name)) @cascade  {
    uid
    name
    ~relation @filter(eq(name,["One", "Two", "Three"]) and not eq(name,"Two"))
  }
}

OR (If you know that the set of nodes you need has more than 1 relationship).

{
  q(func: has(name)) @filter(gt(count(~relation), 1)) @cascade  {
    uid
    name
    ~relation @filter(eq(name,["One", "Two", "Three"]))
  }
}

{
  q(func: has(name)) @cascade  {
    uid
    name
    ~relation @filter(eq(name,["One", "Two"]) and not eq(name,"Two"))
  }
}

 { q(func: has(name)) @filter(eq(count(~relation), 1) ) @cascade  {
    uid
    name
    ~relation @filter(eq(name,"Two") )
  }
}

In the last example, as both Ten and Eleven are related to Node “Two”. I needed to set a parameter to make sure it has only a single relationship and no more than one. Using eq (count (~ relation), 1). It’s kind of a trick, but avoids returning unwanted sets. You can use @filter(gt(count(~relation), 1) (Greater than) if you need tho.

Emil · January 7, 2020, 8:17pm

Thanks for your thorough response @MichelDiz!
Seems like I failed to explain what I need

First, I’m trying to query by uid, not name. Could those simply be swapped in your queries?
Your queries also make assumptions about the data if I’m not mistaken.

Would it be possbile to build one universal query that does “find nodes that have a certain relation to all nodesof a list of uids”, making no assumptions about the available data (e.g. any node could have any amount of relations to any node, and you know nothing about the data except the schema)?

MichelDiz · January 7, 2020, 9:58pm

Yes. In that case you gonna use uid func instead of uid_in.

e.g:

~relation @filter(uid(0x2744) AND uid(0x2741) ) #you can use OR too.

I used your sample and tried to interpret your text.

I think if you use cascade and uid func you can accomplish this.

Emil · January 8, 2020, 7:31pm

Are you sure this would work?
In my testing this doesn’t return anything.
I think that’s because it searches for a relation that is pointing to two nodes, which isn’t possible. There can’t be a single relation that matches @filter(uid(0x2744) AND uid(0x2741) )

Do I miss something?

MichelDiz · January 8, 2020, 7:49pm

yep. Try these queries.

{
  # q1(func: has(relation)){
  #   uid
  #   name
  # }
  q(func: has(name)) @cascade  {
    uid
    name
    ~relation @filter( uid("0xc0e32") AND uid("0xc0e34") OR uid("0xc0e36"))
  }
}

{
  # q1(func: has(relation)){
  #   uid
  #   name
  # }
  q(func: has(name)) @cascade  {
    uid
    name
    ~relation @filter( uid("0xc0e32") OR uid("0xc0e34") AND uid("0xc0e36")) 
  }
}

{
  # q1(func: has(relation)){
  #   uid
  #   name
  # }
  q(func: has(name)) @filter(eq(count(~relation), 1)) @cascade  {
    uid
    name
    ~relation @filter(uid("0xc0e34"))
  }
}

Emil · January 8, 2020, 7:55pm

All of the queries you just posted try to match just one uid, by WONT_MATCH AND WONT_MATCH OR WILL_MATCH.

So for example

would also match when the node doesn’t have a relation with the first two uids

MichelDiz · January 8, 2020, 8:36pm

The point was

The same queries done before

but the func swapped from eq(name, “x”) to uid(x).

{
  q(func: has(name)) @cascade  {
    uid
    name
    ~relation @filter(uid(0xc0e32,0xc0e34,0xc0e36) and not uid(0xc0e34))
  }
}

{
  q(func: has(name)) @filter(gt(count(~relation), 1)) @cascade  {
    uid
    name
    ~relation @filter(uid(0xc0e32,0xc0e34,0xc0e36))
  }
}

{
  q(func: has(name)) @cascade  {
    uid
    name
    ~relation @filter(uid(0xc0e32,0xc0e34) and not uid(0xc0e34))
  }
}

{ 
  q(func: has(name)) @filter(eq(count(~relation), 1) ) @cascade  {
    uid
    name
    ~relation @filter(uid(0xc0e34))
  }
}

That’s the values.

{
  "data": {
    "q1": [
      {
        "uid": "0xc0e32",
        "name": "One"
      },
      {
        "uid": "0xc0e34",
        "name": "Two"
      },
      {
        "uid": "0xc0e36",
        "name": "Three"
      }
    ]
  }
}

ppp225 · January 8, 2020, 9:07pm

Hey Emil,

I think you can create this logic with use of query blocks like this:

{
  Q1 as var(func: has(name)) @cascade {
    uid
    name
    ~relation @filter(eq(name,"One"))
  }
  Q2 as var(func: has(name)) @cascade {
    uid
    name
    ~relation @filter(eq(name,"Two"))
  }
  Q3 as var(func: has(name)) @cascade {
    uid
    name
    ~relation @filter(eq(name,"Three"))
  }
  result(func: has(name)) @filter(uid(Q1) AND uid(Q2) AND uid(Q3)) { 
    uid
    name
  }
}

You could programmatically create this query for any number of arguments without any need for hardcoding, and since dgraph processes query blocks in parallel, it should return results quite fast even for bigger queries; however I wonder, if this can be done in an more elegant/efficient way.

Emil · January 10, 2020, 6:14pm

Thanks @ppp225!

That works. I’ll do some performance testing comparing it to my version in the op

Yep, was just thinking that maybe it would be possible to not have to create a new query, but I guess that’s how you do it

Thanks

system · February 9, 2020, 6:14pm

This topic was automatically closed 30 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Filter by relation predicate? Users	18	2206	December 28, 2018
Filtering by relation Dgraph	9	1000	April 1, 2022
Filtering issue Dgraph kind:question , dgraph	1	308	November 9, 2020
Query for a node, filtering using uid of child nodes (relations) Dgraph Clients untagged , dgraph-js	7	566	July 11, 2020
Filter query issue Dgraph kind:question , dgraph	3	329	November 12, 2020

Filtering by Relation?

The same queries done before

Related topics