Combine different predicates

makitka · May 23, 2018, 1:01pm

hi there. i have a schema like this. nodes are linked one to another, each link belongs to customer, but when browsing, one customer can have access to other customer links:
_:A <link_customer1> _:B .
_:A <link_customer2> _:B .
_:A <link_customer1> _:C .
_:A <link_customer3> _: D .

for example, i need to count for A node all distinct outcoming nodes with link_customer1 link and link_customer2 predicates only. so, result should be 2 (B and C).

it works with storing all predicates in one and adding a “customers” facet, so query would be like:

nodes_count as count(link @facets(anyofterms(customers, “customer1 customer2”))

but as i understand this solution is quite bad for sharding due to using one predicate for all customers.

makitka · May 24, 2018, 6:19am

any ideas? in gremlin and cypher such query is not a problem

MichelDiz · May 24, 2018, 3:57pm

First of all, let me know if it’s all right below.

mutation

{
  set {
    _:Anna <name> "Anna" .
    _:Cris <name> "Cris" .
    _:Bob <name> "Bob" .
    _:Dude <name> "Dude" .
    
    _:Anna <link_customer1> _:Bob .
    _:Anna <link_customer2> _:Bob .
    _:Anna <link_customer1> _:Cris .
    _:Anna <link_customer3> _:Dude .
  }
}

Query

{
  me(func: eq(name, "Anna")) {
    name
   link_customer1 {
    expand(_all_)
  }
   link_customer2 {
    expand(_all_)
  }
   link_customer3 {
    expand(_all_)
  }
    count(link_customer1)
    count(link_customer2)
    count(link_customer3)

  }
}

Result


{
  "data": {
    "me": [
      {
        "name": "Anna",
        "link_customer1": [
          {
            "name": "Cris"
          },
          {
            "name": "Bob"
          }
        ],
        "link_customer2": [
          {
            "name": "Bob"
          }
        ],
        "link_customer3": [
          {
            "name": "Dude"
          }
        ],
        "count(link_customer1)": 2,
        "count(link_customer2)": 1,
        "count(link_customer3)": 1
      }
    ]
  },

makitka · May 24, 2018, 5:28pm

thanks for the answer, but unfortunately it’s not what i actually want.
for your example, the sequence should be:

get all nodes through link_customer1 and link_customer2 predicates
(:Bob and :Cris) and (:Bob)
get a union of these results with duplicates removed
(:Bob and :Cris)
count them or display their names
so, result should be 2, or [{“name”: “Bob”}, {“name”: “Cris”}]

i see that there is no union on predicates in dgraph, but may be it’s possible to do somehow using vars…

ideal would be something like:
count(union(link_customer1, link_customer2)) and

union(link_customer1, link_customer2) {
  name
}

but it’s just a wish for future

makitka · May 24, 2018, 7:43pm

seems like i found a solution:

{
	var(func: eq(name, "Anna")) {
		customer1_result as link_customer1 {
			uid
		}

		customer2_result as link_customer2 {
			uid
		}
	}
	
	count(func: uid(customer1_result, customer2_result)) {
		count(uid)
	}

	names(func: uid(customer1_result, customer2_result)) {
		name
	}
}

    "data": {
        "count": [
            {
                "count": 2
            }
        ],
        "names": [
            {
                "name": "Cris"
            },
            {
                "name": "Bob"
            }
        ]
    }

is there a more elegant way to achieve the same result?
thanks in advance

MichelDiz · May 24, 2018, 8:14pm

Maybe this

{
	var(func: eq(name, "Anna")) {
		customer1_result as link_customer1 {
			uid
		}

		customer2_result as link_customer2 {
			uid
		}
	}
	
	names(func: uid(customer1_result, customer2_result)) {
		name
    total : count(uid)
	}
}

{
  "data": {
    "names": [
      {
        "total": 2
      },
      {
        "name": "Cris"
      },
      {
        "name": "Bob"
      }
    ]
  }

makitka · May 24, 2018, 8:22pm

yep, i tried this, but as for me it’s better to have these values in separate places.
i was wondering if there is a way to do this without using vars.

anyway, thanks for help!

MichelDiz · May 24, 2018, 8:44pm

I believe that this is the only way. Other mathematical parameters only work with variables. But count is perfect for what you want. If there are repeated UIDs it returns only unique values.

The form count(uid) counts the number of UIDs matched in the enclosing block.

tamethecomplex · May 24, 2018, 9:58pm

@makitka @MichelDiz @mrjn

I filed a similar issue here: Allow facets at the schema level · Issue #2009 · dgraph-io/dgraph · GitHub

We moved toward using many separate predicates for sharding purposes but pay the price in cumbersome queries. For example if I want to retrieve all predicate values that are a “name” but there are 100 different “name” predicates, we have to list all 100 predicates in the query. What we need is a higher level grouping of predicates so that we get sharding efficiency without sacrificing query conciseness.

The way we get around this is by storing these groupings ourselves and using server side string building to build the queries…Because it’s not practical to write these queries by hand.

Topic		Replies	Views
Compare predicates from two different nodes Users	5	1341	June 3, 2018
Retrieve a set of predicates for all nodes Dgraph performance	10	2152	July 3, 2020
Filtering on same predicate using multiple indices Dgraph	9	467	July 3, 2020
Multiple query - one for predicates and one for shema - in one block? Dgraph	4	381	October 14, 2019
How to query all predicates from one node by one predicate? Users kind:question , dgraph	9	1677	September 3, 2020

Combine different predicates

Related topics