Understanding indexing better

navaneeth · February 16, 2019, 4:30am

Dgraph is a pretty interesting project. I was curious to know how the indexing is done and the querying is done over it.

From which place in the code can I read more about it?

martinmr · February 19, 2019, 8:50pm

applyMutations in worker/draft.go is a good entry point on how mutations are added. If an index exists on this predicate, new index entries will be added during this process as well. Another file to check is posting/index.go.

For the querying part, ProcessQuery in query/query.go is a good start point. Also look at worker/task.go, in particular the methods ProcessTaskOverNetwork and helpProcessTask.

In general, what we do is generate list of tokens to a list of UIDs. For example, if you index a predicate using the term tokenizer and add a triple with subject “Anne Smith”, two lists will be generated (one for Anne and another for Smith), each of which contains the uid of the new triple.

During queries, Dgraph looks at that list to reduce the number of UIDs it needs to look at in order to retrieve results.

system · March 21, 2019, 8:50pm

This topic was automatically closed 30 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Get Started with Dgraph - Basic Operations - Dgraph documentation Documentation	0	418	August 28, 2020
Implementing indexing and filtering Users	5	813	November 28, 2017
Optimizing Indexing in Dgraph - Dgraph Blog Blog	0	977	January 29, 2019
Query and mutate in one request Dgraph	7	1316	June 16, 2020
Filtering on same predicate using multiple indices Dgraph	9	467	July 3, 2020

Understanding indexing better

Related topics