Live loader produces duplicates with upsertPredicate enabled

martaver · September 7, 2021, 12:48pm

Report a Dgraph Bug

I’m pushing a graphql schema to dgraph using xid fields on types that map to the xid dgraph predicate. When I run live loader with upsertPredicate ‘xid’ however, subsequent imports result in duplicate records being created.

What version of Dgraph are you using?

Dgraph Version

$ dgraph version
 
Dgraph version   : v21.03.2
Dgraph codename  : rocket-2
Dgraph SHA-256   : 00a53ef6d874e376d5a53740341be9b822ef1721a4980e6e2fcb60986b3abfbf
Commit SHA-1     : b17395d33
Commit timestamp : 2021-08-26 01:11:38 -0700
Branch           : HEAD
Go version       : go1.16.2
jemalloc enabled : true

Have you tried reproducing the issue with the latest release?

Yes

What is the hardware spec (RAM, OS)?

MacOS / Docker / 16GB (8GB allocated)

Steps to reproduce the issue (command/config used to run Dgraph).

This is the schema I’m pushing to my dgraph instance:
index.graphqls (2.2 KB)

This is the JSON file I’m live loading.
out.json (98.7 KB)

This is the command I’m using: dgraph live --files /path_to/out.json --upsertPredicate "xid"

Expected behaviour and actual result.

Buildings with the same xid value should not be duplicated, but they are.

martaver · September 7, 2021, 12:49pm

It’s entirely possible that I’m missing something obvious here, in that case please point me in the right direction

MichelDiz · September 7, 2021, 2:03pm

Duplicate (or rename it) the XID value in the dataset to the key “uid”. In fact, you don’t need the XID field during the live load. This field is necessary for the dataset already in the DB only. For the case you are loading new data, you have to use UID instead of XID.

The upsert feature in live load takes UID in the dataset into consideration instead of XID. And when it loads it, it creates the XID field to compare with the new data during the live load.

BTW, the UID key have to be a blank node. It will throw an error if you don’t prefix it with _:

MattH · March 18, 2022, 6:05pm

@martaver Curious if you ever got this working? I’m encountering the same issue with Live Loader using
a graphql schema and json formatted data.

I’m trying to follow the advice of MichelDiz above who suggested mapping the XID value to the key “uid” but not having any luck.

If you got it working, could you please post what your schema and .json file looked like?

Thanks!

Topic		Replies	Views
Duplicate nodes with Live Loader and upsertPredicate Dgraph	9	858	June 15, 2023
Live Import Upsert using xid creates duplicates Dgraph	0	207	November 23, 2023
Duplicate Nodes while using live loader Dgraph dgraph	1	393	November 12, 2020
How to merge nodes or avoid Duplicate nodes in Dgraph live loading? Dgraph	5	401	July 29, 2021
Live Loader came up with a lot of aborts Dgraph faq	13	1294	June 24, 2020