I’m trying to load initial data into dgraph via the bulk loader and afterwards add new/modify existing nodes with the live loader. The problem is that if the live loader uploads nodes, which were uploaded by the bulk loader before, it creates duplicate nodes with new uids. I don’t want duplicate nodes. I need those new nodes either not to be loaded at all (if they bring no new edges for bulk loaded nodes) or to modify already existing nodes (if those new nodes do bring new edges for old nodes).
If I use only live loader, then it is trivial: I just add -x dirname to all my dgraph live commands and I get a xid directory named dirname, so I don’t get duplicate nodes with new uids from the following live loader data. The issue with bulk loader is that -x option does not create a folder for xids. Therefore, when I live load nodes with the uids, which were uploaded by the bulk loader before, I get duplicate nodes with new uids.
How do I prevent duplicate nodes when I use bulk load first and live load afterwards?