Add support for json-lines in bulk loader

diggy · August 15, 2019, 1:20pm

Moved from GitHub dgraph/3819

Posted by dmsolow:

Json-lines (http://jsonlines.org/) is a commonly used format for storing a large number of JSON objects in a file. It’s better than a single JSON array of objects because it makes it easy to read a file object by object without loading the entire thing into memory.

Popular big data processing frameworks like Apache Spark write JSON-lines natively (df.write.json("out.json") writes a JSON-lines file for each partition)

Support would probably be trivial to add for Dgraph and it would help people easily integrate Dgraph into existing ETL workflows.

diggy · August 15, 2019, 9:38pm

martinmr commented :

It doesn’t seem like it would be too difficult to implement. We are gearing up for the 1.1 release so this issue most likely sit in the back-burner for a little while. If anybody else is interested in this feature, give a thumbs up to the issue so we can gauge interest and prioritize accordingly.

Topic		Replies	Views
How to batch import json data Dgraph Clients	1	565	August 11, 2020
How to import json file Dgraph kind:question	1	486	August 21, 2020
Batch insertion in dgraph Dgraph mutation	3	1386	November 19, 2019
How to generate a proper json file using scala spark? Dgraph	0	438	July 23, 2021
Not all of JSON file consumed Dgraph dataset , kind:bug , area:bulk-loader , area:import-export , area:live-loader	2	674	March 29, 2021

Add support for json-lines in bulk loader

Related topics