Adding security to dgraph

mrjn · June 17, 2016, 12:57am

Hello,

As I work more and more with dgraph, I find that I really like it and am excited to use it in the project that is currently underway.

I'm not sure this qualifies as an issue; it's more of a security and access question. It could be treated as an issue, though, since it matters a great deal for deploying dgraph nodes across a LAN/WAN.

The one thing that just occurred to me is a question of security and injection attacks that could result from an open call like in the examples:

# Make Alice follow Bob, and give them names.
$ curl localhost:80/query -X POST -d $'mutation { set {<alice> <follows> <bob> . \n <alice> <name> "Alice" . \n <bob> <name> "Bob" . }}'

# Now run a query to find all the people Alice follows 2 levels deep. The query would only result in 1 connection, Alice to Bob.
$ curl localhost:80/query -X POST -d '{me(_xid_: alice) { name _xid_ follows { name _xid_ follows {name _xid_ } } }}'

# Make Bob follow Greg.
$ curl localhost:80/query -X POST -d $'mutation { set {<bob> <follows> <greg> . \n <greg> <name> "Greg" .}}'

# The same query as above would now show 2 connections, one from Alice to Bob, another from Bob to Greg.
$ curl localhost:80/query -X POST -d '{me(_xid_: alice) { name _xid_ follows { name _xid_ follows {name _xid_ } } }}'

I need a way to add some simple security, perhaps passwords, access control lists, or keys, to regulate who can read or write what on any shard in the graph database cluster.

With this in mind, can you please advise me on what might be achievable in this regard?

Any help would be greatly appreciated.

Cheers,
Lonnie

mrjn · June 17, 2016, 1:05am

Hey Lonnie,

We’ve thought about a feature like this – this is an advanced feature that I think would be relevant and useful once we’ve crossed v1.0. Until then, we have a lot of GraphQL features to bake into the database, to make it respond to the kind of queries people expect from a graph database.

If your Dgraph nodes are distributed across different networks, and you’re concerned about man in the middle attacks, we could provide an option to encrypt the communication between nodes using TLS. That wouldn’t give you ACL, but it would help specifically with man in the middle attacks.
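As an illustration of what TLS on the client side of such a link could look like, here is a minimal sketch using Python's standard `ssl` module. It assumes TLS is terminated by something in front of Dgraph (for example a reverse proxy), since the thread describes Dgraph's own TLS option only as a possibility. The helper name `make_client_context` and the `ca_file` parameter are hypothetical.

```python
import ssl


def make_client_context(ca_file=None):
    """Build a TLS client context that verifies the server certificate.

    ca_file is a hypothetical path to a private CA bundle; when it is None,
    the system's default trust store is used instead.
    """
    # create_default_context enables certificate verification and
    # hostname checking for client-side connections.
    ctx = ssl.create_default_context(cafile=ca_file)
    # Refuse protocol versions older than TLS 1.2.
    ctx.minimum_version = ssl.TLSVersion.TLSv1_2
    return ctx


ctx = make_client_context()
```

A context like this can be passed to `http.client.HTTPSConnection(host, context=ctx)` or `urllib.request.urlopen(url, context=ctx)` when sending queries over an untrusted network.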

You could potentially build ACLs on top of Dgraph: a security layer that sits right above it and interacts with your application. That might be one solution to this problem; a similar layer would typically be necessary for other databases as well, e.g. MySQL.
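A layer like that could start out very small. The sketch below is not part of Dgraph; it only shows the decision such a layer might make before forwarding a request body to the `/query` endpoint. The key table, the `Permissions` type, and the function names are all hypothetical, and the mutation check is a keyword heuristic on the request body, not a real parser.

```python
import re
from dataclasses import dataclass


@dataclass(frozen=True)
class Permissions:
    can_read: bool = False
    can_write: bool = False


# Hypothetical key table; a real deployment would load this from secure storage.
ACL = {
    "reader-key": Permissions(can_read=True),
    "writer-key": Permissions(can_read=True, can_write=True),
}

# Heuristic: any "mutation {" block anywhere in the body counts as a write.
# This errs on the side of denying; it does not parse the query language.
_MUTATION = re.compile(r"\bmutation\s*\{")


def is_mutation(body: str) -> bool:
    return _MUTATION.search(body) is not None


def authorize(api_key: str, body: str, acl=ACL) -> bool:
    """Return True if the caller identified by api_key may send this body."""
    perms = acl.get(api_key)
    if perms is None:
        return False  # unknown callers get nothing
    return perms.can_write if is_mutation(body) else perms.can_read


read_query = "{me(_xid_: alice) { name _xid_ follows { name _xid_ } }}"
write_query = 'mutation { set {<bob> <follows> <greg> . \n <greg> <name> "Greg" .}}'
```

The application (or a small proxy) would call `authorize` with the caller's key and the request body, forward the request to Dgraph only when it returns `True`, and answer with an HTTP 403 otherwise.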

Let me know how this sounds. I’m very curious to learn what usage you have, and if we can help you in any way. We can definitely expedite any planned features for v1.0 release – if you have any particular needs. Here’s a list of all planned features:

github.com/dgraph-io/dgraph

Product Roadmap

opened 04:39AM - 30 Nov 15 UTC

closed 06:51PM - 14 Jan 19 UTC

manishrjain

roadmap

- [x] Low Latency
- [x] High Throughput
- [x] RDF Parsing
- [x] ~[Rocks DB](http…://rocksdb.org/)~ Badger DB for persistence
- [x] ~Commit Logs~ (Replaced by RAFT logs)
- [x] Query Language - [GraphQL](http://facebook.github.io/graphql/)-like
- [x] Query
- [x] Root Arguments
- [x] Fields
- [x] Response in JSON
- [x] Field Arguments
- [x] Field Alias
- [x] Mutations #23 [v0.2]
- [x] Fragments #8
- [x] Variables
- [x] Type System
- [x] Scalar Types
- [x] ~Object Types~
- [x] Mutation validation for scalar types
- [x] String matching / Name search
- [x] Sort by attribute
- [x] Limit number of results #9
- [x] Filter
- [x] anyof
- [x] allof
- [x] eq (equal)
- [x] inequality (>=, <=, >, <)
- [x] Aggregate Functions
- [x] count
- [x] sum
- [x] max
- [x] min
- [x] Geospatial Queries
- [x] Nearby
- [x] Within
- [x] Contains
- [x] Intersects
- [x] Official Clients
- [x] Javascript
- [x] Go [v0.3]
- [x] Java
- [x] Distributed Transactions
- [x] Distributed #14 [v0.2]
- [x] Distributed Loader [v0.2]
- [x] Distributed Server [v0.2]
- [x] Clustering
- [x] Node discovery and membership via Dgraph Zero
- [x] High Availability
- [x] Raft
- [x] Automatic Data Replication
- [x] Automatic Failover for reads
- [x] Read linearizability
- [x] Resilience
- [x] Shard moves to handle server failure
- [x] Export
- [ ] Backup

---

### After v1.0 / Proprietary Plugins

- [ ] Multi-homing support
- [ ] Cypher Support
- [ ] Access Control Lists
- [ ] Query Graphical User Interface
- [ ] User authentication
- [ ] Cluster Management
- [ ] SPARQL [maybe]
- [ ] Tinkerpop Support [maybe]
- [x] Distributed transactions [maybe]

Thanks!
Cheers,
Manish

lonnietc · June 17, 2016, 12:07pm

Hi Manish,

Thanks for moving over the DGraph security discussion as this is a better place to talk about this idea.

At this time, I would rather discuss with you, off-forum, the specifics of the project as it is still in the molding state but is starting to form well based upon projects like DGraph.

As to the security concerns: so far I have only done basic testing against the single-instance dgraph example (I plan to set up a distributed multi-instance deployment across 2 physical servers for testing over the next couple of days). In that first setup I found that you can easily add information to the graph database with simple “curl” POST commands, which is great. But since this system is to be physically spread out, I need to be able to ensure that only specific users can read from or write to any of the nodes.

The TLS communications might be very useful to help protect against the man-in-the-middle attack as you have mentioned but I also wonder about latency with this approach.

An ACL layer might be very useful as well, even a simple one to get things moving along, since I can see needing this before a live system is ready.

Your current demo (https://dgraph.io/ - which is very nice by the way and is what I am going to replicate this weekend if I get time) is very functional. But given that curl POST can be used against it, I do not see how you can protect the demo data from being corrupted by someone sending information into the dgraph database, so some kind of basic protection seems warranted.

Anyway, maybe we can talk more about my specific project and how we can work together but I do see great potential for DGraph and its use in this way.

Cheers,
Lonnie

mrjn · June 18, 2016, 11:56am

We have a flag which can turn off mutations, so only read-only queries would be run on the server. We should have such a flag set for our demo server – though I’ll have to double check if that’s the case. Otherwise, you’re right. Someone can just send bad data and mess it up – it won’t corrupt the database per se, the data would just be wrong.

Feel free to send me an email if you want at manish@dgraph.io.

