Better upgrade guide

artooro · September 26, 2018, 12:56am

I just finished upgrading from Dgraph 1.0.5 to 1.0.8. And I’m hoping the documentation on upgrading can be improved. Here’s what I did this time.

First I made sure I did an export by calling /admin/export. Then I updated the docker image for zero to 1.0.6 and let it reboot. And then updated docker image for server to 1.0.6 and let it reboot.
The result was zero would no longer start and crashed with “Assert failed”

So I updated all the images straight to 1.0.8, deleted the data directories from each of the 4 pod volumes by hand (3 server and 1 zero)
Rebooted them all and used live loader to import the backup.

This is not a small amount of down-time. How can upgrades be improved in the future so we have minimal down time of the database?

dmai · September 26, 2018, 5:47pm

The underlying data format in Dgraph can change between release versions, which is why the docs a data export and importing the data into a new cluster running the updated version.

A way to minimize downtime when upgrading is to do a blue-green deployment for upgrades. That is, keep the original cluster online for clients while setting up a new Dgraph cluster. Once the upgraded cluster is set up, you can redirect clients over to the new cluster and then take down the previous version.

artooro · October 5, 2018, 5:27pm

Can I just clarify how you would do this. As the new deployment would need to have the data replicated to it. Are you saying you’d do an export/import to the new deployment, or is it possible to do a live replication from the existing deployment?
For eg. would it look like this?

Deploy new dgraph servers that are pointed to the existing zero
Once data is replicated point the servers to an upgraded zero
Clients start connecting to the new servers

Or is there no way to do live replication?

dmai · October 5, 2018, 5:43pm

Yes, an export/import. Live replication is not currently a feature.

Don’t connect servers and zeros running different versions. Everything should be the same version within a cluster.

robregonm · April 4, 2019, 2:15am

So, this means that there’s no way to upgrade versions without downtime?
It would be nice to be able to upgrade without a downtime or to be able to upgrade node by node or to have async replication when adding a new node with an upgraded version.
I’ve not found related documentation.
Thoughts?

dmai · April 4, 2019, 6:59pm

Hey @robregonm. There can be cases where specific changes between versions would allow a rolling update that would maintain availability of the cluster in an HA setup. But this is not usually the case in general.

As mentioned in this thread, a blue-green deployment would be one way to perform an upgrade. The original cluster could also be set to read-only mode so both clusters would contain the same data and then read/write traffic can be directed to the new cluster.

santo · January 22, 2020, 10:54am

Hi all,

I have created a FR in GitHub to propose an improvement of the current upgrade process. Please feel free to subscribe there / comment / vote:

Improve Dgraph upgrade experience by supporting in-place & rolling upgrades · Issue #4644 · dgraph-io/dgraph · GitHub

Many Thanks,

Topic		Replies	Views
Database migrations / Dgraph update GraphQL kind:question	1	678	May 3, 2021
Upgrade to a production dgraph cluster Dgraph	2	426	June 23, 2020
Feature request: Zero-downtime upgrades Dgraph dgraph , status:accepted , kind:feature , ticket:created	2	744	April 29, 2021
Mixing Dgraph versions in a cluster while upgrading Dgraph dgraph	7	746	April 29, 2021
My present version is 1.0.15 need to upgarde it to latest, can u help me out Users kind:question	51	1887	January 27, 2021

Better upgrade guide

Related topics