I’ve got a Dgraph cluster, consisting of a single Alpha and a single Zero, deployed in ECS. It has been running happily for the past couple of months. We recently updated the schema, which is now significantly larger than it was.
The Alpha node just won’t start any more. The significant line in the logs is:
2021/02/05 15:53:12 Buffer length: 285038378 greater than file size: 14093. Manifest file might be corrupted
The Alpha has 16384 MB of memory and 4096 CPU units (4 vCPU). It is started with:
["dgraph","alpha","--my=alpha.develop.dgraph.imaging:7080","--zero=zero.develop.dgraph.imaging:5080","--lru_mb=5460","--whitelist=10.250.0.0:10.250.2.254"]
Can anyone help? Let me know if you need any more information.