All times UTC.

Summary of the Incident

Date Time (UTC) Event
Mar 27 4:06 PM API & dashboard are unavailable. We immediately identify an outage of our cloud provider Render (link). All of our databases are unavailable.
Mar 27 4:19 PM Render communicates on the outage. Render needs to sequentially recover all of their database fleet, which takes time.
Mar 27 6:28 PM Our Render databases recover. Incident resolved.

Root cause

Render databases were down for all regions.

It took a bit more than 2 hours for Render to mitigate their outage.

We could not leverage our database replica which was hosted on Render as well. We were powerless to mitigate the situation within a short timeframe.

Prevention measures

Render has been the cause of multiple outages since December.

We cannot accept to lose control of our reliability due to our cloud provider’s instability.

As a result, we will be progressively migrating off of Render to AWS or GCP within 6 months, with the option to accelerate if Render has additional outages.

We are deeply sorry for the inconvenience caused by this outage 🙏