Tim's Blog: "Zero-downtime website rewrite migrations"
New blog post available on timwise.co.uk
Zero-downtime website rewrite migrations
https://timwise.co.uk/2023/05/22/zero-downtime-website-rewrite-migrations/
As we know, big-bang rewrites are incredibly dangerous. They are likely to fail or be rejected. For example, the new thing never reaches “feature parity” and gets rejected in favour of the system you were trying to replace.
But how do you practically avoid it? If it’s one website on one domain then that seems irreducible. Right?
But there is a way.
A change looms
You need to change the technology for your website for some reason.
Perhaps your tech debt is out of control and you basically need to do a complete rewrite from scratch, perhaps you’ve decided to switch technology.
Perhaps you have decided to move everything in your company to a different tech stack.
Either way you are making a BIG change to your technology. Out with the old, in with the new.
A C# to Rails migration at DfE
I was lucky enough to be involved in a technology shift at the Department for Education (UK) with some exceptionally talented people. I’d been part of a team creating an MVP (Minimal Viable Product) in C# with a couple of web front ends and a postgres database (yay no sql server!). The department then decided to shift wholesale to Ruby on Rails (for good reasons, but they are not important here).
The approach taken was superb, went really well, and is well worth learning from, I’ll outline it here for the benefit of all.
How to seamlessly migrate
What’s unsurprising here is that you spin up a new web service in your new technology, or with your new way of doing things.
There are two simple key elements that allow a seamless piece-by-piece migration:
- HTTP redirects in both directions
- Shared state
These two allow you to build a new service, one tiny piece of functionality at a time, and immediately put them into production, retiring the legacy version of that feature immediately.
Bi-directional HTTP redirects
It goes as follow:
- Chose one feature to build in the new technology
- Spin up the new service on a similar subdomain (e.g. “
www2.example.org
” next to the legacy “www.example.org
”) - In the new service, redirect all routes that are not yet implemented to the legacy domain
- In the legacy service redirect the single route for the chosen feature to the new service.
Now users will seamlessly transition in and out of the new service as they click around the site.
This is genius and I wish I could say I thought of it myself!
Repeat until all desired features are on the new service and the legacy service can be turned off.
Shared state
It’s obviously important that if a user does something on the new system that it is immediately reflected in the state on the legacy system. For us this meant shared access to the postgresql database. The exact method probably isn’t that important so long as you can get it to be seamless.
Whatever your state store is, it’s important to “freeze” schema changes while you do the transition (within reason) so that you avoid the overhead of updating both system’s understanding of the storage schema.
No more legacy
This will go smoother if you observe the cardinal rule of rewrites: no new functionality can be added to the legacy system. Any changes and fixes must be postponed or made on the new service.
This is to avoid the trap where the old system keeps being improved and the new system can never catch up. It also prevents your time and resources being invested in something that you plan to delete.
The only exception to this is whatever changes are needed to support the seamless user experience across the two services. Notably the redirections.
Monitoring
As you migrate your tech, keep an eye on your telemetry, error reporting and anything else you have to make sure your user experience remains good throughout.
The end
There’s not much more to say on this. It’s a simple concept that packs a huge punch. If you’re thinking of modernising a web stack, or changing tech, I would 100% recommend this approach. The risks of failure are so much lower.
Many thanks to those who I worked with showed the way.
This an automated email for you, my email subscribers, based on the blog RSS feed, sent in addition to my usual newsletters. Hit reply and let me know what you think of it!
Full blog post below (because it can’t be just me that hates being forced to go to a website to read something that could have just been in the email!)