Googler but nowhere near Gmail, so just educated speculation:
* We have a lot of automation/tools to prevent incidents when mitigation is straightforward (e.g. roll back a bad flag, quarantine unusual traffic patterns; a toy sketch of that kind of auto-rollback follows this list), which means that when something does go wrong it's often a new failure mode that needs custom, specialized mitigation. (e.g. what if you're in a situation where rolling back could make the problem worse? We might be Google, but we don't have magic wands.)
* Debugging new failure modes is a coin flip: maybe your existing tools are sufficient to understand what's happening, but if they're not, getting that visibility can itself be difficult. And just as it does for everyone else, this can become a trial-and-error process: we find a plausible root cause, design and execute a mitigation based on that understanding, and then get more information that makes it very clear that our hypothesis was incomplete (in the worst case, blatantly wrong).
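To make the "straightforward mitigation" case concrete, here is a minimal, hypothetical sketch of an automated flag rollback: flip a flag, watch an error-rate signal, and revert automatically if things regress. `FlagStore`, `error_rate`, and `flip_with_auto_rollback` are made-up names for illustration, not any real Google (or other) system.

```python
import time


class FlagStore:
    """Toy in-memory flag store standing in for a real config service."""

    def __init__(self):
        self.flags = {}

    def set(self, name, value):
        self.flags[name] = value

    def get(self, name, default=False):
        return self.flags.get(name, default)


def error_rate():
    """Stub for a monitoring query (e.g. 5xx fraction over the last minute)."""
    return 0.02  # pretend everything is healthy


def flip_with_auto_rollback(store, flag, baseline, threshold=2.0,
                            watch_seconds=300, poll_seconds=30):
    """Enable `flag`, then revert it if errors exceed threshold * baseline."""
    store.set(flag, True)
    deadline = time.time() + watch_seconds
    while time.time() < deadline:
        if error_rate() > threshold * baseline:
            store.set(flag, False)  # the straightforward mitigation: roll it back
            return "rolled back"
        time.sleep(poll_seconds)
    return "kept"


if __name__ == "__main__":
    store = FlagStore()
    print(flip_with_auto_rollback(store, "new_storage_path", baseline=0.01,
                                  watch_seconds=1, poll_seconds=1))
```

The interesting incidents are exactly the ones this kind of guard can't handle, e.g. when flipping the flag back would make things worse.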
> We have a lot of automation/tools to prevent incidents when mitigation is straightforward (e.g. roll back a bad flag, quarantine unusual traffic patterns), which means that when something does go wrong it's often a new failure mode that needs custom, specialized mitigation.
As Douglas Adams says, "The major difference between a thing that might go wrong and a thing that cannot possibly go wrong is that when a thing that cannot possibly go wrong goes wrong it usually turns out to be impossible to get at or repair."
Rollback-proof bugs are rare, but boy howdy are they exciting. I think I've only seen one so far (unless you count bad data / bad state that persists after a bad change is rolled back... which can also be pretty exciting).
You can build rollbacks out of rollforwards, although it certainly isn't fun: you take version N's code, patch its version code so it's higher than N+1's, and roll it out as an "N+2" that is really N.
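A minimal sketch of that trick, assuming an update channel that only accepts monotonically increasing version codes (as with Android's versionCode). `Release` and `cut_rollforward_rollback` are hypothetical names, not any real build system's API:

```python
from dataclasses import dataclass, replace


@dataclass(frozen=True)
class Release:
    version_code: int   # what the update mechanism compares
    label: str          # what humans see, e.g. "N+2"
    source_rev: str     # the code actually being shipped


def cut_rollforward_rollback(good: Release, bad: Release) -> Release:
    """Repackage the known-good code (N) with a version code above the bad
    release (N+1), so clients will 'upgrade' onto the old behaviour."""
    assert bad.version_code > good.version_code
    return replace(
        good,
        version_code=bad.version_code + 1,   # higher than the bad N+1
        label=f"{bad.label} rollback",
    )


if __name__ == "__main__":
    n = Release(version_code=41, label="N", source_rev="abc123")
    n_plus_1 = Release(version_code=42, label="N+1", source_rev="def456")
    print(cut_rollforward_rollback(n, n_plus_1))
    # Release(version_code=43, label='N+1 rollback', source_rev='abc123')
```

The cost is that the "rollback" is now itself a new release, with all the release-process overhead that implies.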