The outage, which lasted from 5:45 am through 7:31am PST, was the first major outage for DoubleClick’s DoubleClick for Publishers (DFP) program.
Google’s Neal Mohan, VP, Display & Video Advertising and Scott Silver, VP, Engineering sent an email to publishers explaining the cascading issue that caused the loss of ad serving.
The details:
• The DFP ad server relies on an internal service that began degrading in performance. This caused a cascading failure on DFP ad servers, leading to the outage.
• We designed our systems to gracefully handle performance degradation from dependent services. However, due to a misconfiguration, we were unable to prevent the outage.
• To restore ad serving and prevent cascading failures, we restarted the services by provisioning additional resources.
• We reproduced the failure in a test by degrading the availability of the internal service, proving the misconfiguration caused the cascading failures. We have since rolled out a fix to the configuration globally.
• We are conducting a complete review of all our processes and production configurations to prevent this from happening again.
It is unknown if Google will offer any compensation for those websites and publishers that were affected by the outage.
Jennifer Slegg
Latest posts by Jennifer Slegg (see all)
- 2022 Update for Google Quality Rater Guidelines – Big YMYL Updates - August 1, 2022
- Google Quality Rater Guidelines: The Low Quality 2021 Update - October 19, 2021
- Rethinking Affiliate Sites With Google’s Product Review Update - April 23, 2021
- New Google Quality Rater Guidelines, Update Adds Emphasis on Needs Met - October 16, 2020
- Google Updates Experiment Statistics for Quality Raters - October 6, 2020