Production Architecture and Infrastructure Requirements

This is a set of requirements agreed upon by the Communities Software Development Team and the Enterprise Cloud Management team for products moving to production.

  • Definition of Done

    • Development Team responsibilities

      • From a code-monitoring perspective – can add logging and monitoring and error handling within the code to capture issues here

    • ECM responsibilities

      • Monitoring/logging/tracing systems

        • New Relic

        • Data Dog

        • Pingdom

        • Azure built-in monitoring/logging

        • Would like to make logs accessible to devs as appropriate and possible

    • InfoSec requirements

      • @Surya Bhatt (Deactivated) to look into checklist or requirements that might exist

    • Shared responsibilities

      • Health check pages

        • Displays an up/down status for all services

        • Built and maintained by the development team, but utilized by ECM and Development team for monitoring

        • Hooks into Pingdom

        • Includes checks for production and the lower environments (staging + sandbox)

      • Need to work together to ensure third party solutions meet all technology needs

        • i.e. should be able to be made highly available

      • Work together to establish reasonable and controlled access to developers in production for logging and low-level interactions

      • Defined and agreed-upon code deployment process based on the product’s need

      • Defined SLAs and response plan based on the product’s need