Production Architecture and Infrastructure Requirements
This is a set of requirements agreed upon by the Communities Software Development Team and the Enterprise Cloud Management team for products moving to production.
Definition of Done
Development Team responsibilities
Code-level monitoring: add logging, monitoring, and error handling within the code so that issues are captured at the application level
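As an illustration of the code-level logging and error handling described above, here is a minimal Python sketch. The logger name, the `UpstreamError` class, and the `fetch_profile` function are hypothetical and are not taken from any existing product code; the intent is only to show an error being logged with its traceback before it is re-raised.

```python
import logging

# Hypothetical logger name; a real product would choose its own.
logger = logging.getLogger("communities.example")


class UpstreamError(Exception):
    """Hypothetical error raised when a dependency call fails."""


def fetch_profile(user_id, fetch):
    """Call `fetch` (a hypothetical dependency client) and log the outcome."""
    try:
        profile = fetch(user_id)
    except Exception as exc:
        # logger.exception logs at ERROR level and appends the traceback.
        logger.exception("profile fetch failed for user_id=%s", user_id)
        raise UpstreamError(f"could not load profile {user_id}") from exc
    logger.info("profile fetch succeeded for user_id=%s", user_id)
    return profile
```

Log records emitted this way can then be collected by whichever monitoring or logging system ECM operates.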
ECM responsibilities
Monitoring/logging/tracing systems
New Relic
Datadog
Pingdom
Azure built-in monitoring/logging
Goal: make logs accessible to developers where appropriate and feasible
InfoSec requirements
@Surya Bhatt to look into any checklist or requirements that might already exist
Shared responsibilities
Health check pages
Displays an up/down status for all services
Built and maintained by the Development Team; used by both ECM and the Development Team for monitoring
Hooks into Pingdom
Includes checks for production and the lower environments (staging + sandbox)
Work together to ensure third-party solutions meet all technology needs
e.g. they should support a highly available configuration
Work together to establish reasonable, controlled production access for developers, covering logs and low-level interactions
Defined and agreed-upon code deployment process based on the product’s needs
Defined SLAs and response plan based on the product’s needs
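The health check pages listed under shared responsibilities could be backed by logic along these lines. This is a sketch under stated assumptions, not an agreed design: the probe registry, the function names, and the JSON shape are all hypothetical. It reports an up/down status per service and returns an HTTP status code that an external uptime monitor such as Pingdom could alert on.

```python
import json
from typing import Callable, Dict, Tuple

# Hypothetical probe type: a zero-argument callable returning True when healthy.
Probe = Callable[[], bool]


def check_services(probes: Dict[str, Probe]) -> Dict[str, str]:
    """Run each probe and report "up" or "down"; a probe that raises counts as down."""
    statuses = {}
    for name, probe in probes.items():
        try:
            statuses[name] = "up" if probe() else "down"
        except Exception:
            statuses[name] = "down"
    return statuses


def render_health(probes: Dict[str, Probe]) -> Tuple[int, str]:
    """Return (HTTP status code, JSON body) for a health check endpoint."""
    statuses = check_services(probes)
    all_up = all(state == "up" for state in statuses.values())
    body = json.dumps({"status": "up" if all_up else "down", "services": statuses})
    # 200 when every service is up, 503 otherwise.
    return (200 if all_up else 503), body
```

The same function could be deployed in production, staging, and sandbox with a different probe registry per environment, so one page format serves all three.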