Xylem introduced Red Hat Ansible Automation Platform into its private datacenter. This unified platform brings security features, integration, and flexibility into one place to automate and orchestrate workflows at scale.
“Automation was new to most of our datacenter teams. The presentation layer in Ansible Automation Platform was really helpful to make them feel more comfortable,” said Johnson. “People worry about triggering a playbook when they don’t know who wrote it or what it will do. I made diagrams to visualize the workflow, and the platform’s audit trail gives them clear visibility of every stage of the automation.”
Xylem used Ansible Automation Platform and the Event-Driven Ansible feature, which delivers advanced event-handling capabilities, to automate security scans. These are triggered when a user opens a Jira ticket. Event-Driven Ansible receives the payload via a webhook and initiates an API call to Ansible Automation Platform. This executes the next steps in the process and generates notifications to tell users when they have an action, and to update them when a scan is taking place.
While users have visibility of the process and remain in control, time-consuming tasks are automated, which speeds up the process considerably. Xylem also built safeguards into the playbook to prevent issues such as someone accidentally triggering too many scans in one go. “Certain conditions have to be met in order to trigger the next step,” said Johnson.
If the scan finds vulnerabilities, Xylem’s security team is notified but only the engineer to whom the ticket is assigned can trigger a rescan, which eliminates the risk of unauthorized actions. When issues have been fixed, simply adding a comment to the ticket enables the virtual machine to rescan.
This is just one example of automation at Xylem, which also spent six months working with the storage team to automate more than 30 manual tasks. “The storage team had the least amount of experience with automation and the biggest need for it,” said Johnson. “They had a lot of process documentation, which helped me to understand where we could automate things and move quickly.”
For example, the backup team can now pull automated reports on backups. The playbook took just 6 hours to write and will save 1 hour per week for its main report, while automated health-check reports are now delivered daily – and every 4 hours if an error is found. This saves around 2 hours a day compared to the previous manual process.
Automations are also in place to manage virtual machines, from provisioning and storage requirements to networking and ongoing maintenance. Xylem is currently working on auto-remediation for common alerts such as monitoring virtual machines, CPU, and disc alerts.