Updating disconnected systems with Red Hat Ansible Automation Platform reduces provisioning timescales from months to days without compromising stability
Delivering an uninterrupted, always-on banking experience for more than 28 million customers requires an infrastructure that is stable, consistent, and simple to update. However, TD’s vast network, which covers around 2,200 branches and campuses, was growing increasingly complex due to some disconnected automation tools, inconsistent configurations, and accumulated technical debt.
To standardize operations and manage its network efficiently, TD selected Red Hat Ansible Automation Platform. The new unified approach reduced technology provisioning and upgrade timescales from months to days. By automating manual, repetitive tasks, the bank gave its network team and engineers valuable time to focus on problem-solving and innovation, improving their work-life balance while maintaining service reliability.
Modernizing network operations to support uninterrupted banking experiences
With a vast network supporting over 1,300 branches and approximately 10,000 devices, TD faced a slow and complex change-management process. Tight maintenance windows left no room for error, while manual processes forced engineers to work around the clock, impacting their work-life balance.
For a bank where even a minor misconfiguration could disrupt branch services or customer transactions, the lean team needed a way to reduce management effort without compromising trust. The team selected Red Hat Ansible Automation Platform as the foundation for its network automation strategy due to its agentless architecture, vendor-agnostic flexibility, and ability to provide centralized visibility and control across the entire network.
Starting with a small pilot, the team methodically scaled its efforts, developing reusable playbooks, execution environments, and automated workflows to support provisioning, migrations, upgrades, and Infrastructure as Code deployments.
Increasing speed, consistency, and resilience through automation
Accelerated infrastructure updates and provisioning
With standardized, end-to-end automation, TD drastically increased speed and reliability across its infrastructure. The bank can now provision, migrate, and modify branches and campuses in days rather than months. Notably, one major infrastructure update project that previously would have taken over a year was completed in just 3.5 months. Device provisioning times plummeted from 5 hours to less than 1 hour.
Automated routine tasks for large-scale parallel changes
Routine tasks that previously required extensive engineering effort—such as port enablement, internet protocol updates, domain name system changes, documentation, and validation—are now fully automated. This capability allows the team to execute large-scale changes in parallel across the network, delivering new capabilities faster while maintaining the strict reliability required in a high-risk banking environment.
Improved efficiency and allowed proactive development
By removing the burden of repetitive overnight changes and manual documentation, engineers can now focus on innovation and higher-value problem-solving. Automation has now become essential to the network team’s daily operations. It has built more than 100 playbooks and generated 20,000 pieces of change documentation, creating a scalable framework that continuously improves with every iteration.
Expanding automation with AI and preparing for the future
To increase development speed without expanding headcount, TD’s small network automation team incorporated AI copilots and custom agents to handle repetitive coding, quality assurance, and documentation tasks. The team continues to work closely with Red Hat to support continuous improvement and prepare the bank for future self-service and AI-driven capabilities.
Timeline
- 2022: initial implementation of the Red Hat Ansible Automation Platform to start the network automation journey.
- Pilot phase: launched a small-scale pilot, starting with a small number of branches per night before successfully scaling up.
- Scaling and production: scaled to 50-60 locations a night, executing a major infrastructure update across over 1,300 branches and completing the project in a compressed 3.5-month period.
Key outcomes