Case studyBanking · ANZData · Multi-petabyte$50M+/yr saved

Enterprise data archival — $50M+ annual savings.

Tier-1 ANZ bank. 1,000+ legacy applications, multi-petabyte estate, strict regulatory controls. Cloud-native archival on AWS unlocked legacy decommissioning, reduced infrastructure overhead, and turned data archival from a constraint into a strategic enabler.

SectorBanking · Tier-1 ANZ
Scale1,000+ apps · multi-PB estate
My rolePlatform architecture lead
Engagement modelDiscovery + build + enablement
Discovery3 months across 1,000+ apps
PlatformAWS-native, custom-built

The challenge.

A Tier-1 ANZ bank with no enterprise-grade data archival solution. Every application team had a different approach — duplicated tooling, duplicated process, duplicated cost. Legacy systems could not be decommissioned because retention obligations had no centralised home. No standardised onboarding model; no consistent operational procedures.

The estate was multi-petabyte. The application count was 1,000+. The regulatory floor was strict. The opportunity — if a single platform could carry the load — was an order of magnitude beyond what any local solution could deliver.

The constraints.

The approach.

Discovery phase — three months.

Deep assessment across all 1,000+ applications. Evaluation of cloud-native services vs custom platform via decisioning trees built on data patterns, compliance requirements and cost. The honest output of discovery: a custom-built solution on AWS, not an off-the-shelf platform.

Platform design & build.

Phased rollout:

Automated lifecycle management (retention, expiration, archival, purge). Self-service retrieval via Active Directory groups. The retrieval experience was the platform’s adoption mechanism — if it didn’t work for application teams, the platform wouldn’t carry the load.

Enterprise enablement.

Complete documentation suite (design, SOPs, operations manuals). Security and compliance frameworks. Peer-review and QA processes. Built to be operated by the bank, not by the implementation team.

Technology stack
  • Serverless processing AWS Lambda
  • Data movement AWS DataSync
  • Storage Amazon S3
  • Metadata & indexing MongoDB Atlas
  • Orchestration AWS Step Functions
  • Data lineage Alation
  • Infrastructure as code Terraform
  • CI/CD Custom-built pipelines
  • Classification AI-driven metadata analysis

Outcomes.

$50M+
Annual cost savings
achieved
1,000+
Applications onboarded
to one platform
Multi‑PBestate
Structured + semi +
unstructured supported
Legacydecom'd
Retention satisfied;
systems retired

Financial: >$50M USD in annual cost savings; reduced infrastructure overhead across the estate.
Operational: Legacy applications became decommissionable; operational complexity dropped meaningfully.
Strategic: Enterprise-grade, cloud-native platform under full bank ownership with minimal ongoing maintenance. AI-enabled foundation for continuous archival. Accelerated the wider legacy-modernisation programme. Regulatory and security alignment achieved at platform level, not at application level.

Closing insight
Data archival stops being a constraint and becomes a strategic enabler the moment one platform carries the regulatory floor for the entire estate.

What I would do again on a similar engagement.

Originally published on Medium · read the original