Automating Customer Lifetime Value Analysis in the Financial Sector

A financial data analytics firm sought to modernize and automate their approach to Customer Lifetime Value (CLV) analysis. Previously, this process was handled manually by their Managing Director of Customer Growth, involving extensive use of spreadsheets, manual data cleaning, and repeated steps for each client engagement. The primary objective was to develop a robust, scalable engine capable of ingesting transaction data from various third-party sources (such as e-commerce platforms), automate the CLV modeling process, and produce actionable insights through dashboards — all with minimal manual intervention.

5/8/20242 min read

Sector: Finance
Duration: May 2024 – January 2025

Work Delivered

  • End-to-End Databricks Integration: Designed and implemented an automated data workflow using Databricks, encompassing ingestion, transformation, modeling, and dashboarding.

  • AWS Infrastructure Setup: Established a secure, scalable Databricks workspace within AWS, aligned with the client’s cloud strategy.

  • Automated Pipelines: Created automated data pipelines to handle new client data, reducing manual input and turnaround time.

  • Machine Learning Model: Built and integrated custom CLV models capable of handling varying transaction behaviors across clients.

  • Airflow Orchestration: Deployed Apache Airflow to orchestrate and monitor the full data lifecycle, from ingestion to dashboard publication.

People

  • Managing Director, Customer GrowthLinkedIn Profile
    Sponsored the project and served as the primary subject matter expert. Although not technical, she played a key role in guiding the CLV strategy and outcomes.

  • Product Manager – Acted as the consistent point of contact, coordinating feedback and approvals.

  • Support Engineers – Occasionally assisted with AWS access and environment setup.

Pressure

  • Reliance on Excel made processes tedious and error-prone

  • Manual steps led to inconsistent results and delays

  • No data lineage to track how insights were generated

  • Each client required a full rework of the same manual process

  • Delivering value to clients was slow and inefficient

Impact of Automation:

  • Enabled faster iterations and reduced turnaround time

  • Significantly decreased human errors

  • Created a consistent, reliable, and traceable system

Program/Project

  • The project was driven by the client's product manager, who coordinated feedback and approvals.

  • Technical decision-making, architecture, and implementation were fully owned by the consulting team.

  • The client focused primarily on the output (CLV insights and reports), not on the underlying technology stack.

  • Minimal technical support was needed from the client’s side, other than provisioning access and clarifying domain concepts.

Problems and Pains (Pre-Project)

  • Slow delivery of insights to clients

  • Frequent data quality and cleansing issues

  • Lack of centralized data management and history

  • No automation, causing repetitive and time-consuming processes

Quantified Impact of Pre-Project Pain Points:

  • Client delivery time dropped from months to days

  • Data cleaning time reduced from days to a few hours

  • Manual data logs replaced by a centralized, queryable data repository

  • Laid the groundwork for data governance and compliance

Promises

  • Implemented an error-free, fully automated process from data ingestion to dashboard delivery

  • Created a reusable and modular system that can easily be scaled to serve new clients

Problems and Pains (In-project)

  • Understanding the CLV model and its business logic was challenging

  • Translating domain-specific finance insights into technical implementations required extensive collaboration

  • The main sponsor, while highly knowledgeable, was not technical, which extended the onboarding of domain logic

How These Were Overcome:

  • Conducted structured working sessions at each phase to align business understanding with technical implementation

  • Developed prototypes early and iterated with feedback

  • Created documentation and visual guides to bridge knowledge gaps

Payoffs

For Stakeholders:

  • Reduced client onboarding time from 2-3 months to just 3 weeks

  • Gained access to centralized historical data, enabling long-term value tracking and future AI integration

  • Simplified client management by standardizing the delivery process

For the Company:

  • Increased client satisfaction and confidence due to timely and consistent delivery

  • Enhanced ability to scale operations and take on more clients without increasing internal workload

  • Positioned the company for future data-driven innovation