Illustration Image

One of the largest digital transformations to deliver real-time analytics backed by Cassandra

Erick Ramirez on April 27, 2023

One of the largest digital transformations to deliver real-time analytics backed by Cassandra

Highlights

Needing to mine data in multiple warehouses and legacy systems, Capital One pulled off one of the largest digital transformations by migrating to Cassandra to achieve near real-time analytics.

450M rows consolidated from legacy systems

21K transactions per second


Best known for its credit cards and Samuel L. Jackson-hosted “What’s in your wallet?” commercials, Capital One is a diversified bank that offers a broad array of financial products and services to consumers, small businesses, and commercial clients. A Fortune 500 company, Capital One is one of the most recognized brands in the United States.

Challenge

A financial industry leader, Capital One’s applications generate huge amounts of data. Mining of which has become of paramount importance for making critical business decisions. Capital One was storing this data in multiple warehouses and databases and using traditional batch analysis for decision making. They needed to quickly process and analyze their data to make faster decisions in near real-time. Capital One sought quickly to build a new, real-time reporting and analytics platform. “It’s a challenge that the business is going through from thinking about batch analysis to a near-real-time platform,” said Javed Roshan, Director of Data Services at Capital One.

Solution

The stringent service level agreement (SLA) requirement and the need to quickly process large amounts of data made Apache Cassandra a natural choice for Capital One. Capital One attained a distributed environment out of the box with multi-datacenter replication. This environment is also extremely flexible and able to handle incredibly fast writes, which Capital One required, and has tunable consistency for different types of queries. Furthermore, reliable scalability makes it easy to grow data clusters and support new features.

Results

The new platform kick-started Capital One’s big data intelligence by quickly and easily incorporating a year’s worth of data – 450 million rows – from a legacy Oracle database. Consolidating this into far fewer rows (11.5 million), all handled on a six-node cluster at the lightning-fast rate of 21,000 transactions per second and meeting Capital One’s 99.99% uptime requirement. “We pulled in one year of data from Oracle and once we got it into Cassandra, it was a smooth ride and it was processing at a very high rate,” said Mukram Aziz, Senior Manager of Data Services at Capital One.

Become part of our
growing community!
Welcome to Planet Cassandra, a community for Apache Cassandra®! We're a passionate and dedicated group of users, developers, and enthusiasts who are working together to make Cassandra the best it can be. Whether you're just getting started with Cassandra or you're an experienced user, there's a place for you in our community.
A dinosaur
Planet Cassandra is a service for the Apache Cassandra® user community to share with each other. From tutorials and guides, to discussions and updates, we're here to help you get the most out of Cassandra. Connect with us and become part of our growing community today.
© 2009-2023 The Apache Software Foundation under the terms of the Apache License 2.0. Apache, the Apache feather logo, Apache Cassandra, Cassandra, and the Cassandra logo, are either registered trademarks or trademarks of The Apache Software Foundation.

Get Involved with Planet Cassandra!

We believe that the power of the Planet Cassandra community lies in the contributions of its members. Do you have content, articles, videos, or use cases you want to share with the world?