Key Benefits


  • Application-consistent point-in-time backups
  • Scale-out software to meet backup and recovery requirements (RPOs and RTOs)
  • Built for scale with large clusters (hundreds of nodes)

Automated and Orchestrated Recovery in Minutes

  • Single-click, fully orchestrated and granular recovery (column family level)
  • Fast recovery with a single-copy transfer and no repairs upon recovery
  • Flexible recovery to the same or different sized database clusters

Massive Operational Efficiency

  • Elastic, scale-out, software-only architecture provides failure resiliency
  • 70%+ reduction in backup storage costs (deduplication and compaction savings)
  • Automated refresh of test/dev environments
  • Ease of deployment with an API-based architecture and native UI
  • Support for multiple enterprise data sources, all with a single protection platform
  • Enterprise policy management capabilities for ease of day-to-day operations

The Challenge

In response to digital transformation requirements and associated explosive data growth, organizations are increasingly turning to scalable, distributed databases such as Apache Cassandra / DataStax to build next generation customer-centric applications.

These applications require massive scalability and ongoing availability without sacrificing speed, which creates critical data protection challenges.

Data Protection Use Cases

Customers building applications on Apache Cassandra databases look for the following data protection use cases and associated features:

  • Backup and Recovery (column family-level and/or keyspace-level)
  • Non-operational recovery use cases such as migrations and test/dev
  • Support for private cloud deployments both physical and virtual
  • Support for public cloud deployments including compute and storage services on AWS and GCP
  • Policy-based management for day-to-day operational tasks

The Solution: RecoverX

Datos IO RecoverX is a scale-out, elastic, software-only data protection product that delivers scalable and fully featured point-in-time backup and automated recovery for next-generation applications built on Cassandra databases. Datos IO RecoverX semantic deduplication and ease of deployment lead to 70%+ TCO savings for customers when compared with native solutions or any other tools.

Features and Benefits

Datos IO RecoverX is built to address the unique data protection requirements of database administrators (DBAs) and application developers building applications on Cassandra.

Scale-Out Data Management Architecture

Datos IO RecoverX is founded upon Consistent Orchestrated Distributed Recovery (CODR), a cloud-first, scale-out data management architecture. CODR uses elastic compute services that can be autoscaled with application load, and it removes any dependency on media servers for data movement from database clusters to backup storage.


Flexible Granular Backup

RecoverX creates an application-consistent point-in-time backup copy of Apache Cassandra column families at user-specified intervals across the entire cluster without quiescing the Cassandra database. Backups can be generated as frequently as every 30 minutes and at any granularity (column family or entire keyspace), providing operational ease of use to database administrators.

Fully Orchestrated and Granular Recovery

RecoverX provides fully orchestrated recovery to a target Cassandra database at column family-level granularity. Data can be recovered directly to the same database cluster for operational recovery or recovered to a different database cluster for the test/dev refresh use case.

Most importantly, since a single copy of data (after semantic deduplication) is transferred and inserted into the target Cassandra database, the recovery is at least three times faster than any other recovery approach. Furthermore, no database repairs are required after recovery, resulting in a significant reduction in application downtime.
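To illustrate why a single deduplicated copy is cheaper to transfer than every replica, here is a minimal sketch. It is not the RecoverX implementation. It assumes a replication factor of three and a simplified cell layout of (row key, column, write timestamp, value); the function name `merge_replicas` and the sample data are hypothetical. The sketch keeps the newest write per row key and column, which mirrors how Cassandra reconciles replicas by write timestamp.

```python
from typing import Dict, Iterable, List, Tuple

# Hypothetical cell layout for illustration: (row_key, column, write_timestamp, value)
Cell = Tuple[str, str, int, str]


def merge_replicas(replicas: Iterable[List[Cell]]) -> List[Cell]:
    """Collapse replica copies into one logical copy.

    For every (row_key, column) pair, only the cell with the highest
    write timestamp is kept, so each value is stored and moved once.
    """
    newest: Dict[Tuple[str, str], Cell] = {}
    for replica in replicas:
        for cell in replica:
            key = (cell[0], cell[1])
            if key not in newest or cell[2] > newest[key][2]:
                newest[key] = cell
    return sorted(newest.values())


# Three replicas of the same data; replica_b holds a newer write for user1.
replica_a = [("user1", "email", 10, "a@x.com"), ("user2", "email", 5, "b@x.com")]
replica_b = [("user1", "email", 12, "a@y.com"), ("user2", "email", 5, "b@x.com")]
replica_c = [("user1", "email", 10, "a@x.com")]

single_copy = merge_replicas([replica_a, replica_b, replica_c])
total_cells = len(replica_a) + len(replica_b) + len(replica_c)

print(f"cells across replicas: {total_cells}, cells after merge: {len(single_copy)}")
```

In this toy data set, five replica cells collapse to two logical cells, so only two cells would need to be transferred and inserted on recovery.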


Query-able Recovery

This industry-first feature allows maximum flexibility for recovery. With it, DevOps teams or database administrators (DBAs) can recover exactly the columns and rows they need from a column family without recovering the entire column family, which improves the RTO (speed of recovery). DevOps teams can also use this feature to refresh test/dev environments with only the data required for testing or development, without overprovisioning test/dev systems.
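The idea of recovering only selected rows and columns can be sketched as a filter plus a projection over backed-up rows. This is an illustrative sketch only and does not show the RecoverX interface: the function `select_for_recovery`, the row layout, and the sample data are all assumptions made for the example.

```python
from typing import Callable, Dict, List

# Hypothetical representation of a backed-up row: column name -> value
Row = Dict[str, object]


def select_for_recovery(
    rows: List[Row],
    columns: List[str],
    predicate: Callable[[Row], bool],
) -> List[Row]:
    """Return only the rows matching `predicate`, restricted to `columns`."""
    return [
        {name: row[name] for name in columns if name in row}
        for row in rows
        if predicate(row)
    ]


backup_rows = [
    {"id": 1, "region": "eu", "email": "a@x.com", "balance": 10},
    {"id": 2, "region": "us", "email": "b@x.com", "balance": 20},
    {"id": 3, "region": "eu", "email": "c@x.com", "balance": 30},
]

# Recover only the id and email columns for rows in the "eu" region.
subset = select_for_recovery(
    backup_rows,
    columns=["id", "email"],
    predicate=lambda row: row["region"] == "eu",
)
print(subset)
```

Because only the matching rows and requested columns are returned, the target test/dev system needs capacity for the subset rather than for the full column family.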


Any Cloud: Private Cloud, Hybrid Cloud, and Public Cloud

Datos IO RecoverX is a software-only product that can be deployed on a physical server or a virtual machine for private cloud deployments, or on any cloud-native compute instance (for example, Amazon EC2).


Policy Based Operational Management

With a powerful policy-based data protection engine, Datos IO RecoverX empowers database administrators (DBAs), DevOps, and IT operations teams to schedule backup policies at any granularity they need, change existing policies (for example, the retention period), pause and resume backup operations, and remove backup policies.
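The policy operations listed above (schedule, change retention, pause/resume, remove) can be modeled with a small data structure. The following is a hypothetical sketch for illustration, not the RecoverX policy engine or its API; the class names `BackupPolicy` and `PolicyStore` and all field names are assumptions.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional


@dataclass
class BackupPolicy:
    """Hypothetical policy record; field names are illustrative."""
    keyspace: str
    column_family: Optional[str]  # None means the entire keyspace
    interval_minutes: int
    retention_days: int
    paused: bool = False


class PolicyStore:
    """In-memory registry supporting the operations named in the text."""

    def __init__(self) -> None:
        self._policies: Dict[str, BackupPolicy] = {}

    def schedule(self, name: str, policy: BackupPolicy) -> None:
        if policy.interval_minutes <= 0 or policy.retention_days <= 0:
            raise ValueError("interval and retention must be positive")
        self._policies[name] = policy

    def set_retention(self, name: str, days: int) -> None:
        if days <= 0:
            raise ValueError("retention must be positive")
        self._policies[name].retention_days = days

    def pause(self, name: str) -> None:
        self._policies[name].paused = True

    def resume(self, name: str) -> None:
        self._policies[name].paused = False

    def remove(self, name: str) -> None:
        del self._policies[name]

    def active(self) -> List[str]:
        """Names of policies that are not paused, in sorted order."""
        return sorted(n for n, p in self._policies.items() if not p.paused)


store = PolicyStore()
store.schedule("orders-hourly", BackupPolicy("shop", "orders", 60, 30))
store.schedule("shop-daily", BackupPolicy("shop", None, 1440, 90))
store.set_retention("orders-hourly", 14)
store.pause("shop-daily")
print(store.active())
```

Keeping the column family optional lets one record type cover both granularities mentioned in this datasheet: a single column family or an entire keyspace.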


Deployment Efficiency

Simplification of deployment processes and infrastructure is a key advantage of Datos IO RecoverX. Most native solutions require provisioning and configuration of several servers, databases and storage repositories. RecoverX requires only a single virtual machine or cloud compute instance for deployment. We enable you to get up and running in less than an hour!


Database Fault Resilience

Failures are the norm in any kind of distributed software, and this is also true for non-relational databases such as Apache Cassandra. Datos IO RecoverX has significant built-in intelligence to ensure that such Apache Cassandra failures don’t have any impact on backup and recovery operations.


Native Integration with Apache Cassandra

Datos IO RecoverX operates at the cqlsh level and leverages well-defined APIs and other metadata to provide customers with a fully featured backup and recovery product. Datos IO RecoverX supports the latest Apache Cassandra and DataStax production releases, including Apache Cassandra v3.0+ and DSE 5.0+.

Ready to become a data hero?