There is a major ongoing disruption in the IT application stack, driven by enterprises adopting cloud computing as a mainstream IT model and developing hyper-scale, always-on applications. In simple terms, enterprises are redesigning their legacy applications for cloud and distributed architectures, and writing all new applications with cloud-first principles in mind. Needless to say, legacy data protection and management products are unable to cope. The proof is in the tremendous success Datos IO achieved with RecoverX 1.0, a cloud-first product built to meet enterprise needs in the multi-cloud world. Our enterprise customers, some of whom have been using RecoverX for over a year now, have given us excellent feedback on the product, leading us to RecoverX 2.0, which we announced just a few weeks back. In this multi-part blog series, I will start peeling the onion on what's new in RecoverX 2.0. First, I will describe the major platform enhancements that enable us to keep building a deeper and deeper moat as we expand into new enterprise data sources.

Scale and Elastic Compute Infrastructure

RecoverX is based on our cloud-first CODR™ architecture that natively scales out to meet the SLA requirements of the application environment. RecoverX 1.0 could be deployed in a single-node or 3-node configuration. With no media servers sitting in the data path as a bottleneck, our customers and partners were able to achieve backup throughput of more than 60TB/day with RecoverX 1.0. Wow, just wow!
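
To put that number in perspective, a daily throughput figure can be converted into a sustained per-second rate. The quick calculation below is only a back-of-the-envelope sketch: it assumes decimal units (1 TB = 10^12 bytes) and a perfectly even rate over 24 hours, neither of which is stated in the figure above.

```python
# Convert a daily backup throughput into a sustained per-second rate.
# Assumption: decimal units (1 TB = 10**12 bytes, 1 MB = 10**6 bytes)
# and an even transfer rate across the whole day.
SECONDS_PER_DAY = 24 * 60 * 60


def tb_per_day_to_mb_per_sec(tb_per_day: float) -> float:
    """Return the average MB/s needed to move `tb_per_day` in 24 hours."""
    bytes_per_day = tb_per_day * 10**12
    return bytes_per_day / SECONDS_PER_DAY / 10**6


rate = tb_per_day_to_mb_per_sec(60)
print(f"{rate:.0f} MB/s")  # roughly 694 MB/s sustained
```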

With RecoverX 2.0, we are enabling even greater performance and scalability. The motivation: customers! We are getting inbound requests from enterprise customers to help them protect their 100TB+ clusters. With RecoverX 2.0, organizations can now back up hundreds of tables at an interval of less than 30 minutes, something unheard of with any traditional backup product! Yes, RecoverX is truly the Tesla of the backup world.

A keystone of our CODR architecture is that its performance scales with compute resources. This means that you do not need static infrastructure (CAPEX!) for your backup system. You may use 4-core virtual machines under light load and then ramp up to 8 cores when the application change rate (workload) is high. Think about optimizing infrastructure costs in real time! That's what a cloud-native architecture enables, and RecoverX has been built for the cloud from day one. This feature provides massive savings for customers that experience cyclical workloads, such as those in e-commerce and financial services.
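
As a rough illustration of the sizing decision described above, the sketch below picks between the 4-core and 8-core virtual machine sizes based on the observed change rate. The function name and the 200 GB/hour threshold are hypothetical values chosen for the example; they are not RecoverX settings or recommendations.

```python
# Illustrative sizing rule: small VMs for light load, larger VMs when the
# application change rate is high. The threshold is a made-up example value.
LIGHT_LOAD_CORES = 4
HEAVY_LOAD_CORES = 8


def pick_core_count(change_rate_gb_per_hour: float,
                    high_load_threshold_gb_per_hour: float = 200.0) -> int:
    """Return the VM core count to use for the given change rate."""
    if change_rate_gb_per_hour < 0:
        raise ValueError("change rate cannot be negative")
    if change_rate_gb_per_hour >= high_load_threshold_gb_per_hour:
        return HEAVY_LOAD_CORES
    return LIGHT_LOAD_CORES


# A quiet weekday versus a peak sales day (example numbers only).
print(pick_core_count(35.0))   # 4
print(pick_core_count(480.0))  # 8
```

In practice the threshold would be tuned to your own workload and SLA rather than fixed in code.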

In the spirit of putting control in the hands of the user to elastically size their infrastructure footprint in line with their application workload, we are also enabling you to add or remove nodes from a RecoverX cluster in real time. For example, you may start with a 3-node RecoverX cluster and later add a couple of nodes if your application requirements grow to 100TB+ scale. You can also decommission nodes from a RecoverX cluster while it is online. Talk about a true distributed scale-out software platform!

Flexible Policy Management

RecoverX has a policy-based framework that allows users to create intuitive data management workflows. With RecoverX 1.0, we enabled table-level (or collection-level) policy management, whereby a user (e.g. a DBA) could define a single policy to capture all tables/databases, or separate policies covering one or more tables each. With RecoverX 2.0, we have completely redesigned our policy framework to give users much more flexibility and massive operational efficiency.

  • Add or remove tables from a policy: This feature allows users to add or remove a table from an existing policy. If a new table is added, its backups are initiated automatically at the next versioning interval. If a table is removed, its backups stop at the next interval, but the existing backups are kept until they naturally expire at the end of their retention period.
  • Suspend or resume backup of a table: This feature allows users to pause the backup of a table or a policy for a duration. Users may resume the backup whenever they want in the future.
  • Ad hoc backup of a table: You may now back up your data source outside of the schedule, depending on your requirements. This is extremely useful if you know that important changes are about to be made to the environment and you would like to take a backup right before they are made, to ensure easy data recoverability if things go wrong.
  • Rule-based automated backup: This is an advanced feature that fully automates policy management. For example, you can specify a rule to automatically add any newly created table to an existing policy for hands-free management!
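
To make the policy operations above more concrete, here is a minimal toy model of a table-level policy with add/remove, suspend/resume, and a rule that automatically adds newly created tables. Everything in it (the `BackupPolicy` class, its method names, and the glob-style patterns used for rules) is a hypothetical illustration and not the RecoverX API or its rule syntax.

```python
from __future__ import annotations

import fnmatch
from dataclasses import dataclass, field


@dataclass
class BackupPolicy:
    """Toy model of a table-level backup policy (hypothetical, for illustration)."""
    name: str
    tables: set[str] = field(default_factory=set)
    suspended: set[str] = field(default_factory=set)
    auto_add_patterns: list[str] = field(default_factory=list)

    def add_table(self, table: str) -> None:
        # The table is picked up at the next versioning interval.
        self.tables.add(table)

    def remove_table(self, table: str) -> None:
        # New backups stop; existing backups would age out via retention.
        self.tables.discard(table)
        self.suspended.discard(table)

    def suspend(self, table: str) -> None:
        if table in self.tables:
            self.suspended.add(table)

    def resume(self, table: str) -> None:
        self.suspended.discard(table)

    def on_table_created(self, table: str) -> bool:
        """Apply the auto-add rules; return True if the table joined the policy."""
        if any(fnmatch.fnmatchcase(table, p) for p in self.auto_add_patterns):
            self.add_table(table)
            return True
        return False

    def tables_for_next_interval(self) -> list[str]:
        """Tables that would be backed up at the next interval."""
        return sorted(self.tables - self.suspended)


policy = BackupPolicy("orders", tables={"shop.orders"}, auto_add_patterns=["shop.*"])
policy.on_table_created("shop.returns")   # matches the rule, added automatically
policy.on_table_created("audit.events")   # no matching rule, ignored
policy.suspend("shop.orders")             # paused until resumed
print(policy.tables_for_next_interval())  # ['shop.returns']
```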

Operational Metrics

Another major enhancement in RecoverX 2.0 is a new metrics database that captures multiple operational metrics to highlight key trends. These metrics help users better understand their data and capacity trends, and are the next step in our journey to expose valuable data that lets users make informed decisions. Metrics available in RecoverX 2.0 include:

  • Backup Metrics: This chart shows the size of each incremental backup. For Cassandra, users can also visualize the number of sstables that were backed up.
  • Storage Savings: This chart shows the secondary storage savings due to our industry-first semantic deduplication and our advanced support for compaction in Cassandra.
  • Storage Heatmap: This map shows the amount of capacity each table occupies on the backend / secondary storage.
  • Storage Trend: This chart shows how storage consumption is trending over time. Looking at this, users can predict the secondary storage they will need in the future.
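
As an example of the kind of prediction a storage trend makes possible, the sketch below fits a straight line to daily storage readings and extrapolates it forward. It assumes you have exported the readings yourself as a plain list of numbers; the function name and the sample values are invented for illustration, and a simple linear fit is only one of many possible forecasting approaches.

```python
def forecast_storage_tb(daily_usage_tb: list[float], days_ahead: int) -> float:
    """Extrapolate storage use with an ordinary least-squares line fit.

    `daily_usage_tb` holds one reading per day, oldest first. The result is
    the projected usage `days_ahead` days after the last reading.
    """
    n = len(daily_usage_tb)
    if n < 2:
        raise ValueError("need at least two readings to fit a trend")
    mean_x = (n - 1) / 2
    mean_y = sum(daily_usage_tb) / n
    sxx = sum((x - mean_x) ** 2 for x in range(n))
    sxy = sum((x - mean_x) * (y - mean_y)
              for x, y in enumerate(daily_usage_tb))
    slope = sxy / sxx                     # growth in TB per day
    intercept = mean_y - slope * mean_x
    return intercept + slope * (n - 1 + days_ahead)


# Example readings (made up): steady growth of 1 TB per day.
readings = [10.0, 11.0, 12.0, 13.0]
print(forecast_storage_tb(readings, days_ahead=7))  # 20.0
```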

We are committed to delivering a reliable and scalable cloud data management platform for the enterprise. RecoverX 2.0 is our next major milestone in delivering on that vision. There are many other exciting updates in RecoverX 2.0 that I will cover in upcoming posts. Stay tuned!

In the meantime, if you are interested in getting your hands on RecoverX 2.0, please reach out to me at pm@datos.io to learn about our Rebellion Early Access Program.