By using website you agree to our use of cookies as described in our cookie policy. Learn More


RainStor Launches Archive Application on Hadoop 2.0

Enterprise archive app accelerates query, enhances governance and management using Hadoop’s YARN, HCatalog, and Ambari.

Note: TDWI’s editors carefully choose vendor-issued press releases about new or upgraded products and services. We have edited and/or condensed this release to highlight key features but make no claims as to the accuracy of the vendor's statements.

RainStor, has released an archive application for Hadoop 2.0 with its latest release, RainStor 6. The new solution makes it easier to deploy an end-to-end solution on Hadoop for managing and analyzing high value, sensitive data. With RainStor’s Archive App, users conduct high-performance queries against secure, multi-structured data in a very efficient way. An archive is deployed when an organization has rapidly growing data that needs to be retained for ongoing business queries or when governance rules mandate that data be online and fully accessible for specific timeframes. Business users require analytic access to multiple years of history storing raw detailed data in order to derive business value and insights.

“Hadoop adoption is being driven by its low-cost to scale, and by the perceived value of the rapidly expanding ecosystem of capabilities to support business analytics,” said Mark Cusack, chief architect at RainStor. “RainStor has been delivering analytical archive solutions for a decade, and with RainStor 6, you can now take advantage of those capabilities running on Hadoop 2.0.”

New archive capabilities include:

Faster analytics performance: Building on RainStor’s existing interactive SQL-on-Hadoop stack, the new archive application features XQuery for hierarchical data and documents, and extends analytics support to SQL 2003. Users benefit from a 10-100X query boost using native SQL against a mix of structured data, semi-structured data, and documents in the same cluster. Performance improvements also apply to queries against Hive, Pig, and MapReduce. An archive on Hadoop should achieve performance levels on par with the source environment, which is typically a data warehouse.

RainStor application management on Hadoop 2.0: RainStor is open, standards-based, and is ideally suited to run on HDFS. Certified on Hortonworks 2.1 and Cloudera Enterprise 5, RainStor integrates with YARN to ensure full co-operation in managing resources across a busy Hadoop cluster. RainStor integrates with Apache Ambari for cluster monitoring, and with Hue for managing archive workflows. RainStor also provides connectivity through HCatalog, the de facto interface to relational data. These capabilities offer users increased flexibility in selecting the tools that best fit their needs.

Governance for greater control: With this new archive app, you gain enterprise-grade control of the data in Hadoop, through life-cycle data management features for retention and expiry. With a rules-based workflow, you specify a record or groups of records to keep or delete, as they are loaded. Adhering to data governance practices has become a critical requirement and now you have greater control with your data, which eliminates time-consuming manual intervention or lost data.

The RainStor Archive App on Hadoop 2.0 is available now. More information about key capabilities and pricing, visit

TDWI Membership

Get immediate access to training discounts, video library, BI Teams, Skills, Budget Report, and more

Individual, Student, and Team memberships available.