Alluxio Updates Interface Support for Onboarding Additional Data-Driven Applications
Alluxio 2.5 focuses on POSIX and S3 interface access to improve performance and compatibility with popular interfaces for analytics and machine learning data pipelines.
Note: TDWI’s editors carefully choose vendor-issued press releases about new or upgraded products and services. We have edited and/or condensed this release to highlight key features but make no claims as to the accuracy of the vendor's statements.
Alluxio, the developer of open source cloud data orchestration software, released version 2.5 of its Data Orchestration Platform featuring access via POSIX and S3 interfaces enabling data platform teams to accelerate data pipelines for both business intelligence and model training using frameworks such as Tensorflow and PyTorch.
Alluxio 2.5 also improves compatibility with the S3 API. Together S3, HDFS, and POSIX make up a majority of the APIs preferred by data driven applications and data management tools. Administrators now have the flexibility to manage the Alluxio file system namespace through a standard object storage console. This flexibility makes it simpler to integrate Alluxio into existing large scale data pipelines.
New and improved storage connectors on Amazon Web Services, Azure Cloud, and Google Cloud Platform improve the onboarding experience with seamless authentication and improved performance. Data lakes on all major cloud platforms can now integrate Alluxio to orchestrate data management. A new Quickstart guide using Data Orchestration Hub for single, hybrid, or multi-cloud data orchestration is also included, along with support for the Hub on Kubernetes.
Kubernetes is a popular deployment choice for Alluxio with both data analytics and machine learning pipelines across on-premises and cloud environments. With the dynamic nature of containerized environments, log collection is a challenge when containers are frequently killed or restarted. Now Alluxio logs can be aggregated on a centralized collection server in Kubernetes.
Hybrid Cloud Quickstart with Alluxio Data Orchestration Hub
Alluxio is frequently used for multi-datacenter and hybrid cloud environments. Version 2.5 provides enhancements for getting started with the deployment and configuration of such environments. Data Orchestration Hub is now supported on Kubernetes to aid cluster configuration and connectivity across private data centers or public clouds. AWS users now also have access to a quickstart using Terraform to deploy an Alluxio cluster with AmazonEMR. Once an Alluxio cluster is deployed, either using the new Terraform or helm on Kubernetes, the Hub is available to manage subsequent changes.
For details, visit https://www.alluxio.io/.