Alluxio provides global access to all of the data in your enterprise, whether on premises, in the cloud, or both. Applications are presented with a single virtual data source through a standard interface that connects disparate storage regardless of location and format. Intelligent caching and data management ensure fast performance, data continuity, and high availability.

Applications: MapReduce, Spark, Hive, HBase, Presto
Storage: HDFS, S3, Ceph, GCS, EMC ECS

Key Innovations

Global Unified Namespace

One View, All Your Data

Applications have a single point of access to multiple independent storage systems regardless of physical location. Avoid complicated ETL or Master Data Management processes and eliminate the need to create duplicate storage silos.
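
To make the idea of a single namespace over several storage systems concrete, here is a minimal sketch in Python. It is not Alluxio's API: the mount table, the storage URIs, and the `resolve` function are hypothetical, and the sketch only assumes that a virtual path is mapped to a backing store by its longest matching mount prefix.

```python
from typing import Dict

# Hypothetical mount table: virtual path prefix -> backing storage URI.
# The hosts and bucket names are placeholders for illustration only.
MOUNTS: Dict[str, str] = {
    "/": "hdfs://namenode:8020/",
    "/sales": "s3://example-bucket/sales/",
    "/archive": "gs://example-archive/",
}


def resolve(virtual_path: str, mounts: Dict[str, str] = MOUNTS) -> str:
    """Map a path in the unified namespace to a URI in the backing store.

    Chooses the longest mount prefix that matches on a path-component
    boundary, and falls back to the root mount ("/") otherwise.
    """
    best = "/"
    for prefix in mounts:
        if prefix == "/":
            continue
        matches = virtual_path == prefix or virtual_path.startswith(prefix + "/")
        if matches and len(prefix) > len(best):
            best = prefix
    remainder = virtual_path[len(best):].lstrip("/")
    return mounts[best] + remainder


if __name__ == "__main__":
    for path in ("/sales/2024/q1.parquet", "/logs/app.log", "/archive"):
        print(path, "->", resolve(path))
```

An application in this sketch only ever sees paths such as `/sales/2024/q1.parquet`; which storage system holds the data is decided by the mount table, so no copy into a separate silo is needed.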

Standard Application and Storage Interface

One Connection, Any Data Format

Server-side API translation converts from a client-side interface to any storage interface. Alluxio manages communication between applications and file or object storage, eliminating the need for complex system configuration and management. File data can look like object data and vice versa. If you have multiple versions of HDFS in your enterprise, Alluxio also lets an application talk to different versions of the same storage.
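
The statement that object data can look like file data can be illustrated with a small adapter. This is a conceptual sketch only, not Alluxio's implementation: `InMemoryObjectStore` and `FileView` are hypothetical names, and the object store is simulated with a dictionary of flat keys.

```python
from typing import Dict, List


class InMemoryObjectStore:
    """Stand-in for an object store: a flat map from key to bytes."""

    def __init__(self) -> None:
        self._objects: Dict[str, bytes] = {}

    def put(self, key: str, data: bytes) -> None:
        self._objects[key] = data

    def get(self, key: str) -> bytes:
        return self._objects[key]

    def keys(self) -> List[str]:
        return sorted(self._objects)


class FileView:
    """Presents file-style paths and directory listings over flat keys."""

    def __init__(self, store: InMemoryObjectStore) -> None:
        self._store = store

    def read(self, path: str) -> bytes:
        # A file path maps to an object key by dropping the leading slash.
        return self._store.get(path.lstrip("/"))

    def list_dir(self, path: str) -> List[str]:
        # A "directory" is every distinct first component after the prefix.
        prefix = path.strip("/")
        prefix = prefix + "/" if prefix else ""
        children = set()
        for key in self._store.keys():
            if key.startswith(prefix):
                rest = key[len(prefix):]
                if rest:
                    children.add(rest.split("/", 1)[0])
        return sorted(children)


if __name__ == "__main__":
    store = InMemoryObjectStore()
    store.put("logs/2024/a.txt", b"a")
    store.put("logs/b.txt", b"b")
    view = FileView(store)
    print(view.list_dir("/logs"))
```

The application code works only with `FileView`; swapping the backing store for a different one would leave that code unchanged, which is the point of translating on the server side.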

Intelligent Cache

Data on Demand at Memory Speed

Alluxio clusters act as a read/write cache for data in connected storage systems. Temporarily storing data in memory, or other media near compute, accelerates access and provides local performance from remote storage. This capability is even more critical with the movement of compute applications to the cloud and data being located in object stores separate from compute. Caching is transparent to the user, and uses read/write buffering to maintain continuity with persistent storage. Intelligent cache management utilizes configurable policies for efficient data placement and supports tiered storage for both memory and disk (SSD/HDD).
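
The read path of a tiered cache can be sketched as follows. This is an illustration under stated assumptions, not Alluxio's actual policy engine: the `Tier` and `TieredCache` classes are hypothetical, capacity is counted in blocks rather than bytes, and least-recently-used eviction stands in for whatever placement policy a real deployment configures.

```python
from collections import OrderedDict
from typing import Callable, List, Optional


class Tier:
    """One storage tier (for example memory or SSD) holding whole blocks."""

    def __init__(self, name: str, capacity: int) -> None:
        self.name = name
        self.capacity = capacity  # maximum number of blocks in this tier
        self.blocks: "OrderedDict[str, bytes]" = OrderedDict()


class TieredCache:
    """Read-through cache: fastest tier first, LRU demotion to slower tiers."""

    def __init__(self, tiers: List[Tier], fetch: Callable[[str], bytes]) -> None:
        self.tiers = tiers  # ordered from fastest to slowest
        self.fetch = fetch  # loads a block from the persistent store

    def read(self, block_id: str) -> bytes:
        for tier in self.tiers:
            if block_id in tier.blocks:
                data = tier.blocks.pop(block_id)
                self._insert(0, block_id, data)  # promote hot block to the top
                return data
        data = self.fetch(block_id)  # cache miss: go to remote storage
        self._insert(0, block_id, data)
        return data

    def location(self, block_id: str) -> Optional[str]:
        for tier in self.tiers:
            if block_id in tier.blocks:
                return tier.name
        return None

    def _insert(self, level: int, block_id: str, data: bytes) -> None:
        if level >= len(self.tiers):
            return  # dropped from cache; the persistent store still has it
        tier = self.tiers[level]
        tier.blocks[block_id] = data
        if len(tier.blocks) > tier.capacity:
            # Demote the least recently used block to the next tier down.
            victim_id, victim = tier.blocks.popitem(last=False)
            self._insert(level + 1, victim_id, victim)


if __name__ == "__main__":
    cache = TieredCache([Tier("MEM", 2), Tier("SSD", 2)], lambda b: b.encode())
    for block in ("a", "b", "c", "a"):
        cache.read(block)
    print({b: cache.location(b) for b in ("a", "b", "c")})
```

Repeated reads of the same block are served from the fastest tier, while colder blocks move down to slower media and eventually out of the cache, at which point the next read fetches them again from persistent storage.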

Unified Security

End to End Policy Enforcement

Alluxio fits within existing frameworks and enforces the security you have in place. User authentication, authorization, access control, and data encryption policies from both applications and storage are applied within Alluxio. Support is provided for multi-tenancy, Active Directory, LDAP, Kerberos, and encryption.

Alluxio Editions

Alluxio Enterprise Edition: Enterprise Support, Security, High Availability, Data Management
Alluxio Community Edition: Alluxio Manager, Open Source

Alluxio Enterprise Edition is designed for mission-critical deployments, with enterprise features, performance, and 24/7 support. We also offer a Community Edition that is based on Alluxio Open Source and includes Alluxio Manager. Community Edition can be downloaded for free and used without restrictions on time or cluster size. Please visit Alluxio.org to participate in our community and download the source code.

Alluxio Subscriptions

Subscription pricing for Alluxio Enterprise Edition is based on the number of nodes in your environment, and subscriptions include either Premium (24x7) or Standard (8x5) support. Alluxio Community Edition is free.

Feature Comparison: Community Edition and Enterprise Edition

APIs

Alluxio File System API
Editions: Community, Enterprise

Hadoop Compatible File System API
Editions: Community, Enterprise

REST File System API
Editions: Community, Enterprise

S3 API
Editions: Community, Enterprise

FUSE API
Editions: Community, Enterprise

DATA STORE INTEGRATIONS

Public Cloud Stores (S3, GCS, Azure, OSS)
Editions: Community, Enterprise

Object Stores (ECS, Cleversafe, Ceph, FusionStor, Minio)
Editions: Community, Enterprise

File Systems (HDFS, NFS)
Editions: Community, Enterprise

Concurrent Multi-Store Runtime Support
Editions: Enterprise only
Different supported data stores (e.g. HDFS, MapRFS) and different versions of the same storage can be integrated concurrently with the same Alluxio cluster.

Data Store Ecosystem Certifications
Editions: Enterprise only
Certified and guaranteed support from both Alluxio and storage vendors.

ECOSYSTEM APPLICATIONS

General Compute (Spark, MapReduce)
Editions: Community, Enterprise

SQL (SparkSQL, Presto, Hive)
Editions: Community, Enterprise

Streaming (Flink, Spark Streaming)
Editions: Community, Enterprise

Key Value (HBase)
Editions: Community, Enterprise

Notebook (Zeppelin)
Editions: Community, Enterprise

Deep Learning (TensorFlow)
Editions: Community, Enterprise

Ecosystem Application Certifications
Editions: Enterprise only
Certified and guaranteed support from both Alluxio and application vendors.

DATA MANAGEMENT

Global Unified Namespace
Editions: Community, Enterprise

Transparent API Translation
Editions: Community, Enterprise

In-Alluxio Intelligent Tiered Store
Editions: Community, Enterprise
Manage storage resources with media-aware tiering policies across memory, SSD, and HDD within Alluxio.

Tiered Locality
Editions: Community, Enterprise
Fine-grained data locality policies can be configured at the server, rack, and data center level.

Cross Store Data Migration
Editions: Enterprise only
Migrate data between any supported data stores, regardless of where it originated (e.g. HDFS to S3).

Cross Store Data Replication
Editions: Enterprise only
Replicate data to any supported data store, regardless of original format, to support HA or other operational needs.

In-Alluxio Policy Based Data Replication
Editions: Enterprise only
Replicate data within an Alluxio cluster with user-defined policies. Manage policies dynamically as workloads change.

Cluster Partitioning
Editions: Enterprise only
Configure zones within an Alluxio cluster to isolate resource usage and manage data placement. Partition to support multi-tenancy, AWS Availability Zones, and disaster recovery.

PERFORMANCE

Memory Centric Architecture
Editions: Community, Enterprise
Designed with a memory-first architecture to effectively leverage modern data center hardware.

Intelligent Memory-speed Cache
Editions: Community, Enterprise
Transparently cache hot data from connected storage systems and provide memory-speed access.

Durable High-Speed Writes
Editions: Enterprise only
Write operations are replicated for short-term high availability within Alluxio. Writes to persistent storage are asynchronous and maintain consistency even in the event of an outage.

DEPLOYMENT

Bare Metal
Editions: Community, Enterprise

Docker
Editions: Community, Enterprise

Mesos
Editions: Community, Enterprise

YARN
Editions: Community, Enterprise

Kubernetes
Editions: Community, Enterprise

DC/OS
Editions: Enterprise only

ENTERPRISE SECURITY & COMPLIANCE

Unified Security Enforcement
Editions: Community, Enterprise
Integrate with and enforce existing enterprise security. User authentication, authorization, access control, and data encryption policies from both applications and storage are applied within Alluxio.

Multi-user Support
Editions: Community, Enterprise
Support multiple users on the system, each with their own identity and privileges.

File Level Access Control
Editions: Community, Enterprise
Fine-grained access control at the file level. Individual files can have specific security constraints.

Security Audit Logging
Editions: Community, Enterprise
An audit log of data accesses, including the time, user, operation, and success or failure.

Kerberos Secured Under Store Integration for Single User
Editions: Community, Enterprise
Integrate with a Kerberos-secured under store, provided that Alluxio can act as a superuser.

Kerberos Secured Under Store Integration for Multiple Users w/ Delegation
Editions: Enterprise only
Integrate with a Kerberos-secured under store through delegation and without superuser privileges.

In-Alluxio Kerberos Authentication
Editions: Enterprise only
Authenticate users through Kerberos to validate identity.

Identity Management Integration (Active Directory/LDAP)
Editions: Enterprise only
Integrate with an LDAP-based identity management system such as Active Directory.

Data Encryption (in motion and at rest)
Editions: Enterprise only
Client-side data encryption with a configurable key management service. Data in Alluxio or connected storage is encrypted at rest and when sent over the network.

Transport Layer Security (TLS)
Editions: Enterprise only
TLS 1.2 support for communication between all Alluxio components (client, master, workers, job master, and job worker) as well as between Alluxio and the under store.

HIGH AVAILABILITY & DISASTER RECOVERY

ZooKeeper Based Fault Tolerance
Editions: Community, Enterprise
Master failover through ZooKeeper-based leader election. Requires a ZooKeeper deployment.

External Storage System Based Journaling
Editions: Community, Enterprise
The journal is kept on an external storage system. When running in HA mode, that storage system must be accessible from all master nodes.

Built-in Fault Tolerance
Editions: Enterprise only
Internal master failover and election.

Built-in Journaling
Editions: Enterprise only
Internally managed, highly available journal with local storage resources.

SYSTEM MANAGEMENT & MONITORING

Command Line Interface (CLI)
Editions: Community, Enterprise

Graphical User Interface (GUI)
Editions: Community, Enterprise

Service discovery and health checks
Editions: Community, Enterprise

System metrics and logging
Editions: Community, Enterprise

Alluxio Manager
Editions: Community, Enterprise

ENTERPRISE GRADE SUPPORT

8x5 Support
Editions: Enterprise only

24x7 Support
Editions: Enterprise only

Emergency Patching
Editions: Enterprise only
