Apache Impala (Incubating): Analytic Database for Apache Hadoop

Apache Impala (curently an ASF incubator project) provides high-performance, low-latency SQL queries on data stored in popular Apache Hadoop file formats. The fast response for queries enables interactive exploration and fine-tuning of analytic queries, rather than long batch jobs traditionally associated with SQL-on-Hadoop technologies. (You will often see the term "interactive" applied to these kinds of fast queries with human-scale response times.)

Impala integrates with the Apache Hive metastore database to share databases and tables between both components. The high level of integration with Apache Hive, and compatibility with the HiveQL syntax, lets you use either Impala or Hive to create tables, issue queries, load data, and so on.

This white paper contains featured blog posts from the Cloudera Engineering Blog about key Impala concepts, Impala performance, and best practices.

Work Email

First Name

Last Name

Company

Job Title

Role

Industry

Country

I agree to receive email communications from 1105 Media, Inc. containing news, updates and promotions regarding offers from select vendors. I understand that I can withdraw consent at any time.

Your e-mail address is used to communicate with you about your registration, related products and services, and offers from select vendors. Refer to our Privacy Policy for additional information.

Research & Resources

Webinars

Virtual Summits

Apache Impala (Incubating): Analytic Database for Apache Hadoop