Questions tagged [impala]

Questions related to Apache Impala SQL Query Engine.

Apache Impala

...is the open source, native analytic database for Apache Hadoop.

Overview

Impala raises the bar for SQL query performance on Apache Hadoop while retaining a familiar user experience. With Impala, you can query data, whether stored in HDFS or Apache HBase – including SELECT, JOIN, and aggregate functions – in real time. Furthermore, Impala uses the same metadata, SQL syntax (Hive SQL), ODBC driver, and user interface (Hue Beeswax) as Apache Hive, providing a familiar and unified platform for batch-oriented or real-time queries. (For that reason, Hive users can utilize Impala with little setup overhead.)

Reference: Apache Impala | Overview (Apache Impala)


Apache Impala is an open source massively parallel processing SQL query engine for data stored in a computer cluster running Apache Hadoop. Impala has been described as the open-source equivalent of Google F1, which inspired its development in 2012.

Reference: Apache Impala (Wikepedia)

5 questions
2
votes
1 answer

how to insert data into extra columns of target avro table when source table is having less no of columns compared to target using hive or impala?

Suppose I am having a source Avro table having 10 columns and my target Avro table is having 12 columns, while inserting data into the target table I need to add null values to the extra 2 columns. But when I execute the below query it has thrown…
user109612
  • 21
  • 2
1
vote
1 answer

Splitting a table into two tables randomly with 50-50 percent of records in impala

I have a table having n number of records in impala. I need to find out how can I divide that table into two equal halves with 50 - 50 percent of records in each.
user97537
  • 11
  • 1
  • 2
0
votes
1 answer

Calculate (n) workdays from a given date, using a calendar table

As shown on below image, I have two tables; Customer : with about 4 million records; and 3 columns [Customer] : Customer ID [Date] : Creation date [Num_Days] : Number of working days to calculate the [next_wkday] in the query result. Calendar :…
LEOPOLDO
  • 1
  • 2
0
votes
0 answers

Get Join Keys dynamically from a table

SQL/Impala I have a scenario where I need to join my input file with a mapping table which has map columns and derived columns. The join keys are dynamic and need to taken from a dynamic metadata table. Metadata table can be modified by user to…
0
votes
1 answer

Retrieving orders that only contain certain product IDS

I am trying to retrieve a list of transaction IDS that only contain at least 5 product IDS, the baskets could have many more product IDS but it must contain these 5 also. My table looks something like this: | transaction_id | trans_date | product_id…
RustyRyan
  • 89
  • 7