COLLECT_SET() in Hive, keep duplicates?

Name: Hive Tutorial Hive In Hadoop Hadoop Hive Tutorial Intellipaat
Uploaded: 2017-07-14T14:22:09+00:00
Description: Is there a way to keep the duplicates in a collected set in Hive, or to simulate that sort of aggregate collection?

Question

1 Answer

Amit Rawat · Answer 1 · 2019-07-07T18:43:50+0000

Since Hive 0.13.0, the built-in aggregate function collect_list(col) is available. Unlike collect_set(col), it returns a list of objects with duplicates retained, so you can use it here:

SELECT
    hash_id, collect_list(num_of_cats) AS aggr_set
FROM
    <tablename>
WHERE
    <condition>
GROUP BY
    hash_id
;
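
To make the difference concrete, here is a small plain-Python model of the two aggregates. It is not Hive code, only a sketch of the semantics: the rows are made-up sample data, and the two helper functions are hypothetical stand-ins named after the Hive built-ins.

```python
from collections import defaultdict

# Hypothetical rows of (hash_id, num_of_cats); the values are made up.
rows = [("a", 2), ("a", 2), ("a", 5), ("b", 1), ("b", 1)]

def collect_list(rows):
    """Group values by key and keep every value, duplicates included."""
    groups = defaultdict(list)
    for key, value in rows:
        groups[key].append(value)
    return dict(groups)

def collect_set(rows):
    """Group values by key and drop repeated values within each group."""
    groups = defaultdict(list)
    for key, value in rows:
        if value not in groups[key]:
            groups[key].append(value)
    return dict(groups)

print(collect_list(rows))  # {'a': [2, 2, 5], 'b': [1, 1]}
print(collect_set(rows))   # {'a': [2, 5], 'b': [1]}
```

This model keeps values in input order, but the order of elements in the array Hive returns should not be relied on.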

