I would like to know what is the major difference between Cluster By and CLUSTERED BY in hive.
Cluster By used for bucketing the table. And it will use the Hash function.
CLUSTERED BY used for order by value with in the reducer.
is there any other difference between.
Please let me know
Thanks
venkatbala.