What is the meaning of partitionColumn, lowerBound, upperBound, numPartitions parameters?

1 Answer

answered Jul 19, 2019 by Amit Rawat (32.3k points)

partitionColumn is a column which should be used to determine partitions.

lowerBound and upperBound determine range of values to be fetched. The complete dataset will be using rows corresponding to the following query:

SELECT * FROM table WHERE partitionColumn BETWEEN lowerBound AND upperBound

numPartitions determines number of partitions to be created. Range between lowerBound and upperBound is divided into numPartitions each with stride equal to:

upperBound / numPartitions - lowerBound / numPartitions

For example if:

lowerBound: 0
upperBound: 1000
numPartitions: 10

Stride is equal to 100 and partitions are will be corresponding to the following queries:

SELECT * FROM table WHERE partitionColumn < 100

SELECT * FROM table WHERE partitionColumn BETWEEN 100 AND 200
...
SELECT * FROM table WHERE partitionColumn BETWEEN 900 AND 1000

Browse Categories

...