Bitonic sorter
Bitonic mergesort is a parallel algorithm for sorting. It is also used as a construction method for building a sorting network. The algorithm was devised by Ken Batcher. [3] The resulting sorting networks consist of comparators and have a delay of , where is the number of items to be sorted.[1][2] This makes it a popular choice for sorting large numbers of elements on an architecture which itself contains a large number of parallel execution units running in lockstep, such as a typical GPU. A sorted sequence is a monotone sequence---that is, a sequence which is either non-decreasing or non-increasing. A sequence is bitonic when it consists of a non-decreasing sequence followed by a non-increasing sequence, i.e. when there exists an index for which [3] A bitonic sorter can only sort inputs that are bitonic. Bitonic sorter can be used to build a bitonic sort network that can sort arbitrary sequences by using the bitonic sorter with a sort-by-merge scheme, in which partial solutions are merged together using bigger sorters. The following sections present the algorithm in its original formulation, which requires an input sequence whose length is a perfect powers of two. We will therefore let be the integer for which , meaning that the bitonic sorters may be enumerated in order of increasing size by considering the successive values . Bitonic Sorter![]() A bitonic sorter for is simply a comparator.[3] This is illustrated by the given box layout, in which X and Y represent the inputs, while H and L represent the higher and lower outputs, respectively. With the sorter for , we can recursively create a sorter of higher order. For example, consider the following bitonic sorter.[3] ![]() The bitonic sorter consists of two layers: a recombination layer, which recombines the bitonic inputs into two new bitonic sequences that are each half as long as the original sequence, and a bitonic sort layer consisting of two bitonic sorters of order , each of which sorts one of the two bitonic sequences produced by the previous layer. This structure may be extended recursively for higher values of by ensuring that each comparator always accepts one input from each of the two halves of the bitonic sequence it is meant to help sort. The following illustration depicts these connections schematically.[3] ![]() As you can see, the first half of the input sequence is compared against the last half of the input sequence. Comparing each element of the (green) subsequence with the element of the other (orange) subsequence at the respective index produces two bitonic subsequences. These two bitonic series (blue and red, respectively) can then be fed into the next lower-order bitonic sorter. This can be done because all elements in the red sequence are guaranteed to be higher than all elements in the blue series. [3] Correctness of the bitonic sorterKen Batcher provided some mathematical proof sketch in his paper.[3] With out loss of generality the bitonic input sequence is assumed to be with . With out loss of generality the sequence can be reversed therefore, we can assume . Case 1: If then every element of the two subsequences are smaller. In this case and with and therefore and are trivially bitonic. Case 2: Otherwise there exists a such that the element of the first sub-sequence is bigger then of the second sub-sequence while it is the opposite for . This means that and are true for a specific . Therefore we now know that: 1. For the sequences are and 2. For the sequences are defined as the opposite of 1, with and In the original paper he then claims the following inequalities result from those definitions:[3] Following from 1:
Following from 2:
From both: From the paper claims follows that the sequences and are in fact bitonic.[3] Bitonic Sorting Networks (Bitonic Merge Sort)A bitonic sorting network is created by using several bitonic sorters. These bitonic sorters are recursively used to create two monotonic sequences, one decreasing and one increasing, which are then put into the next stage. This creates a bitonic series for the next stage, which can then use this bitonic series as a monotonic series for the next stage. Consider the following example for an bitonic sort network.[3] ![]() The bitonic sorting network for can be created by using a bitonic sorter and two sorters. The two sorters create a decreasingly or increas- ingly sorted sequence in order to create a bitonic input for the bitonic sorter. Bitonic sorting networks of a lower order are mostly used for the two pre-sorters; therefore, a recursive definition of a bitonic sorting network from bitonic sorters can be described. In the above example, the two bitonic sorting networks are networks; hence, they are just a comparator.[3] The following figure shows the overall scheme. ![]() This overall scheme requires the sorter to have an input of sequence that is a power of two. There are, however, possibilities to mitigate this by, for example, using sentinel values. PseudocodeThe following pseudocode describes the sorting process. In the code, The function call function bitonicMerge(a, low, count, direction) is if count > 1 THEN k ← count / 2 // Compare and swap elements across the halves for i ← low to low + k - 1 do // determine if two elements of a are out of order in relation to the direction of sorting. if (direction == 1 AND a[i] > a[i + k]) OR (direction == 0 AND a[i] < a[i + k]) THEN swap a[i] with a[i + k] // Recursively merge both halves bitonicMerge(a, low, k, direction) bitonicMerge(a, low + k, k, direction) // This only works when input size is a power of 2. function bitonicSort(a, low, count, direction) is if count > 1 THEN k ← count / 2 // Sort first/second half into ascending/descending order bitonicSort(a, low, k, 1) bitonicSort(a, low + k, k, 0) // Merge entire sequence in desired order bitonicMerge(a, low, count, direction) ComplexityIn this section we assume that our sorter has input elements as previously. Each recursion in a bitonic sorting network adds a sorter of order , which consists of bitonic sorter and the next recursion. As both sub-sorters can be done in parallel, only one level is added for each level in both sub-sorters. Each bitonic sorter has, therefore, one recombination layer and a lower-order bitonic sorter for its recursion. This results in levels per bitonic sorter. Therefore we can describe this construction's levels as the following sum: . This sum can be reduced using the Gauss sum formula Therefore, the number of levels in which each comparison can be done in parallel is given by .[3] Which gives us assuming comparisons can be performed in parallel. Although the absolute number of comparisons is typically higher than Batcher's odd-even sort, many of the consecutive operations in a bitonic sort retain a locality of reference, making implementations more cache-friendly and typically more efficient in practice.[3] See alsoReferences
External links |