CUDA_Arbitrary_Bitonic_Sort

Parallel bitonic sort for CUDA. Works with arbitrary inputs.

Inputs: int length, int blockNumber, long* input
Output: long* res
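
The description above does not say how inputs whose length is not a power of two are handled. One common approach, assumed here and not taken from this repository, is to pad the array up to the next power of two with `LONG_MAX` sentinels, run the standard bitonic network, and copy back only the first `length` elements. The sketch below illustrates that idea. The names `bitonicStep` and `bitonicSortSketch` are hypothetical, as are the fixed block size of 256 threads and the choice to treat `input` and `res` as host pointers; only the parameter names `length`, `blockNumber`, `input`, and `res` come from the description.

```cuda
#include <algorithm>
#include <cassert>
#include <climits>
#include <vector>
#include <cuda_runtime.h>

// One compare-exchange pass of the bitonic network over n elements
// (n must be a power of two). A grid-stride loop lets any number of
// blocks cover the whole array.
__global__ void bitonicStep(long* data, int n, int j, int k) {
    int stride = blockDim.x * gridDim.x;
    for (int i = blockIdx.x * blockDim.x + threadIdx.x; i < n; i += stride) {
        int partner = i ^ j;
        if (partner > i) {  // each pair is handled by its lower index only
            bool ascending = (i & k) == 0;
            long a = data[i];
            long b = data[partner];
            if (ascending ? (a > b) : (a < b)) {
                data[i] = b;
                data[partner] = a;
            }
        }
    }
}

// Hypothetical host wrapper: sorts `length` values from `input` into `res`
// in ascending order. Assumes host pointers, blockNumber > 0, and
// length <= 2^29 so the int arithmetic below cannot overflow.
// CUDA error checking is omitted for brevity.
void bitonicSortSketch(int length, int blockNumber, const long* input, long* res) {
    assert(blockNumber > 0);
    if (length <= 0) return;

    // Round up to the next power of two.
    int n = 1;
    while (n < length) n <<= 1;

    // Pad with LONG_MAX so the sentinels sort to the end.
    std::vector<long> padded(n, LONG_MAX);
    std::copy(input, input + length, padded.begin());

    long* d = nullptr;
    cudaMalloc(&d, n * sizeof(long));
    cudaMemcpy(d, padded.data(), n * sizeof(long), cudaMemcpyHostToDevice);

    // Launches in the same stream run in order, so each pass sees the
    // result of the previous one.
    for (int k = 2; k <= n; k <<= 1) {
        for (int j = k >> 1; j > 0; j >>= 1) {
            bitonicStep<<<blockNumber, 256>>>(d, n, j, k);
        }
    }

    // Only the first `length` elements are real data.
    cudaMemcpy(res, d, length * sizeof(long), cudaMemcpyDeviceToHost);
    cudaFree(d);
}
```

Padding keeps the kernel simple at the cost of up to roughly twice the memory and work when `length` is just above a power of two. An input that already contains `LONG_MAX` is still sorted correctly, because the sentinels compare equal to those values and sit at or after them in the output.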