Sorting 19X Faster Than C++ Parallel Sort
In my previous blog Standard C++ Sort was benchmarked running on a single core of an Intel processor at 11 million 32-bit integers per second. Its parallel version scaled up to 93 million integers per second on a 48-core Xeon processor AWS node (C5.24xlarge) – providing 8X speedup. Also, my implementation of Parallel Merge Sort […]
Read more "Sorting 19X Faster Than C++ Parallel Sort"