Exoshuffle-CloudSort: The 2022 CloudSort Benchmark Winner

Abstract

We present Exoshuffle-CloudSort, a sorting application running on top of Ray using the Exoshuffle architecture. Exoshuffle-CloudSort runs on Amazon EC2, with input and output data stored on Amazon S3. Using 40 i4i.4xlarge workers, Exoshuffle-CloudSort completes the 100 TB CloudSort Benchmark (Indy category) in 5378 seconds, with an average total cost of $97.
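The headline figures in the abstract imply a few derived rates that make the result easier to compare with other systems. The sketch below computes them using only the numbers stated above; the helper name `summarize` is hypothetical, and it assumes decimal units (1 TB = 1000 GB), which the abstract does not specify.

```python
# Figures taken from the abstract.
DATA_TB = 100          # benchmark data size
ELAPSED_S = 5378       # wall-clock time in seconds
WORKERS = 40           # i4i.4xlarge worker nodes
TOTAL_COST_USD = 97    # reported average total cost


def summarize(data_tb, elapsed_s, workers, cost_usd):
    """Derive throughput and unit cost from the headline figures.

    Assumes decimal units (1 TB = 1000 GB); this is an assumption,
    not something stated in the abstract.
    """
    data_gb = data_tb * 1000
    aggregate = data_gb / elapsed_s
    return {
        "aggregate_gb_per_s": aggregate,
        "per_worker_gb_per_s": aggregate / workers,
        "usd_per_tb": cost_usd / data_tb,
        # (hours, minutes, seconds) of wall-clock time
        "elapsed_hms": (elapsed_s // 3600, elapsed_s % 3600 // 60, elapsed_s % 60),
    }


if __name__ == "__main__":
    for name, value in summarize(DATA_TB, ELAPSED_S, WORKERS, TOTAL_COST_USD).items():
        print(f"{name}: {value}")
```

Under these assumptions the run sorts roughly 18.6 GB/s in aggregate, about 0.46 GB/s per worker, at $0.97 per TB, in just under an hour and a half.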

Publication
arXiv
Frank Sifei Luan
栾思飞 | PhD Student

PhD student at UC Berkeley, focusing on AI systems and cloud computing.
