Ray Deep Dives

The Ray Dataplane: History and the Cloudsort World Record

September 19, 1:45 PM - 2:15 PM
View Slides

The Ray dataplane powers all memory-intensive applications in the Ray ecosystem. It's also come a long way. In this talk, we'll give a brief overview of the history and design of the Ray dataplane, starting from its origin as a zero-copy object store, to its involvement today in the Ray Data library. We'll finish by explaining how we were able to use the Ray dataplane to break the Cloudsort world record, reducing the cost of sorting 100TB of data on the cloud to less than 1 dollar/TB.

About Stephanie

Stephanie is a software engineer at Anyscale, a Ray committer, and an author of Ray core. She is working on problems related to data processing and distributed execution with Ray. In fall ‘24, she will join the computer science faculty at the University of Washington.

About Frank

Frank Sifei Luan is a PhD student in computer science at the Sky Computing Lab (previously RISELab) at UC Berkeley, advised by Ion Stoica. His research interest is in data, AI systems and cloud computing. Before Berkeley, he worked in the Big Code team at Facebook from 2017 to 2019. He received my bachelor’s degrees in computer science and statistics from the University of Chicago in 2017.

Stephanie Wang

Software Engineer, Anyscale

Frank Luan

PhD Student, UC Berkeley
Photo of Ray Summit pillows
Ray Summit 23 logo

Ready to Register?

Come connect with the global community of thinkers and disruptors who are building and deploying the next generation of AI and ML applications.

Photo of Ray pillows and Raydiate sign
Photo of Raydiate sign

Join the Conversation

Ready to get involved in the Ray community before the conference? Ask a question in the forums. Open a pull request. Or share why you’re excited with the hashtag #RaySummit on Twitter.