The Ray dataplane powers all memory-intensive applications in the Ray ecosystem. It's also come a long way. In this talk, we'll give a brief overview of the history and design of the Ray dataplane, starting from its origin as a zero-copy object store, to its involvement today in the Ray Data library. We'll finish by explaining how we were able to use the Ray dataplane to break the Cloudsort world record, reducing the cost of sorting 100TB of data on the cloud to less than 1 dollar/TB.
Stephanie is a software engineer at Anyscale, a Ray committer, and an author of Ray core. She is working on problems related to data processing and distributed execution with Ray. In fall ‘24, she will join the computer science faculty at the University of Washington.
Frank Sifei Luan is a PhD student in computer science at the Sky Computing Lab (previously RISELab) at UC Berkeley, advised by Ion Stoica. His research interest is in data, AI systems and cloud computing. Before Berkeley, he worked in the Big Code team at Facebook from 2017 to 2019. He received my bachelor’s degrees in computer science and statistics from the University of Chicago in 2017.
Come connect with the global community of thinkers and disruptors who are building and deploying the next generation of AI and ML applications.