Subject: Spark dataframe hdfs vs s3


Try to deploy Alluxio as a caching layer on top of S3, providing Spark a
similar HDFS interface?
Like in this article:
https://www.alluxio.io/blog/accelerate-spark-and-hive-jobs-on-aws-s3-by-10x-with-alluxio-tiered-storage/
On Wed, May 27, 2020 at 6:52 PM Dark Crusader <[EMAIL PROTECTED]>
wrote: