Subject: Spark dataframe hdfs vs s3


Maybe AWS network-optimized instances with higher bandwidth would improve the situation.