pysparkling
A pure Python implementation of Apache Spark's RDD and DStream interfaces.
Python260other
2 years ago
apache-sparkdata-processingdata-science
EMR_Spark_Automation
A repository for deploying an AWS EMR cluster and submiting spark jobs on it. Bo
Python8apache-2.0
7 years ago