The cookie-datasets library provides various DataFrame readers for Apache Spark.
A few variants are supported:
Coming Soon - 0.2
Some of the datasets are stored in common file formats. For those, a reader for the format itself is also available.
Example data: www.cs.ie.ntu.edu.tw
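The source does not say which format the example data uses. Assuming it is the LIBSVM sparse text format (one record per line, written as `label index:value index:value ...`), the sketch below shows how a single record breaks down. It is plain Python for illustration only and is not the cookie-datasets reader; the function name `parse_libsvm_line` is hypothetical.

```python
from typing import Dict, Tuple


def parse_libsvm_line(line: str) -> Tuple[float, Dict[int, float]]:
    """Parse one 'label idx:val idx:val ...' record.

    Assumption: the data is in LIBSVM sparse text format. This helper is
    illustrative and not part of the cookie-datasets API.
    """
    tokens = line.split()
    if not tokens:
        raise ValueError("empty record")

    # The first token is the label; the rest are sparse index:value pairs.
    label = float(tokens[0])
    features: Dict[int, float] = {}
    for token in tokens[1:]:
        index, value = token.split(":", 1)
        features[int(index)] = float(value)
    return label, features


if __name__ == "__main__":
    print(parse_libsvm_line("+1 3:0.5 10:1 42:-2.25"))
```

Only the non-zero features appear in a record, which is why the result is kept as an index-to-value mapping rather than a dense list.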
Coming Soon - 0.2