This repository contains example code for MemSQL Spark Streamliner.
MemSQL Spark Streamliner lets you build custom Spark pipelines to:
- extract from real-time data sources such as Kafka,
- transform data structures such as CSV, JSON, or Thrift into table rows,
- load your data into MemSQL.
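As a rough illustration of these three stages, here is a minimal, library-free Scala sketch. It does not use the MemSQL Spark Interface API: `PipelineSketch`, `extract`, `transform`, and `load` are hypothetical names, the in-memory sequence stands in for a source such as Kafka, and `load` only renders rows as text instead of writing to MemSQL.

```scala
// Hypothetical sketch of the extract / transform / load stages.
// None of these names come from the MemSQL Spark Interface.
object PipelineSketch {
  // A table row modelled as an ordered sequence of column values.
  type Row = Seq[String]

  // Extract: stand-in for a real-time source such as Kafka.
  def extract(): Seq[String] = Seq("1,alice,42", "2,bob,17")

  // Transform: split each CSV record into a row of column values.
  // Naive split: does not handle quoted fields or embedded commas.
  def transform(records: Seq[String]): Seq[Row] =
    records.map(_.split(",", -1).map(_.trim).toSeq)

  // Load: stand-in for writing rows to a MemSQL table.
  // Here it only renders each row as text.
  def load(rows: Seq[Row]): Seq[String] =
    rows.map(_.mkString("(", ", ", ")"))

  def main(args: Array[String]): Unit =
    load(transform(extract())).foreach(println)
}
```

In a real pipeline each stage would be implemented by an Extractor or Transformer class; the examples in this repository show the actual interfaces.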
Check out:
- Examples of Extractors
- Examples of Transformers
- ... and browse the code for more
Check out the MemSQL Spark Streamliner Starter repository, or read more on how to create custom Spark Interface JARs in our docs.
Please submit a pull request with new Extractors and Transformers.
When you contribute code, you affirm that the contribution is your original work and that you license the work to the project under the project's open source license. Whether or not you state this explicitly, by submitting any copyrighted material via pull request, email, or other means you agree to license the material under the project's open source license and warrant that you have the legal authority to do so.
Clone the repository, then run:
make build
The JAR will be placed in target/scala-<version>/. You can upload the JAR to MemSQL Ops and create a pipeline using this or your custom code.
To run the tests:
make test