Collaborate with a number core Spark contributors to co-design and co-develop enhancements to Spark in areas ranging from Performance to HA. This includes connecting to various datastores or real-time streams and figuring out ways to serverize Spark transforms.
- Extensive programming proficiency in Java and Scala,
- An expert-level understanding of Apache Spark as your work will involve modifying its internals
- Prior experience developing analytics infrastructure
- Desire to contribute code to the Apache Spark community in areas ranging from Machine
- Learning to Spark HA
- Experience with MLlib
- BS, MS in Computer Science, Mathematics, etc.