News on Apache Spark - September 2016
GigaSpaces Launches the Next Generation Apache Spark Distribution.BusinessWire.com, September 6,2016
GigaSpaces, the provider of in-memory computing technologies launched a data grid enabled real time analytics platform ,InsightEdge for faster data analytics using Apache Spark.“Gaining meaningful intelligence from data has typically been hindered by the speed at which data becomes usable. InsightEdge makes it possible for companies to make decisions that are informed by all of the data they have available at any given second.”- said Ali Hodroj, VP of Products and Strategy in the IMC Business Unit for GigaSpaces.
Azul's New Zing Release Brings Cassandra, Spark Performance. Boost.eweek.com,September 7, 2016.
Azul, one of the leading providers for Java runtime solutions for developers has released the latest version of Zing runtime for Java which is a replacement for the legacy JVM’s. The latest version of Zing’s JVM enhances the performance and scalability of DataStax, Cassandra and Apache Spark applications.
For the complete list of big data companies and their salaries- CLICK HERE
Taking Spark Apps from Prototype to Production.Dzone.com, September 12,2016.
Cask Data Application Platform (CDAP) is an open source , enterprise ready integration platform that provides necessary components for building production ready data platform around spark.The latest version of CDAP 3.4 provides easy to use API for Java and Scala,provides support for Spark SQL ,Spark Streaming and for fine grained transactions with Apache Tephra.
Spark GraphX in Action Book Review and Interview.InfoQ, September 12,2016.
The book authored by Michael Malak and Robin East “Spark GraphX in Action” provides a complete tutorial based coverage of the graph processing library GraphX.The readers of this book can learn how to use SQL with Spark graphs by using GraphFrames API. The book will also help its readers learn on how to apply machine learning algorithms to graph data.
Apache Spark Earns Datanami Awards for Machine Learning, Real-time Analytics, and More. Databricks.com,September 19,2016
Apache Spark -The fast and flexible big data processing engine created by Matei Zaharia was honoured with four awards at the Datanami Readers' and Editors' Choice Awards with votes contributed by the big data community worldwide.The four honours to the Apache Spark framework include -
1) Readers' Choice - Best Big Data Product or Technology: Machine Learning
2) Readers' Choice - Best Big Data Product or Technology: Real-Time Analytics
3) Readers' and Editors' Choice - Top 5 Open Source Projects to Watch
4) Readers' Choice - Best Big Data Startup: Databricks
Apache Spark Survey 2016 Results Now Available.Databricks.com,September 27, 2016
Databricks revealed the results of Apache Spark Survey conducted in July 2016 to analyse spark community growth trends. 900 different companies and 1615 Apache Spark users participated in this survey.The results favoured the growth of the swiss army knife of big data- Apache Spark.The survey results showed that the number of spark users and the meetup members has tripled since 2015 and also the number of people working on Spark projects has gone up by 67%.
sparklyr — R interface for Apache Spark. RStudio.org, September 27, 2016
The increasing demand for a native dplyr interface to Apache Spark has led to the innovation of a new package sparklyr that allows R programmers to tap into apache spark big data. Big giants in the industry have already started using the new interface- IBM has already incorporated sparklyr interface into their data science experience, H20 has an integration between H20 Sparkling Water and sparklyr and Cloudera is experimenting with the new interface to ensure that it meets the requirements of its enterprise customers.