GigaSpaces, the provider of in-memory computing technologies launched a data grid enabled real time analytics platform ,InsightEdge for faster data analytics using Apache Spark.“Gaining meaningful intelligence from data has typically been hindered by the speed at which data becomes usable. InsightEdge makes it possible for companies to make decisions that are informed by all of the data they have available at any given second.”- said Ali Hodroj, VP of Products and Strategy in the IMC Business Unit for GigaSpaces.
Azul, one of the leading providers for Java runtime solutions for developers has released the latest version of Zing runtime for Java which is a replacement for the legacy JVM’s. The latest version of Zing’s JVM enhances the performance and scalability of DataStax, Cassandra and Apache Spark applications.
For the complete list of big data companies and their salaries- CLICK HERE
Cask Data Application Platform (CDAP) is an open source , enterprise ready integration platform that provides necessary components for building production ready data platform around spark.The latest version of CDAP 3.4 provides easy to use API for Java and Scala,provides support for Spark SQL ,Spark Streaming and for fine grained transactions with Apache Tephra.
The book authored by Michael Malak and Robin East “Spark GraphX in Action” provides a complete tutorial based coverage of the graph processing library GraphX.The readers of this book can learn how to use SQL with Spark graphs by using GraphFrames API. The book will also help its readers learn on how to apply machine learning algorithms to graph data.
Apache Spark -The fast and flexible big data processing engine created by Matei Zaharia was honoured with four awards at the Datanami Readers' and Editors' Choice Awards with votes contributed by the big data community worldwide.The four honours to the Apache Spark framework include -
1) Readers' Choice - Best Big Data Product or Technology: Machine Learning
2) Readers' Choice - Best Big Data Product or Technology: Real-Time Analytics
3) Readers' and Editors' Choice - Top 5 Open Source Projects to Watch
4) Readers' Choice - Best Big Data Startup: Databricks
Databricks revealed the results of Apache Spark Survey conducted in July 2016 to analyse spark community growth trends. 900 different companies and 1615 Apache Spark users participated in this survey.The results favoured the growth of the swiss army knife of big data- Apache Spark.The survey results showed that the number of spark users and the meetup members has tripled since 2015 and also the number of people working on Spark projects has gone up by 67%.
The increasing demand for a native dplyr interface to Apache Spark has led to the innovation of a new package sparklyr that allows R programmers to tap into apache spark big data. Big giants in the industry have already started using the new interface- IBM has already incorporated sparklyr interface into their data science experience, H20 has an integration between H20 Sparkling Water and sparklyr and Cloudera is experimenting with the new interface to ensure that it meets the requirements of its enterprise customers.