
It’s Apache Spark time in our Big Data Roundup for the week ending June 12. At the Spark Summit West 2016, vendors big and small made announcements supporting the real-time big data analytics platform. Microsoft is getting behind Spark with several of its products. Distribution company Databricks revealed general availability of its Community Edition. And Intel declared Spark to be at the center of the big data revolution.
Let’s start with Microsoft. This week the company announced general availability of Apache Spark 1.6.1 for Azure HDInsight, and Power BI support for Spark Streaming. Azure HDInsight is Microsoft’s answer to Hadoop in the Azure cloud. Based on Hortonworks Data Platform Hadoop distribution, the service deploys and provisions managed Apache Hadoop clusters in the Azure cloud, providing a framework designed to process, analyze, and report on big data.
Now, the company is adding Spark for HDInsight, and Microsoft says it’s a popular service, being adopted in 50% of all new HDInsight clusters deployed.
“With GA, we are revealing improvements we’ve made to the service to make Spark hardened for the enterprise and easy for your users,” wrote Oliver Chiu, a senior product marketing manager for big data and data warehousing at Microsoft, in a blog post. “This includes improvements to the availability, scalability, and productivity of our managed Spark service.”
Microsoft also said it worked with Hortonworks to add capabilities to the YARN resource manager. In addition, Redmond co-led Project Livy with Cloudera and other organizations to create an open source Apache licensed REST web service for managing long-running Spark contexts and submitting Spark jobs.
Microsoft said it will offer an integration between Spark and the Azure Data Lake Store to enable Spark to store and process data of any size. Microsoft plans to enable role-based data access at the storage level through integration of Spark and the Data Lake Store.
And, for data scientists specifically, Microsoft introdcued out-of-the-box integration with Jupyter data science notebooks.
Microsoft had something for business intelligence professionals and analysts as well. The company will offer integration with Power BI and other BI tools such as Tableau, SAP Lumira, and QlikView.
“This lets you build interactive visualizations over data of any size,” Chui wrote. “In addition to the traditional dashboards, Power BI offers a streaming connector that has integration with Spark allowing you to publish real-time events from Spark Streaming directly to Power BI.”
Databricks
Databricks is the chief commercial distribution company behind Apache Spark and this week the company announced general availability of its Data Bricks Community Edition, a free version of its just-in-time data platform built on top of Apache Spark.
The company said in a statement that DCE is accessible to all users, making it easy and quick to learn Apache Spark without the need to deal with infrastructure concerns.
“This year we’ve seen explosive growth for the Apache Spark project and all signs indicate the pace will only accelerate as the community expands even more,” said Matei Zaharia, cofounder and chief technology officer at Databrick, in the statement. “Databricks Community Edition has created an ideal environment for learning Apache Spark. Developers of all backgrounds can now use Databricks Community Edition to learn Spark and mitigate the acute Spark skills gap.”
Intel
In her presentation at the Spark Summit, Ziya Ma, Intel’s Director of big data technologies, shared a statement made by the company’s No. 3 employee Andy Grove about how analytics would be the No. 1 one workload in the data center by the year 2020. “Analytics enriches people’s lives,” she said.
“We believe Spark is at the center of the analytics revolution,” Ma told attendees at the Summit.
In keeping with her company’s commitment to big data and analytics, Ma provided an update on Intel’s Trusted Analytics Platform. She also discussed a new chip announcement, the Xeon Processor E7 V4 family, which she said provided a 7x performance improvement for Spark workloads when moving from the previous generation of Intel hardware to this new one.
This article was originally published on www.informationweek.com and can be viewed in full


Archive
- October 2024(44)
- September 2024(94)
- August 2024(100)
- July 2024(99)
- June 2024(126)
- May 2024(155)
- April 2024(123)
- March 2024(112)
- February 2024(109)
- January 2024(95)
- December 2023(56)
- November 2023(86)
- October 2023(97)
- September 2023(89)
- August 2023(101)
- July 2023(104)
- June 2023(113)
- May 2023(103)
- April 2023(93)
- March 2023(129)
- February 2023(77)
- January 2023(91)
- December 2022(90)
- November 2022(125)
- October 2022(117)
- September 2022(137)
- August 2022(119)
- July 2022(99)
- June 2022(128)
- May 2022(112)
- April 2022(108)
- March 2022(121)
- February 2022(93)
- January 2022(110)
- December 2021(92)
- November 2021(107)
- October 2021(101)
- September 2021(81)
- August 2021(74)
- July 2021(78)
- June 2021(92)
- May 2021(67)
- April 2021(79)
- March 2021(79)
- February 2021(58)
- January 2021(55)
- December 2020(56)
- November 2020(59)
- October 2020(78)
- September 2020(72)
- August 2020(64)
- July 2020(71)
- June 2020(74)
- May 2020(50)
- April 2020(71)
- March 2020(71)
- February 2020(58)
- January 2020(62)
- December 2019(57)
- November 2019(64)
- October 2019(25)
- September 2019(24)
- August 2019(14)
- July 2019(23)
- June 2019(54)
- May 2019(82)
- April 2019(76)
- March 2019(71)
- February 2019(67)
- January 2019(75)
- December 2018(44)
- November 2018(47)
- October 2018(74)
- September 2018(54)
- August 2018(61)
- July 2018(72)
- June 2018(62)
- May 2018(62)
- April 2018(73)
- March 2018(76)
- February 2018(8)
- January 2018(7)
- December 2017(6)
- November 2017(8)
- October 2017(3)
- September 2017(4)
- August 2017(4)
- July 2017(2)
- June 2017(5)
- May 2017(6)
- April 2017(11)
- March 2017(8)
- February 2017(16)
- January 2017(10)
- December 2016(12)
- November 2016(20)
- October 2016(7)
- September 2016(102)
- August 2016(168)
- July 2016(141)
- June 2016(149)
- May 2016(117)
- April 2016(59)
- March 2016(85)
- February 2016(153)
- December 2015(150)