
Big data is proving to be bad data for some surveyed enterprises, with almost nine in 10 reporting they believe they are flowing bad data into their data stores, note results of a global survey issued Wednesday by StreamSets.
The finding – part of a global survey of 314 data management professionals at organizations and conducted by independent research firm Dimensional Research – illustrates the need for enterprises to adopt a data flow operations mindset, contends the San Francisco-headquartered company, which provides data ingest technology for the next generation of big data applications.
That conclusion may be further supported by that fact that only slightly more than one in 10 of the enterprises taking part in the survey consider themselves to be good at the key aspects of data flow performance management.
Specifically, StreamSets reports, 12% of respondents rate themselves as “good” or “excellent” across five key performance management areas, namely detecting the following events: pipeline down, throughput degradation, error rate increases, data value divergence and personally identifiable information violations.
Of those surveyed, 30% were from enterprises with 10,000-plus employees, 16% from enterprises with 5,000 to 10,000 employees, 29% from enterprises with 1,000 to 5,000 employees and 25% from enterprises with 500 to 1,000 employees. Three-quarters of respondents – from food and beverage, hospitality and entertainment, media and advertising, non-profit, retail, transportation, energy and utilities, telecommunications, government, services, education, healthcare, manufacturing and financial services sectors – were from the United States or Canada, 14% were from Europe, and the rest from Asia, Middle East/Africa, Australia/ New Zealand, and Mexico/Central America/South America.
StreamSets cautions that pervasive data pollution, which implies analytic results may be wrong, is leading to false insights that drive poor business decisions. “Even if companies can detect their bad data, the process of cleaning it after the fact wastes the time of data scientists and delays its use, which is deadly in a world increasingly reliant on real-time analysis,” the company contends.
In all, 68% of respondent enterprises cite ensuring data quality as the most common challenge they face when managing big data flows, 74% report currently having bad data in their stores (despite cleansing throughout the data lifecycle), and only 34% rate themselves as “good” or “excellent” at detecting diverging data values in flow.
Of those surveyed, 44% felt weakest with performance degradation, 44% with error rate increases and 34% with detecting divergent data. “Detecting a ‘pipeline down’ event was the only metric where a large majority felt positively about their capabilities (66%),” notes the StreamSets statement.
“The study showed that enterprises of all sizes face challenges on a range of key data performance management issues from stopping bad data to keeping data flows operating effectively,” the company reports.
“In today’s world of real-time analytics, data flows are the lifeblood of an enterprise,” says Girish Pancha, CEO of StreamSets. “The industry has long been fixated on managing data at rest and this myopia creates a real risk for enterprises as they attempt to harness big and fast data. It is imperative that we shift our mindset towards building continuous data operations capabilities that are in tune with the time-sensitive, dynamic nature of today’s data,” Pancha says.
All that said, “enterprises overwhelmingly report that they struggle to manage their data flows. What is required is a new organizational discipline around performance management of data flows with the goal of ensuring that next-generation applications are fed quality data continuously,” the report states.
A chasm exists between the “problem-detection capabilities data experts have today and what they desire. This translates into a lack of real-time visibility and control of data flows, operations, quality and security,” the report states.
“For companies who use big data to optimize current business operations or to make strategic decisions, it is critical that they ensure their big data teams have real-time visibility and control over the data at all times,” it emphasizes.
This article was originally published on www.canadianunderwriter.com and can be viewed in full


Archive
- October 2024(44)
- September 2024(94)
- August 2024(100)
- July 2024(99)
- June 2024(126)
- May 2024(155)
- April 2024(123)
- March 2024(112)
- February 2024(109)
- January 2024(95)
- December 2023(56)
- November 2023(86)
- October 2023(97)
- September 2023(89)
- August 2023(101)
- July 2023(104)
- June 2023(113)
- May 2023(103)
- April 2023(93)
- March 2023(129)
- February 2023(77)
- January 2023(91)
- December 2022(90)
- November 2022(125)
- October 2022(117)
- September 2022(137)
- August 2022(119)
- July 2022(99)
- June 2022(128)
- May 2022(112)
- April 2022(108)
- March 2022(121)
- February 2022(93)
- January 2022(110)
- December 2021(92)
- November 2021(107)
- October 2021(101)
- September 2021(81)
- August 2021(74)
- July 2021(78)
- June 2021(92)
- May 2021(67)
- April 2021(79)
- March 2021(79)
- February 2021(58)
- January 2021(55)
- December 2020(56)
- November 2020(59)
- October 2020(78)
- September 2020(72)
- August 2020(64)
- July 2020(71)
- June 2020(74)
- May 2020(50)
- April 2020(71)
- March 2020(71)
- February 2020(58)
- January 2020(62)
- December 2019(57)
- November 2019(64)
- October 2019(25)
- September 2019(24)
- August 2019(14)
- July 2019(23)
- June 2019(54)
- May 2019(82)
- April 2019(76)
- March 2019(71)
- February 2019(67)
- January 2019(75)
- December 2018(44)
- November 2018(47)
- October 2018(74)
- September 2018(54)
- August 2018(61)
- July 2018(72)
- June 2018(62)
- May 2018(62)
- April 2018(73)
- March 2018(76)
- February 2018(8)
- January 2018(7)
- December 2017(6)
- November 2017(8)
- October 2017(3)
- September 2017(4)
- August 2017(4)
- July 2017(2)
- June 2017(5)
- May 2017(6)
- April 2017(11)
- March 2017(8)
- February 2017(16)
- January 2017(10)
- December 2016(12)
- November 2016(20)
- October 2016(7)
- September 2016(102)
- August 2016(168)
- July 2016(141)
- June 2016(149)
- May 2016(117)
- April 2016(59)
- March 2016(85)
- February 2016(153)
- December 2015(150)