
Some reading this may be too young to remember the Kevin Costner movie, Field of Dreams where Costner’s character hears a voice tell him, “If you build it, they will come”. In the movie, the voice is referring to a baseball field and some long-departed baseball players but it’s surprising how that concept holds true in real-world situations.
Take traffic and road building, town planners have long been aware of the concept of “induced traffic”. If you build new roads to ease congestion, ironically, over time, more cars are attracted to these roads and congestion gets worse. The answer is integrated transport planning.
Data is no different. The term data gravity was coined in 2010 by a now, ever to be remembered software engineer called Dave McCrory. In a blog post, he described his data gravity concept. Here is a paraphrased outline of what Dave described:
“Data gravity is the ability of bodies of data to attract applications, services and other data. The force of gravity, in this context, is manifested in the way software, services and business logic are drawn to data relative to its mass (the amount of data) and as a result, are physically located closer to the data”.
The concept makes a lot of sense. Where you make data available at scale, no doubt smart data scientists, analysts, marketers and the like will be drawn to the data, like prospectors to California during the gold rush of the 1800s.
When McCrory came up with his definition, he didn’t just talk about the pull of data gravity. He also suggested that as the body of data grows bigger, it becomes more “immoveable”, requiring centralised infrastructure to “house it” and drawing those who want access to it to be physically closer as distance would cause latency and delays in generating insights.
Enter Cloudera with an enterprise data cloud that defies the laws of physics. Well, at least the metaphorical physics of data gravity!
Today’s demand for instant analytics at scale leads to two imperatives that are held back by data gravity.
The first is the fact that we need to collect data from the edge – i.e. from devices and remote locations. Not only does this data need to be collected but it needs to be analysed at the edge in real-time, sometimes pulling data from the core to enhance this “local” analysis.
The second is that the speed at which analytic processes need to be carried out is accelerating rapidly. It is no longer good enough to send data to a central store and then do historical point-in-time analysis. Data has to be analysed in transit and real-time, often from many locations instantaneously.
The only way to make this happen is to break one of the shackles of data gravity, namely the idea of all data being placed in one immovable location. Instead, your body of data can be geographically dispersed and fluid, changing in real-time.
Cloudera’s Enterprise Data Cloud makes this possible. You can find out more about it here.
One thing does not change, however. The body of data may no longer be shackled by location but the virtualised lake of data that Cloudera enables you to collate still meets the idea of “if you build it, they will come” and in-fact it accelerates it. Once you put data at scale on Cloudera’s cloud-based data platform, it becomes easier for more people to access and use. The net result is the gravitational pull for more data-driven applications gets even stronger.


Archive
- October 2024(44)
- September 2024(94)
- August 2024(100)
- July 2024(99)
- June 2024(126)
- May 2024(155)
- April 2024(123)
- March 2024(112)
- February 2024(109)
- January 2024(95)
- December 2023(56)
- November 2023(86)
- October 2023(97)
- September 2023(89)
- August 2023(101)
- July 2023(104)
- June 2023(113)
- May 2023(103)
- April 2023(93)
- March 2023(129)
- February 2023(77)
- January 2023(91)
- December 2022(90)
- November 2022(125)
- October 2022(117)
- September 2022(137)
- August 2022(119)
- July 2022(99)
- June 2022(128)
- May 2022(112)
- April 2022(108)
- March 2022(121)
- February 2022(93)
- January 2022(110)
- December 2021(92)
- November 2021(107)
- October 2021(101)
- September 2021(81)
- August 2021(74)
- July 2021(78)
- June 2021(92)
- May 2021(67)
- April 2021(79)
- March 2021(79)
- February 2021(58)
- January 2021(55)
- December 2020(56)
- November 2020(59)
- October 2020(78)
- September 2020(72)
- August 2020(64)
- July 2020(71)
- June 2020(74)
- May 2020(50)
- April 2020(71)
- March 2020(71)
- February 2020(58)
- January 2020(62)
- December 2019(57)
- November 2019(64)
- October 2019(25)
- September 2019(24)
- August 2019(14)
- July 2019(23)
- June 2019(54)
- May 2019(82)
- April 2019(76)
- March 2019(71)
- February 2019(67)
- January 2019(75)
- December 2018(44)
- November 2018(47)
- October 2018(74)
- September 2018(54)
- August 2018(61)
- July 2018(72)
- June 2018(62)
- May 2018(62)
- April 2018(73)
- March 2018(76)
- February 2018(8)
- January 2018(7)
- December 2017(6)
- November 2017(8)
- October 2017(3)
- September 2017(4)
- August 2017(4)
- July 2017(2)
- June 2017(5)
- May 2017(6)
- April 2017(11)
- March 2017(8)
- February 2017(16)
- January 2017(10)
- December 2016(12)
- November 2016(20)
- October 2016(7)
- September 2016(102)
- August 2016(168)
- July 2016(141)
- June 2016(149)
- May 2016(117)
- April 2016(59)
- March 2016(85)
- February 2016(153)
- December 2015(150)