
Bringing Governance to Big Data
December 11, 2015 News big data Hadoop

This article was originally published by it-director.com and can be viewed in full here

It should be completely obvious that data being “big” doesn’t mean that it needs any less governance or security than any other sort of data. However, all too many companies seem to have the same blind spot over data quality for big data that they used to have over conventional data: pretty much an “it’ll be alright on the night” mentality, along with an assumption that they can rely on hand coding if they need to.

In part, perhaps, this attitude has been fostered by the lack of specific tools designed for big data environments such as Hadoop. However, this position is changing. Earlier this year Trillium announced Trillium Big Data and now, within a matter of days of one another, both IBM and Informatica have made important announcements with respect to big data. As these two are the big beasts in this area, not just for governance but also for data integration, their announcements are the most significant and the most likely to sway the market.

Of course, each vendor will claim that it has advantages over the other. From an IBM perspective the main features of these releases are that:

  • the solutions run on Apache Spark,
  • Optim data masking has been integrated into Information Server so that you can mask as a part of your transformation and loading processes,
  • you can combine this with the InfoSphere Information Governance Catalog for things like discovering the lineage of masked data, and
  • InfoSphere Data Replication provides real-time replication into Hadoop.