In my previous two blogs I touched upon what is data architecture, how it is important, overview and its relationship with the Information architectural layer.
This blog I wanted to touch on the best practices for Data Stewardship and the big data disruptions.
Lets start with what is Data Stewardship? According to http://searchdatamanagement.techtarget.com/definition/data-stewardship, data stewardship is the management and oversight of an organizations' data assets to help provide business users with high quality data that is easily accessible in a consistent manner. These roles are common in organizations that are attempting to exchange data precisely and consistently between computer systems and also making them available for reusability in the future (ref: https://en.wikipedia.org/wiki/Data_steward)
Some of the benefits of data stewardship that I see, with reference to those listed in wiki https://en.wikipedia.org/wiki/Data_steward
- data quality needs to be considered within a business process
- need of an effective governance strategy for data quality across the entire organization
With big data comes the efforts of managing this large amount of complex data within an organization. If the data quality is not maintained, as noted in the Gartner article, it can lead to the failing of many strategic business initiatives. Programs such as CRM (customer relationship management), BI will not be able to generate enough business if the quality of the data is not improved. The article talks about various ways in which this problem can be alleviated. One of the proposal that I don't completely agree with is " stewards residing in business and not in IT organization". In my opinion, even though we identify that the business is responsible for the quality of data, stewards should reside in the middle layer that connects both business and IT. As much as Data stewards need to understand the business, they also need to understand IT infrastructure, so that they can make sound decisions when it comes to harnessing data with quality, and if the current data architecture of the organization can even sustain such proposal. The only way data stewards can be respected within the organization is if they have a combined knowledge of both business and IT infrastructure. Having one of the two will not give them a holistic perspective of the entire data setup within the enterprise architectural layer.
Lets quickly touch on various roles/ responsibilities of data steward (ref: https://www.gartner.com/doc/554646/best-practices-data-stewardship)
Big data is an eye opener to the challenges that are being faced by the organizations on a daily basis related to data. It provides an insight to the deepest levels of the organizations, their relationships within the company and outside the company. If not handled properly by the business/ enterprise architects, they big data challenge can have a negative impact. Some of these impacts are, also as discussed in the Gartner article, reference above,
To summarize, as long as we as an organization understand the criticality and importance of Big Data, convert the findings through data mining into effective and impactful strategies, as an organization we can be more efficient, have stronger relationships with our employees, vendors and other organizations.
This blog I wanted to touch on the best practices for Data Stewardship and the big data disruptions.
Lets start with what is Data Stewardship? According to http://searchdatamanagement.techtarget.com/definition/data-stewardship, data stewardship is the management and oversight of an organizations' data assets to help provide business users with high quality data that is easily accessible in a consistent manner. These roles are common in organizations that are attempting to exchange data precisely and consistently between computer systems and also making them available for reusability in the future (ref: https://en.wikipedia.org/wiki/Data_steward)
Some of the benefits of data stewardship that I see, with reference to those listed in wiki https://en.wikipedia.org/wiki/Data_steward
- consistent use of data management resources
- easier mapability between various computer systems
- lower costs during migration
- avoiding redundancy and overlapping of information across the layers
- compliant with the organizations informational layer
- reusability
- data quality needs to be considered within a business process
- need of an effective governance strategy for data quality across the entire organization
With big data comes the efforts of managing this large amount of complex data within an organization. If the data quality is not maintained, as noted in the Gartner article, it can lead to the failing of many strategic business initiatives. Programs such as CRM (customer relationship management), BI will not be able to generate enough business if the quality of the data is not improved. The article talks about various ways in which this problem can be alleviated. One of the proposal that I don't completely agree with is " stewards residing in business and not in IT organization". In my opinion, even though we identify that the business is responsible for the quality of data, stewards should reside in the middle layer that connects both business and IT. As much as Data stewards need to understand the business, they also need to understand IT infrastructure, so that they can make sound decisions when it comes to harnessing data with quality, and if the current data architecture of the organization can even sustain such proposal. The only way data stewards can be respected within the organization is if they have a combined knowledge of both business and IT infrastructure. Having one of the two will not give them a holistic perspective of the entire data setup within the enterprise architectural layer.
Data steward in the middle of Business and IT layer |
- ensuring the consistency and accuracy of data
- implementing governance tasks and achieving data quality metrics
- responsibility in master data management objectives
- identifying issues with source systems
- updating and maintaining documentation, taxonomies
- proactively finding errors/ bugs or issues within data
- ensuring data is compliant with industry standards
Big data is an eye opener to the challenges that are being faced by the organizations on a daily basis related to data. It provides an insight to the deepest levels of the organizations, their relationships within the company and outside the company. If not handled properly by the business/ enterprise architects, they big data challenge can have a negative impact. Some of these impacts are, also as discussed in the Gartner article, reference above,
- although big data shows patterns on data types, if this information is not turned into a competitive advantage, then there is no use in spending money on all of these tools.
- big data will give a visual representation of cultural issues to business and IT leaders, this can be a positive impact, since the leaders can identify and work on solving these issues in a more proactive and effective manner
Big Data - Handle with Care |
To summarize, as long as we as an organization understand the criticality and importance of Big Data, convert the findings through data mining into effective and impactful strategies, as an organization we can be more efficient, have stronger relationships with our employees, vendors and other organizations.