Wednesday, May 14, 2025

One Huge Cluster Caught: Knowledge Asset Standardization


Knowledge asset standardization is the purposeful and thoroughly deliberate consolidation of redundant, contradictory reviews, processes, and databases into enterprise requirements. The proliferation of knowledge belongings can have the best hostile impression on environmental well being; standardization has many well being advantages:

  • Reduces the chance that ill-constructed belongings take down processes, nodes, and clusters
  • Reduces competition and competitors for compute and storage
  • Reduces course of and repair failures and related troubleshooting effort
  • Reduces effort spent sustaining and supporting redundant belongings

Though the impacts of knowledge asset standardization on environmental well being may be greater than another class on this sequence, the enterprise worth advantages exponentially outweigh them: commonplace knowledge definitions, improved knowledge governance, constant knowledge interpretation, higher knowledge trustworthiness, and improved data-driven choice making. Ideally you’re realizing these advantages utilizing Cloudera Knowledge Catalog.  

Whole knowledge standardization is a multiyear journey and sure pointless, however the low hanging fruit is ripe for the choosing. We strongly suggest embarking on this journey till returns diminish.

Report Standardization

Take these steps:

  1. Stock reviews, together with possession, utilization statistics, and report frequency. 
  2. Goal for retirement any reviews unused within the final 12 months, then within the final 6 months. Pay explicit consideration to report frequency as low utilization of an annual report could also be acceptable.
  3. Choose a report archival technique commensurate along with your buyer partnership dynamic (we hope you’re not in knowledge purgatory )
    1. Two weeks earlier than, every week earlier than, and the day of the archival, notify report house owners as to which reviews you propose to archive, permitting them a grace interval to object and supply justification for the reviews continued existence. 
    2. Conversely, archive them with out notification and restore a report when anybody shouts about it.
  4. Archive focused reviews. In Tableau, we choose to easily assign report possession to a system person which prohibits additional use whereas enabling us to simply restore it if requested and justified.
  5. Repeat the train quarterly. In our expertise, 80-90% of reporting stock may be archived in as little as 2 quarters.
    1. In case your visualization device employs extract jobs, cease them, and notice any database archival targets.
  6. Often examine the appropriateness of report refresh charges and negotiate.
  7. Over time, consolidate further belongings by grafting closely used report options and capabilities into enterprise commonplace dashboards then retire redundant legacy reviews. Admittedly, that is troublesome and time consuming work normally undertaken as a method to trusted knowledge, not environmental well being. 

DB Standardization

  1. As earlier than, stock database belongings, possession, refresh frequency, and related utilization statistics. 
  2. Goal short-term/testing databases and person databases owned by former FTE. 
  3. Talk far and broad. We’re not as courageous as to archive dbs with out notification and permission usually. We get pleasure from our jobs and wish to preserve them.
  4. Archive the databases. We normally archive into a typical archival database. In our expertise, this will cut back 35-55% of manufacturing tables. 
  5. Often negotiate refresh charges and knowledge retention insurance policies with database house owners.
  6. We strongly suggest taking the multiyear journey to standardize centralized knowledge belongings into enterprise requirements as a lot as doable as it could considerably enhance knowledge trustworthiness and correct data-driven choice making.

Pipelines and Jobs Standardization

Database asset standardization will establish archival alternatives for (1) the pipeline stock, right here referring to processes which transfer knowledge from one repository or supply to a different repository or curated dataset, in addition to (2) the roles stock, right here referring to queries which give views or persist knowledge throughout the atmosphere. Standardizing processes is excessive effort with diminishing returns on environmental well being; subsequently, start with processes that:

  • Often fail
  • Are most crucial
  • Are most often up to date
  • Are probably the most useful resource intensive

As all the time, when you want help figuring out or executing knowledge asset standardization, interact our Skilled Companies consultants. We did!

Related Articles

LEAVE A REPLY

Please enter your comment!
Please enter your name here

Stay Connected

0FansLike
3,912FollowersFollow
0SubscribersSubscribe
- Advertisement -spot_img

Latest Articles