Drive has 700+ articles for digital transformation leaders written by StarCIO Digital Trailblazer, Isaac Sacolick. Learn more.

It’s been almost two years since I last blogged about the threats and opportunities around Dark Data, so when I was asked to join Joseph Bradley in Cisco’s Future of IT podcast on Navigating Dark Data To Find Hidden Value in a Digital Era I couldn’t resist.To remind everyone, I provided the following definition of Dark Data in my post

Dark data is data and content that exists and is stored, but is not leveraged and analyzed for intelligence or used in forward looking decisions – Isaac Sacolick, see full definition

Since then, the subject of dark data has been covered by CIO.com (The Dangers of Dark Data), Forbes (Factories of the Future), and VentureBeat (Are you afraid of the dark data?). VentureBeat points to the cause of the problem, ”

As storage has become cheaper, those who generate data have grown used to hanging onto it… When data is “dark,” it’s often because the organizations that own it lack the tools, infrastructure, or skills to effectively leverage it – John Joseph

Internal Dark Data, that is dark data that is already captured and stored by the enterprise but not leveraged to drive insights or decision making represents a threat and a possible missed opportunity. The threat is if storing the data impacts the performance of a key business operation or contains sensitive information that should be better secured. The opportunity is to really figure out the value of retaining and processing this data.

Determining the Business Value of Stored, Dark Data

One of the themes we discussed during the podcast is developing the discipline on identifying the business value in dark data. To do this, use basic data visualization, analytics, and quality tools to identify the substance of the data and look to answer some basic questions:

Agile Data Discovery of Dark Data
    • How to catalog the data so that business users can learn about its existence? Can the data be broken down into basic entities, dimensions, metrics, and volumes to provide more details to business users looking for data sources?
    • Also, identify “know issues” with the data source. This can be measures of data quality, information on how the data is sourced, and other feedback that might undermine any analysis of the data.
    • Have a Data Governance board “score” this data set based on its potential vs. known issues. Absent of any easy way to quantify value, scoring by a voting committee can at least rank what data sets look attractive for further analysis.
  • Make sure that you have disciplined agile data scientists. Have them demo their findings to the Data Governance board and adjust the Score based on what was discovered.
As I said in the podcast, I suggest data scientists look at the value of data before investing too much time in discovery. Imagine that all the friction in the analysis because of the data set’s size, speed, complexity, variety, or quality can be “solved” given sufficient skills, tools, and time – what do you hope to get out of this data analytics or mining exercise?For additional insights, see my post 10 Attributes of Data-Driven Organizations.

Published on:

Leave a Reply


StarCIO

My company, StarCIO, provides leadership, learning, and advisory programs for companies looking to accelerate delivering business value from digital transformation. Contact me if you’d like to learn more about partnering opportunities.


Isaac Sacolick

Join us for a future session of Coffee with Digital Trailblazers, where we discuss topics for aspiring transformation leaders. If you enjoy my thought leadership, please sign up for the Driving Digital Newsletter and read all about my transformation stories in Digital Trailblazer.


Coffee with Digital Trailblazers hosted by Isaac Sacolick

Digital Trailblazers! Join us Fridays at 11am ET for a live audio discussion on digital transformation topics:  innovation, product management, agile, DevOps, data governance, and more!


Join the Community of StarCIO Digital Trailblazers

1 comment:

  1. It is refreshing to see an agile approach applied to Analytics in particular Dark Data. The amount of data and the lack of clarity of where to start make an agile approach the best method to maximize benefit and minimize risk.

About Drive

Drive Agility, Innovation, Transformation

Drive is the blog for digital transformation leaders brought to you by StarCIO and Isaac Sacolick.

Agility, Innovation, and Transformation are the three primary digital transformation core competencies that every StarCIO Digital Trailblazer must champion in their organizations. Learn more About Drive.


About the StarCIO Digital Trailblazer Community

StarCIO Digital Trailblazer Community

Revolutionizing traditional learning, networking, and advising experiences.

Visit the community


About StarCIO

StarCIO

About Isaac Sacolick

Isaac Sacolick

Author, 1,000+ articles, keynote speaker, Chief StarCIO Digital Trailblazer. Full bio


Driving Digital Newsletter

Driving Digital Newsletter

StarCIO Guides

StarCIO Agile Planning Guides

Digital Trailblazer

Digital Trailblazer by Isaac Sacolick

Driving Digital

Driving Digital by Isaac Sacolick

Driving Digital Standup

Driving Digital Standup

Coffee with Digital Trailblazers

StarCIO Coffee With Digital Trailblazers

Recognition

InfoWorld 2025 Judge
InfoWorld Technology of the Year 2024 Judge
Thinkers360 Top 10 in IT Leadership
Thinkers360 Top Agile Thought Leader
Thinkers360 Top DevOps Leader
Thinkers360 Top in Digital Transfomation
Thinkers360 Top in Analytics
Thinkers360 Top in Product Management

Discover more from StarCIO Digital Trailblazer Community

Subscribe now to keep reading and get access to the full archive.

Continue reading