Are Big Data Frameworks Accelerating to a Dead End?
Data storage and analytics frameworks aren’t keeping up with the growth in data creation today.
- By Jonathan Friedmann
- June 6, 2022
With data volume growing exponentially every year, the amount of digital data created over the next five years is expected to be more than double everything produced since the advent of digital storage.
There is promise in these lofty projections because big data has proven key to progress and innovation for countless industries in the digital age. For healthcare organizations, the ability to collect and analyze vast swaths of patient records has streamlined hospital management and catalyzed breakthrough discovery of cures. By leveraging big data, insurance companies can analyze beneficiary behavior to detect fraud; financial institutions have harnessed big data to anticipate behaviors and subsequently create more efficient strategies. Airlines such as Etihad plan to utilize big data analytics to improve fuel economy, minimize maintenance costs, optimize flight scheduling, and improve safety. The list goes on.
The feasibility of a future empowered by big data depends solely on industry leaders who set the tone and pace of data-driven innovation.
When we talk about “big data,” data itself is only half the equation. It can be easy to overlook the colossal storage and analytics frameworks needed to process that information and actually turn it into something usable. Big data frameworks such as Spark, Presto, BigQuery, Amazon Redshift, and others are rapidly evolving to address skyrocketing computing demands. Given big data’s staggering growth, are our processing technologies keeping pace, or are we losing the race to keep up ... big-time?
Bursting the Big Data Bubble
Key players in the space are offering insights into this question.
Databricks, a software company at the cutting edge of high-performance big data frameworks, recently charted the performance of analytics frameworks over the last decade. The company found that performance improved two to four times from 2016 to 2021, translating roughly to a 25 percent increase in performance year over year.
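As a quick sanity check on that arithmetic (a sketch of my own, not from the Databricks material; the `cagr` helper is illustrative), a two- to four-fold speedup over five years works out to roughly 15 to 32 percent per year, bracketing the 25 percent figure:

```python
# Annualized growth rate implied by a total speedup over a span of years
# (compound annual growth rate, CAGR).

def cagr(total_speedup: float, years: int) -> float:
    """Return the per-year growth rate that compounds to total_speedup."""
    return total_speedup ** (1 / years) - 1

low = cagr(2.0, 5)   # ~0.149 -> about 15% per year
high = cagr(4.0, 5)  # ~0.320 -> about 32% per year

print(f"2x over 5 years: {low:.1%} per year")
print(f"4x over 5 years: {high:.1%} per year")

# Conversely, 25% per year compounds to 1.25**5, i.e. about 3.05x over
# five years, squarely inside the reported 2x-4x range.
print(f"25% per year over 5 years: {1.25 ** 5:.2f}x total")
```

Either direction of the calculation confirms the article’s rough figure.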
Furthermore, judging by the trends I see in software tools, it is reasonable to assume that the key improvements driving software performance are drying up. For example, Databricks recently re-wrote its analytics engine in C++, moving from a high-level language to a low-level one in an effort to scrape out the last optimizations by getting closer to the hardware. This is a difficult undertaking, and it signals that there is not much headroom left to improve.
If data keeps growing much faster than our software processing capabilities, the industry will reach a critical pain point: the amount of data in the ether will far surpass our means to do anything with it. On current trends, processing capacity is already falling behind the exploding growth of data, opening an alarming gap in computing resources.
Mind the Gap
This impending “computation gap” is no secret in the industry. Multiple innovators have risen to the challenge and are already making progress bridging the void. Databricks and Meta Platforms, Inc., for example, both recently released new C++ libraries (Photon and Velox, respectively) designed to improve query performance and upgrade analytics processing.
However, this progress can also be viewed in a less-positive light: if the industry has reached a point where it must clamber to squeeze out any additional optimization, could this signify that we have all but exhausted our capabilities to maximize our software?
In response, some industry players are trying to re-engineer the lower tiers of their C++ libraries’ stack to upgrade performance -- in essence, scraping the bottom of the barrel to get the most out of an overwhelming amount of data.
Big data analytics has been essential to innovation in the 21st century. Unfortunately, if current trends continue, the growth of these capabilities will continue to be eclipsed by the exponentially greater growth of data itself. Without the capacity to take advantage of this data, countless businesses will miss out on critical advances.
If we hope to continue benefiting from all that big data has to offer, then it is high time for our industry to rethink the approach to both the hardware and software of analytics frameworks. Only by striving for new and unprecedented processing capabilities that evolve hand in hand with (if not even more rapidly than) the data they are tasked to assess will we be able to close the widening computing gap and usher in a new age.
Jonathan Friedmann is the co-founder and CEO at Speedata. Previously, Friedmann was CEO and co-founder of Centipede, which developed IP for general-purpose processors. He also served as COO and VP R&D at Provigent, a cellular infrastructure semiconductor company acquired by Broadcom. You can contact the author via LinkedIn.