Scalability
Because not only the volume of data is captured. Sometimes the data flowing into an organization’s system is high volume for a few hours, but then drops in volume for several hours. An MDW should be easily scalable, or at least with little effort, and should be able to scale horizontally or vertically on the fly.
Interoperability
An MDW should be able to connect with the legacy and on-premises data warehouse and any new DW coming onto the market. In organizations where multiple cloud environments are available, they should be able to connect with each other.
Reliability
Data must be processed and executed accurately and completely such that the expected output is achieved as intended, every time, always.
Note Out of the four V’s—volume, variety, velocity, and veracity—there is one that is inherent in the others—veracity. It means data should be accurate, error-free, reliable, consistent, bias-free, and complete, along with having multilingual capability. We are not counting this important factor because it is applicable in traditional data warehouses as well. It applies in all technologies and practices in all business cases.
Modern Data Warehouse Features: Distributed Processing, Storage, Streaming, and Processing Data in the Cloud
With the advent of 5G connectivity, edge computing is building a case for the platform that hyperscale cloud providers offer as part of their distributed cloud solutions.
There are multiple use cases for modern data warehouses in analytics, cloud-native applications, and edge computing to increase the capabilities in operations management, security and compliance, data analysis, resilience, enterprise integration, developer services, edge-computing immersive experience, and data/event reporting, among others.
There are multiple features of the modern data warehouse that make it modern. With the advent of processing, memory capabilities are enhanced in many technologies. There are a couple of features that were not available before that are available now. All these features may or may not be available in one single database solution. An ultimate solution could be a hybrid of two or more databases.