N
The Daily Insight

What is big data modeling?

Author

Christopher Pierce

Updated on February 28, 2026

Data modeling is a complex science that involves organizing corporate data so it fits the needs of business processes. It requires the design of logical relationships so the data can interrelate with each other and support the business.

What are the 3 types of big data?

Big data is classified in three ways:

  • Structured Data.
  • Unstructured Data.
  • Semi-Structured Data.

What is big data in performance management?

Big Data refers to the large collections of data that may be analysed to reveal patterns, trends and associations, especially relating to human behaviour and interactions. Forecasting better (eg customer’s future spending patterns, when machines will need replacing) so that more appropriate decisions can be made.

How big data are generated?

The bulk of big data generated comes from three primary sources: social data, machine data and transactional data.

What are the advantages of data models in big data?

• Performance: Good data models can help us quickly query the required data and reduce I/O throughput. • Cost: Good data models can significantly reduce unnecessary data redundancy, reuse computing results, and reduce the storage and computing costs for the big data system.

What is the best book on data warehouse modeling?

His book ‘The Data Warehouse Toolkit — The Complete Guide to Dimensional Modeling” is a classic of data warehouse modeling in the data warehouse engineering field. Dimensional Modeling tackles the issue of analytical decision making and requirement analysis.

How to create data warehouse models by using ER modeling?

To create data warehouse models by using ER modeling, we first need to integrate and combine the data in various systems thematically and from the perspective of the entire enterprise. We then need to process the data for consistency to enable analysis and decision making based on the data.

What is the data vault model by Dan Linstedt?

Dan Linstedt used ER model as a base to create the Data Vault Model. The model’s design is useful for integrating data, but one cannot use it directly for data analysis and decision making. The model emphasizes establishment of an auditable basic data layer focusing on data history, traceability, and atomicity.