
Data Warehousing Essay



Data Warehousing and Data Mining
Bruce Nimo
CIS 111
March 19, 2012
Prof. Jones
Data mining is a process of numerical analysis. Analysts use technical tools to query and sort through terabytes of data looking for patterns. Usually, the analyst will develop a hypothesis, such as "customers who buy product X usually buy product Y within six months."
Running a query on the relevant data to support or refute this hypothesis is data mining.
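The product X / product Y hypothesis above can be tested with a single query. The following is a minimal sketch, not a definitive method: the `purchases` table, its columns, the sample rows, and the function name `share_buying_y_after_x` are all assumptions made for illustration, and an in-memory SQLite database stands in for a real data store.

```python
import sqlite3


def share_buying_y_after_x(conn, product_x, product_y, months=6):
    """Return the fraction of product_x buyers who bought product_y
    within `months` months of their first product_x purchase."""
    x_buyers, followed_by_y = conn.execute(
        """
        WITH first_x AS (
            SELECT customer_id, MIN(purchase_date) AS x_date
            FROM purchases
            WHERE product = ?
            GROUP BY customer_id
        )
        SELECT COUNT(*),
               SUM(EXISTS (
                   SELECT 1
                   FROM purchases p
                   WHERE p.customer_id = first_x.customer_id
                     AND p.product = ?
                     AND p.purchase_date > first_x.x_date
                     AND p.purchase_date <= date(first_x.x_date, ?)
               ))
        FROM first_x
        """,
        (product_x, product_y, f"+{months} months"),
    ).fetchone()
    if not x_buyers:
        return 0.0  # nobody bought product_x, so there is nothing to measure
    return (followed_by_y or 0) / x_buyers


# Hypothetical sample data; dates are ISO strings so they compare correctly.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE purchases (customer_id TEXT, product TEXT, purchase_date TEXT)"
)
conn.executemany(
    "INSERT INTO purchases VALUES (?, ?, ?)",
    [
        ("c1", "X", "2012-01-10"),
        ("c1", "Y", "2012-03-05"),  # Y within six months of X
        ("c2", "X", "2012-01-15"),
        ("c2", "Y", "2012-09-01"),  # Y more than six months after X
        ("c3", "X", "2012-02-01"),  # never bought Y
        ("c4", "Y", "2012-02-01"),  # bought Y only, so not an X buyer
    ],
)

share = share_buying_y_after_x(conn, "X", "Y")
print(f"Share of X buyers who bought Y within six months: {share:.2f}")
```

A high share would support the hypothesis and a low share would count against it. What threshold counts as "usually" is a judgment the analyst has to make.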
Data warehousing describes the process of designing how the data is stored in order to improve reporting and analysis. Data warehouse experts consider that the various stores of data are ...

However, if the data warehouse expert designs a data storage system that closely connects relevant data in different databases, the data miner can run much more meaningful and efficient queries to improve the business.
Selection of architecture will determine, or be determined by, where the data warehouses and/or data marts themselves will reside and where the control resides. For example, the data can reside in a central location that is managed centrally. Or, the data can reside in distributed local and/or remote locations that are either managed centrally or independently. The architecture choices we consider in this book are global, independent, interconnected, or some combination of all three. The implementation choices to be considered are top down, bottom up, or a combination of both. It should be understood that the architecture choices and the implementation choices can also be used in combinations. For example, a data warehouse architecture could be physically distributed, managed centrally, and implemented from the bottom up, starting with data marts that service a particular workgroup, department, or line of business.
A global data warehouse is one that supports all, or a large part, of a corporation that requires a more fully integrated data warehouse with a high degree of data access and usage across departments or lines of business. That is, it is designed and constructed based on the needs of the enterprise as a whole. It could be considered a common repository for decision support data that is available across the entire organization, or a large subset thereof.
A common misunderstanding is that a global data warehouse is centralized. The term global is used here to reflect the scope of data access and usage, not the physical structure. The global data warehouse can be physically centralized or physically distributed throughout the organization. A physically centralized global warehouse is used by the entire organization, resides in a single location, and is managed by the Information Systems (IS) department. A distributed global warehouse is also used by the entire organization, but it distributes the data across multiple physical locations within the organization and is managed by the IS department. When we say that the IS department manages the data warehouse, we do not necessarily mean that it controls the data warehouse. For example, the distributed locations could be controlled by a particular department or line of business. That is, they decide what data goes into the data warehouse, when it is updated, which other departments or lines of business can access it, which individuals in those departments can access it, and so forth. However, to manage the implementation of...
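The distinctions drawn above (scope versus physical placement, and management versus control) can be made concrete with a small data model. This is only an illustrative sketch: the class names, field names, and the sample "Retail line of business" are hypothetical and do not come from any particular product or from the text itself.

```python
from dataclasses import dataclass
from enum import Enum


class Scope(Enum):
    """Architecture choice: who the warehouse is meant to serve."""
    GLOBAL = "global"
    INDEPENDENT = "independent"
    INTERCONNECTED = "interconnected"


class Placement(Enum):
    """Physical structure, which is a separate question from scope."""
    CENTRALIZED = "centralized"
    DISTRIBUTED = "distributed"


class Implementation(Enum):
    """Implementation choice."""
    TOP_DOWN = "top down"
    BOTTOM_UP = "bottom up"
    COMBINED = "combination"


@dataclass(frozen=True)
class WarehouseDesign:
    scope: Scope
    placement: Placement
    managed_by: str      # who operates the warehouse
    controlled_by: str   # who decides what data goes in and who may access it
    implementation: Implementation

    def describe(self) -> str:
        return (
            f"{self.scope.value} scope, physically {self.placement.value}, "
            f"managed by {self.managed_by}, controlled by {self.controlled_by}, "
            f"implemented {self.implementation.value}"
        )


# A global warehouse that is NOT centralized: scope and placement vary independently.
design = WarehouseDesign(
    scope=Scope.GLOBAL,
    placement=Placement.DISTRIBUTED,
    managed_by="IS department",
    controlled_by="Retail line of business",  # hypothetical controlling unit
    implementation=Implementation.BOTTOM_UP,
)
print(design.describe())
```

Keeping `managed_by` and `controlled_by` as separate fields mirrors the point that the IS department can manage a warehouse without controlling its contents.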

