56 TOP Data Warehousing Multiple choice Questions and Answers pdf

The below List of 56 TOP Data Warehousing Multiple choice Questions and Answers for freshers and experienced pdf free download
1. With data mining, the best way to accomplish this is by setting aside some of your data in a vault to isolate it from the mining process; once the mining is complete, the results can be tested against the isolated data to confirm the model's _______.
2. The automated, prospective analyses offered by data mining move beyond the analyses of past events provided by _____________ tools typical of decision support systems.
3. The technique that is used to perform these feats in data mining is called modeling, and this act of model building is something that people have been doing for a long time, certainly before the _________ of computers or data mining technology.
4. Classification consists of examining the properties of a newly presented observation and assigning it to a predefined ____________.
5. During business hours, most ______ systems should probably not use parallel execution.
6. In contrast to statistics, data mining is ______ driven.
7. Data mining derives its name from the similarities between searching for valuable business information in a large database, for example, finding linked products in gigabytes of store scanner data, and mining a mountain for a _________ of valuable ore.
8. As opposed to the outcome of classification, estimation deal with __________ valued outcome.
9. The goal of ideal parallel execution is to completely parallelize those parts of a computation that are not constrained by data dependencies. The smaller the portion of the program that must be executed __________, the greater the scalability of the computation.
10. Data mining evolve as a mechanism to cater the limitations of ________ systems to deal massive data sets with high dimensionality, new data types, multiple heterogeneous data resources etc.
11. The goal of ideal parallel execution is to completely parallelize those parts of a computation that are not constrained by data dependencies. The ______ the portion of the program that must be executed sequentially, the greater the scalability of the computation.
12. The goal of ___________ is to look at as few blocks as possible to find the matching records(s).
13. In nested-loop join case, if there are ‘M’ rows in outer table and ‘N’ rows in inner table, time complexity is
14. Many data warehouse project teams waste enormous amounts of time searching in vain for a ________.
15. A dense index, if fits into memory, costs only ______ disk I/O access to locate a record by given key.
16. All data is ______________ of something real.
18. The key idea behind ___________ is to take a big task and break it into subtasks that can be processed concurrently on a stream of data inputs in multiple, overlapping stages of execution.
19. Non uniform distribution, when the data is distributed across the processors, is called ______.
20. The goal of ideal parallel execution is to completely parallelize those parts of a computation that are not constrained by data dependencies. The smaller the portion of the program that must be executed __________, the greater the scalability of the computation.
21. Data mining is a/an __________ approach, where browsing through data using data mining techniques may reveal something that might be of interest to the user as information that was unknown previously.
22. Data mining evolve as a mechanism to cater the limitations of ________ systems to dealmassive data sets with high dimensionality, new data types, multiple heterogeneous data resources etc.
23. ________ is the technique in which existing heterogeneous segments are reshuffled, relocated into homogeneous segments.
24. To measure or quantify the similarity or dissimilarity, different techniques are available. Which of the following option represent the name of available techniques?
25. For a DWH project, the key requirement are ________ and product experience.
26. Pipeline parallelism focuses on increasing throughput of task execution, NOT on __________ sub-task execution time.
27. Focusing on data warehouse delivery only often end up _________.
28. Pakistan is one of the five major ________ countries in the world.
29. _____________ is a process which involves gathering of information about column through execution of certain queries with intention to identify erroneous records.
30. Relational databases allow you to navigate the data in ____________ that is appropriate using the primary, foreign key structure within the data model.
31. DSS queries do not involve a primary key
32. __________ contributes to an under-utilization of valuable and expensive historical data, and inevitably results in a limited capability to provide decision support and analysis.
33. DTS allows us to connect through any data source or destination that is supported by _______.
34. If some error occurs, execution will be terminated abnormally and all transactions will be rolled back. In this case when we will access the database we will find it in the state that was before the ____________.
35. The need to synchronize data upon update is called
36. Taken jointly, the extract programs or naturally evolving systems formed a spider web, also known as
37. It is observed that every year the amount of data recorded in an organization is
38. Pre-computed _______ can solve performance problems
39. The degree of similarity between two records, often measured by a numerical value between _______, usually depends on application characteristics.
40. The purpose of the House of Quality technique is to reduce ______ types of risk.
41. NUMA stands for ________.
42. There are many variants of the traditional nested-loop join. If the index is built as part of the query plan and subsequently dropped, it is called
43. The Kimball s iterative data warehouse development approach drew on decades of experience to develop the _____________.
44. During the application specification activity, we also must give consideration to the organization of the applications.
45. The most recent attack is the ________ attack on the cotton crop during 2003- 04, resulting in a loss of nearly 0.5 million bales.
46. The users of data warehouse are knowledge workers in other words they are_________ in the organization.
47. _________ breaks a table into multiple tables based upon common column values.
48. _____modeling technique is more appropriate for data warehouses.
49. Multi-dimensional databases (MDDs) typically use ___________ formats to store pre-summarized cube structures.
50. Data warehousing and on-line analytical processing (OLAP) are _______ elements of decision support system.
51. Analytical processing uses ____________ , instead of record level access.
52. The divide & conquer cube partitioning approach helps alleviate the ____________ limitations of MOLAP implementation.
53. Data Warehouse provides the best support for analysis while OLAP carries out the _________ task.
54. Virtual cube is used to query two similar cubes by creating a third “virtual” cube by a join between two cubes.
56. Data warehousing and on-line analytical processing (OLAP) are _______ elements of decision support system.

0 comments: