Reference no: EM133037179
Dtabase: Questions
Question 1: In traditional RDBMS, a table may contain which of the following?
Complex data structures.
Arrays.
Embedded classes.
All of the above.
None of the above.
Question2: Which of the following is a potential driver for the development and/or adoption of a NoSQL database?
Simplicity in the database design (e.g., the NoSQL database is simpler than the corresponding SQL database).
Minimize the number of translations from how the data is stored to how the data is consumed.
Flexibility in the database design.
All of the above. [Hint: a description for this answer would explain the bene?t for each of the above possible answers.]
None of the above. [Hint: a description for this answer would explain how SQL provides each of the above possible answers.]
Question 3: In the paper "MapReduce: Simpli?ed Data Processing on Large Clusters," by Jeffrey Dean and Sanjay Ghemawat, published in the Communications of the ACM, Jan 2008, the concept of performing a map function followed by a reduce function is introduced. The computations take what as input and produce what as output?
Tables, Tables.
Key/Value pairs, Key/Value pairs.
Documents, Tables.
XML, XML.
Question 4: In the paper "MapReduce: Simpli?ed Data Processing on Large Clusters," by Jeffrey Dean and Sanjay Ghemawat, published in the Communications of the ACM, Jan 2008, the concept of performing a map function followed by a reduce function is introduced. Which of the following is true of the way in which the MapReduce operations work?
Each instance of the map function receives the complete set of data.
Each instance of the MapReduce functionality randomly chooses whether to perform the map function or the reduce function.
Each instance of the reduce function receives all of the intermediate data from every map function.
All of the above. [Hint: a description for this answer would explain the basic operation for each of the above possible answers.]
None of the above. [Hint: a description for this answer would provide the correct operation and explain the basic of this operation for each of the above possible answers.]
Question 5: In the paper "MapReduce: Simpli?ed Data Processing on Large Clusters," by Jeffrey Dean and Sanjay Ghemawat, published in the Communications of the ACM, Jan 2008, the concept of performing a map function followed by a reduce function is introduced. Which of the following statements is true about the intermediate data generated by the map worker functions and the reduce worker functions? [Hint: If a single answer (a), (b) or (c) is chosen, explain why the chosen answer is correct and the other two are incorrect.]
The map intermediate results are stored on local machines.
The reduce worker results are stored on local machines.
The map intermediate results are never combined or manipulated before being input to the reduce worker function. d) All of the above. [Hint: a description for this answer would explain the basic operation for the map and reduce worker functions.]
e) None of the above. [Hint: a description for this answer would provide the correct operation and explain the basic operation for the map and reduce worker functions.]
Question 6: In the paper "MapReduce: Simpli?ed Data Processing on Large Clusters," by Jeffrey Dean and Sanjay Ghemawat, published in the Communications of the ACM, Jan 2008, the concept of performing a map function followed by a reduce function is introduced. Which of the following are identi?ed as advantages of MapReduce? [Hint: If a single answer (a), (b) or (c) is chosen, explain why the chosen answer is correct and the other two are incorrect.]
MapReduce allows programmers with no parallel system experience to exploit large amounts of resources easily.
MapReduce makes it possible to write simple programs that run e?ciently on large numbers of machines.
MapReduce allows for a large variety of problems to be easily expressible as MapReduce problems.
All of the above. [Hint: a description for this answer would explain why each of the answers above provides value.]
None of the above. [Hint: a description for this answer would explain why each of the answers above is not a value provided by MapReduce.]
Question 7: In the paper "BigTable: A Distributed Storage System for Structured Data," by Chang et al., published in OSDI 2006, a distributed storage system for managing structured data that scales to petabytes is presented. The goals for BigTable include which of the following?
Scalability
High performance
High availability
Wide applicability
All of the above. [Hint: a description of this answer would explain each of the answers a-d above.]
None of the above. [Hint: a description for this answer would provide an explanation of the goals for BigTable.]
Question 10: In the paper "BigTable: A Distributed Storage System for Structured Data," by Chang et al., published in OSDI 2006, a distributed storage system for managing structured data that scales to petabytes is presented. Which of the following best describes BigTable?
BigTable provides a scalable implementation of a traditional relational database.
BigTable provides a complex data model that supports dynamic control over data layout and format.
BigTable provides a simple data model that supports dynamic control over data layout and format.
BigTable provides a simple interface to a traditional relational database that supports dynamic control over data layout and format.
Question 11: In the paper "BigTable: A Distributed Storage System for Structured Data," by Chang et al., published in OSDI 2006, a distributed storage system for managing structured data that scales to petabytes is presented. BigTable provides which of the following traditional RDBMS functionalities?
Atomic write operations.
Access control rights.
Atomic read operations.
All of the above. [Hint: a description for this answer would explain each of the answers a-d above.] e) None of the above. [Hint: a description for this answer would provide two examples, and explanations of what functionalities are provided by BigTable.]
Question 12: In the paper "BigTable: A Distributed Storage System for Structured Data," by Chang et al., published in OSDI 2006, a distributed storage system for managing structured data that scales to petabytes is presented. BigTable utilizes a hierarchy analogous to that of a B+ tree. How many levels are in this hierarchy? [Hint: explain each level in the hierarchy.]
2
3
4
5
Question 13: Which of the following data is least suited to a traditional RDBMS database?
Book title.
Multiple authors.
Publication date.
An abstract spanning several paragraphs.
Question 14: In MongoDB, a table (called a collection) may contain which of the following?
Complex data structures.
Arrays.
Embedded classes.
All of the above.
None of the above.
Question 15: The paper "Dynamo: Amazon's Highly Available Key-value Store," by DeCandia et al., published in SOSP 2007, presents a highly available key-value storage system used by Amazon.com. Scalability and reliability are the primary drivers for the design and implementation of Dynamo. To this end, hardware and network failures are treated as which of the following?
The normal case.
Rarely occurring special case.
Often occurring special case.
They are not explicitly considered in the design.
Question 16: The paper "Dynamo: Amazon's Highly Available Key-value Store," by DeCandia et al., published in SOSP 2007, presents a highly available key-value storage system used by Amazon.com. Dynamo provides only what kind of access to the data store?
Candidate key
Primary key
Superkey
Foreign key
Question 17: The paper "Dynamo: Amazon's Highly Available Key-value Store," by DeCandia et al., published in SOSP 2007, presents a highly available key-value storage system used by Amazon.com. Dynamo relaxes which database property to maintain high availability?
Isolation
Durability
Atomicity
Consistency
Question 18: The paper "Dynamo: Amazon's Highly Available Key-value Store," by DeCandia et al., published in SOSP 2007, presents a highly available key-value storage system used by Amazon.com. Dynamo achieves its performance by utilizing which of the following techniques?
Uniform data distribution through consistent hashing.
Trade off durability guarantees for performance.
Uniformly assigning requests to nodes.
All of the above.
None of the above.
Question 19: Brewer's CAP Conjecture states:
It is possible for a distributed database to achieve consistency, availability, and partition tolerance simultaneously.
It is possible for a distributed database to achieve at most one of consistency, availability, and partition tolerance simultaneously.
It is impossible for a distributed database to achieve consistency, availability, and partition tolerance, or any combination of the three, simultaneously.
It is impossible for a distributed database to achieve consistency, availability and partition tolerance simultaneously.
Question 20: NoSQL databases are architecturally different from relational database because:
they represent relational data in a different manner.
they are designed to reap the read and write performance bene?ts of partition tolerance (horizontal scaling) while leaving either consistency or availability up for negotiation.
they are designed to reap the certainty bene?ts of consistency while leaving either partition tolerance or availability up for negotiation.
they are designed to reap the access bene?ts of availability while leaving either partition tolerance or consistency up for negotiation.
MongoDB:
doesn't provide ACID guarantees over a series of operations.
doesn't have the equivalent of an RDBMs' BEGIN, COMMIT and ROLLBACK semantics. c) supports atomic, durable updates on individual documents and consistent reads.
All of the above.
None of the above.
Question 21: Which of the following statements is true of MongoDB operations?
if you have a multi-operation transaction that decrements from one property and increments another (on a single document), then MongoDB can ensure atomicity and durability in this case.
if you have a multi-operation transaction that updates properties on separate or multiple documents, then MongoDB cannot ensure atomicity or durability in this case.
if you have a multi-operation transaction on a single document, locking strategies are used to ensure strong consistency at the potential cost of high availability.
All of the above.
None of the above.
Question 22: Which of the following types of databases is most suitable for banking transactions? [Hint: in your explanation describe an example scenario that illustrates why your chosen type works.]
Relational type
Document type
Graph type
Key-Value type
Question 23: The Cassandra database operates best under what relative read-write occurrences?
Equal numbers of reads and writes.
More writes than reads.
More reads than writes.
All of the above.
None of the above.
Question 24: MongoDB is designed to operate with data stored across multiple servers. The process of distributing different data across multiple servers is referred to as?
Sharding
Replication
Copying
Splitting
Question 25: Which of the following companies developed Cassandra?
Twitter
Google
LinkedIn
Facebook
Question 26: List ?ve 1-2 word advantages of NoSQL databases (generally speaking) over relational databases, and list ?ve 1-2 word disadvantages of NoSQL databases (generally speaking) compared to relational databases. Provide a one to two sentences explanation for each advantage and for each disadvantage.
Attachment:- Database questions.rar