Estimate the height of the B+ tree

Reference no: EM1364810

Consider an inverted index containing, for each term, the posting list (i.e. the list of documents and occurrences within documents) for that term. The posting lists are accessed through a B+ tree with the terms serving as search keys. Each leaf of the B+ tree holds a sublist of alphabetically consecutive terms, and, with each term, a pointer to the posting list for that term.

Part a. An artificially small example of a B+ tree is shown here (pdf). (Note only part of the tree is shown in detail.) What nodes of the example B+ tree are visited to find the posting list for "dune"?

Part b. Suppose there are 2 million terms for a collection of 32 million documents of total size 200 gigabytes. We would like each internal node and each leaf of the B+ tree to fit in one 8-kilobyte page of the file system. Recall that a B+ tree has a parameter m called the order of the tree, and each internal node of a B+ tree has between m+1 and 2m+1 children (except the root, which has between 2 and 2m+1). Assume that each term is represented using 16 bytes, and each pointer to a child in the tree or to a posting list is represented using 8 bytes. Find a value for the order m of the B+ tree so that one 8-kilobyte page can be assigned to each internal node and each leaf, and so that an internal node fills, but does not overflow, its page when it has 2m+1 children. If you need to make additional assumptions, state them.
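One way to approach Part b is to write the page-size constraint as an inequality and solve for m. The sketch below is not an official solution. It assumes that an internal node with 2m+1 children stores 2m keys and 2m+1 child pointers, that a page is 8 × 1024 bytes, and that nodes carry no header overhead. The function names are illustrative only.

```python
# Assumptions (not stated in the problem): 1 KB = 1024 bytes, no per-node header,
# and an internal node with 2m+1 children stores exactly 2m search keys.
PAGE_BYTES = 8 * 1024
KEY_BYTES = 16      # one term, per the problem statement
POINTER_BYTES = 8   # one child or posting-list pointer, per the problem statement


def internal_node_bytes(m):
    """Bytes used by a full internal node: 2m keys plus 2m+1 child pointers."""
    return 2 * m * KEY_BYTES + (2 * m + 1) * POINTER_BYTES


def max_order(page_bytes=PAGE_BYTES):
    """Largest integer m with 2m*KEY + (2m+1)*POINTER <= page_bytes."""
    return (page_bytes - POINTER_BYTES) // (2 * (KEY_BYTES + POINTER_BYTES))


m = max_order()
print("order m:", m)
print("bytes used by a full internal node:", internal_node_bytes(m))
print("bytes needed at order m+1:", internal_node_bytes(m + 1))
```

Under these assumptions the constraint reduces to 48m + 8 ≤ 8192, so the largest admissible order is m = 170. A leaf holding up to 2m = 340 (term, pointer) pairs of 24 bytes each also fits in one page under the same assumptions.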

Part c. For your m of Part b, estimate the height of the B+ tree. (Giving a range of heights is fine.) Also estimate the amount of memory needed to store the tree, including leaves but not including the posting lists themselves.
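The height and memory estimate for Part c can be sketched by bounding the tree between its fullest and sparsest shapes. The code below is a rough estimate rather than a definitive answer. It assumes order m = 170 (the largest m satisfying 48m + 8 ≤ 8192), that each leaf holds between m and 2m terms, that every node occupies one full 8192-byte page, and that height is counted in levels including the leaf level. The helper names are hypothetical.

```python
import math

# Assumptions: order m = 170, leaves hold between m and 2m terms,
# every node (leaf or internal) occupies one full 8192-byte page.
TERMS = 2_000_000
PAGE_BYTES = 8 * 1024
M = 170


def levels_and_internal_nodes(num_leaves, fanout):
    """Count levels (leaf level included) and internal nodes for a uniform fanout."""
    levels, nodes, internal = 1, num_leaves, 0
    while nodes > 1:
        nodes = math.ceil(nodes / fanout)  # nodes needed one level up
        internal += nodes
        levels += 1
    return levels, internal


def estimate(terms_per_leaf, fanout):
    """Return (levels, leaves, internal nodes, total bytes) for one tree shape."""
    leaves = math.ceil(TERMS / terms_per_leaf)
    levels, internal = levels_and_internal_nodes(leaves, fanout)
    return levels, leaves, internal, (leaves + internal) * PAGE_BYTES


fullest = estimate(2 * M, 2 * M + 1)   # every node as full as allowed
sparsest = estimate(M, M + 1)          # every non-root node at minimum occupancy
print("fullest tree :", fullest)
print("sparsest tree:", sparsest)
```

Under these assumptions both extremes give a tree of three levels (root, one level of internal nodes, leaves), with roughly 5,900 to 11,800 pages in total, or about 48 MB to 97 MB. Nearly all of that space is leaf pages.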

Part d. Estimate the aggregate size of the posting lists.

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd