Use a postings intersection procedure to find the list of documents

Reference no: EM1370433

Q1.
Suppose we wish to use a postings intersection procedure to determine simply the list of documents that satisfy a /k clause, rather than returning the list of positions, as in Figure 2.12 (page 42). For simplicity, assume k ≥ 2. Let L denote the total number of occurrences of the two terms in the document collection (i.e., the sum of their collection frequencies). Which of the following is true? Justify your answer.

a. The merge can be accomplished in a number of steps linear in L and independent of k, and we can ensure that each pointer moves only to the right.

b. The merge can be accomplished in a number of steps linear in L and independent of k, but a pointer may be forced to move non-monotonically (i.e., to sometimes back up).

c. The merge can require kL steps in some cases.

Figure 2.12 can be found in this PDF: https://nlp.stanford.edu/IR-book/pdf/02voc.pdf
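
As a starting point for the justification, here is a minimal sketch of a document-only proximity merge. It assumes each postings list is a Python list of (doc_id, sorted_positions) pairs ordered by doc_id; the function name docs_within_k and this layout are illustrative, not taken from Figure 2.12. Inside a shared document the pointer at the smaller position always advances, so no pointer backs up and the work does not depend on k.

```python
def docs_within_k(postings1, postings2, k):
    """Return doc IDs where the two terms occur within k positions of each other.

    Assumed (hypothetical) layout: each postings list is a list of
    (doc_id, sorted_positions) pairs, sorted by doc_id.
    """
    answer = []
    i = j = 0
    while i < len(postings1) and j < len(postings2):
        doc1, pos1 = postings1[i]
        doc2, pos2 = postings2[j]
        if doc1 < doc2:
            i += 1
        elif doc2 < doc1:
            j += 1
        else:
            # Same document: walk both position lists, moving only rightwards.
            a = b = 0
            while a < len(pos1) and b < len(pos2):
                if abs(pos1[a] - pos2[b]) <= k:
                    answer.append(doc1)  # one match is enough for this document
                    break
                # Advance the pointer at the smaller position; it cannot
                # get any closer to a later position in the other list.
                if pos1[a] < pos2[b]:
                    a += 1
                else:
                    b += 1
            i += 1
            j += 1
    return answer
```

Every loop iteration advances at least one pointer, so the total number of steps is bounded by the combined length of the position lists, which is L.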

Q2.
How should the Boolean query x AND NOT y be handled? Why is naive evaluation of this query normally very expensive? Write out a postings merge algorithm that evaluates this query efficiently.
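
One possible merge is sketched below, assuming x and y are sorted lists of document IDs (the name and_not is illustrative). A naive evaluation would first build the postings list for NOT y, which covers almost the whole collection; this sketch instead walks the two existing lists once and keeps each document of x that is absent from y.

```python
def and_not(x, y):
    """Return the doc IDs that appear in x but not in y.

    Assumes x and y are lists of doc IDs sorted in increasing order.
    """
    answer = []
    j = 0
    for doc in x:
        # Skip entries of y that are smaller than the current doc of x.
        while j < len(y) and y[j] < doc:
            j += 1
        # Keep doc only if y is exhausted or its current entry differs.
        if j == len(y) or y[j] != doc:
            answer.append(doc)
    return answer
```

Both pointers only move forward, so the running time is linear in len(x) + len(y) rather than in the size of the collection.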

Q3.
If all the hub and authority scores are initialized to 1, what is the hub/authority score of a node after one iteration?
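
To check an answer on a small graph, the sketch below performs a single HITS update. It assumes both scores are computed from the initial values of 1 (a simultaneous update) and that no normalization is applied; the function name and the edge-list representation are illustrative. Under those assumptions a node's hub score becomes its number of outgoing links and its authority score becomes its number of incoming links.

```python
def hits_one_iteration(nodes, edges):
    """Run one unnormalized HITS update starting from all scores equal to 1.

    edges is a list of (source, target) pairs. Both new scores are
    computed from the initial scores (simultaneous update, an assumption).
    """
    hub = {n: 1 for n in nodes}
    auth = {n: 1 for n in nodes}
    new_hub = {n: 0 for n in nodes}
    new_auth = {n: 0 for n in nodes}
    for src, dst in edges:
        new_hub[src] += auth[dst]   # hub score sums authorities it links to
        new_auth[dst] += hub[src]   # authority score sums hubs linking to it
    return new_hub, new_auth


# Small example graph: a -> b, a -> c, b -> c
hub, auth = hits_one_iteration(["a", "b", "c"],
                               [("a", "b"), ("a", "c"), ("b", "c")])
```

If the authority update were instead applied after the hub update within the same iteration, the numbers would differ, so the update order should be stated in the answer.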

Q4.
In the preceding discussion we encountered two recommended "hard constants": the increment on t_e being ten times the last fetch time, and the number of back queues being three times the number of crawl threads. How are these two constants related?





All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd