What is the minimum budget to index all pages

Reference no: EM131965158

Assignment

You are in charge of the Genghis ('We execute fast') search engine. You are designing your server cluster to handle 500 million hits a day and 10 billion pages of indexed data. Each machine costs $1000, and can store 10 million pages and respond to 200 queries per second (against these pages).

1. If you were given a budget of $500,000 for purchasing machines, and were required to index all 10 billion pages, could you do it?

2. What is the minimum budget to index all pages? If you assume that each query can be answered by looking at data in just one (10 million page) partition, and that queries are distributed across partitions, what peak load (in number of queries per second) can such a cluster handle?

3. How would your answer to the previous question change if each query, on average, accessed two partitions?

4. What is the minimum budget required to handle the desired load of 500 million hits per day if all queries are on single partitions? Assume that queries are uniformly distributed with respect to time of day.

5. How would your answer to the previous question change if the number of queries per day went up to 5 billion hits per day? How would it change if the number of pages went up to 100 billion?

6. Assume that each query accesses just one partition, that queries are uniformly distributed across partitions, but that at any given time the peak load on a partition is up to 10 times the average load. What is the minimum budget for purchasing machines in this scenario?

7. Take the cost for machines from the previous question and multiply it by 10 to reflect the costs of maintenance, administration, network bandwidth, etc. This amount is your annual cost of operation. Assume that you charge advertisers 2 cents per page. What fraction of your inventory (i.e., the total number of pages that you serve over the course of a year) do you have to sell in order to make a profit?
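The arithmetic behind these questions can be sketched in a few lines of Python. This is only one way to model the problem, not an official solution. The function names are hypothetical, and the sketch assumes that one machine holds exactly one 10-million-page partition, that adding a replica of a partition adds another 200 queries per second for that partition, that a query touching k partitions costs one unit of load on each of them, and (for question 7) that every hit serves exactly one sellable page over a 365-day year.

```python
import math

# Figures taken from the assignment statement.
PAGES_PER_MACHINE = 10_000_000
QPS_PER_MACHINE = 200
COST_PER_MACHINE = 1_000      # dollars
SECONDS_PER_DAY = 86_400


def partitions_for(total_pages):
    """Machines needed just to store the index (one partition per machine)."""
    return -(-total_pages // PAGES_PER_MACHINE)  # integer ceiling


def peak_capacity_qps(partitions, partitions_per_query=1):
    """Total queries/second an unreplicated cluster can absorb when
    queries are spread evenly across partitions."""
    return partitions * QPS_PER_MACHINE / partitions_per_query


def machines_needed(total_pages, hits_per_day,
                    partitions_per_query=1, peak_factor=1):
    """Machines needed for both storage and load.

    Assumption: each partition is replicated until its replicas together
    cover the worst-case load that lands on that partition.
    """
    partitions = partitions_for(total_pages)
    avg_qps = hits_per_day / SECONDS_PER_DAY
    per_partition_qps = avg_qps * partitions_per_query * peak_factor / partitions
    replicas = max(1, math.ceil(per_partition_qps / QPS_PER_MACHINE))
    return partitions * replicas


def budget(machines):
    """Purchase cost in dollars."""
    return machines * COST_PER_MACHINE


def break_even_fraction(annual_cost, hits_per_day, price_per_page=0.02):
    """Fraction of yearly page inventory that must be sold to cover cost.
    Assumption: one hit equals one sellable page, 365 days per year."""
    yearly_revenue_if_sold_out = hits_per_day * 365 * price_per_page
    return annual_cost / yearly_revenue_if_sold_out


if __name__ == "__main__":
    pages, hits = 10_000_000_000, 500_000_000
    n = partitions_for(pages)
    print("storage machines:", n, "budget:", budget(n))
    print("peak qps, 1 partition/query:", peak_capacity_qps(n))
    print("peak qps, 2 partitions/query:", peak_capacity_qps(n, 2))
    print("machines at 500M hits/day:", machines_needed(pages, hits))
    print("machines at 5B hits/day, 10x peak:",
          machines_needed(pages, 5_000_000_000, peak_factor=10))
    print("break-even fraction:",
          break_even_fraction(10 * budget(n), hits))
```

Under these assumptions storage alone requires 1,000 machines, so the $500,000 budget in question 1 falls short. Question 6 does not say whether the 10x peak applies to 500 million or 5 billion hits per day, so the sketch leaves the daily load as a parameter rather than choosing one.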






All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd