Network design and analysis

Assignment Help Computer Networking
Reference no: EM131109103

Data Communications: Network Design and Analysis
Individual Assignment - Requirements Analysis  (Adapted from Oppenheimer's Chapter 4 Design Scenario)

Enterprise Description:

Genome4U is a scientific research project at a large university in the United States. Genome4U has recently started a large-scale project to sequence the genomes of 250,000 volunteers with a goal of creating a set of publicly accessible databases with human genomic, trait, and medical data.

The project's founder, a brilliant man with many talents and interests, tells you that the public databases will provide information to the world's scientific community in general, not just those interested in medical research. Genome4U is trying not to prejudge how the data will be used because there may be opportunities for interconnections and correlations that computers can find that people might have missed.

The founder envisions clusters of servers that will be accessible by researchers all over the world. The databases will be used by end users to study their own genetic heritage, with the help of their doctors and genetic counselors. In addition, the data will be used by computer scientists, mathematicians, physicists, social scientists, and other researchers.

The genome for a single human consists of complementary DNA strands wound together in a double helix. The strands hold about 6 billion base pairs of nucleotides connected by hydrogen bonds. To store the research data, 1 byte of capacity is used for each base pair. As a result, 6 Giga-Bytes of data capacity is needed to store the genetic information of just one person. On top of that, 10% of disk space is required for indexing. The project plans to use network-attached Storage (NAS) clusters. A system has been prototyped using the current version of FreeNAS software (www.freenas.org). Production software is expected to migrate to HBase and Hadoop cloud computing infrastructure. The NAS cluster will be made by standard computers (NAS servers) with 40 SATA hard-drives 2 TB each configured with RAID6 redundancy.

Genome4U has developed new techniques to sequence a person's genome quickly, accurately and most importantly at low cost. The research group is a contestant for the $10,000,000 X-Prize offered by Archon-Genomics (see https://genomics.xprize.org for details). With their current funding they expect to complete the pilot project with capability to store the research data for 1,000 individuals by December 2012. And can sequence 5,000 individuals every month thereafter.

In addition to genetic information, the project will ask volunteers to provide detailed information about their traits so that researchers can find correlations between traits and genes. Volunteers will also provide their medical records. Storage will be required for these data sets and the raw nucleotide data. This detailed medical information is expected to require not more than 100 Mega-Bytes of storage for each individual.

Since the data is to be publically shared, an initial community of 25,000 active users are expected, and this community expected to double every 18 months. Active users are expected to access 10% of the entire database daily which is expected to create huge demand on the networking infrastructure. For user navigation, search and management HTTP will be used as well as FTP for genome data transfer.

Also, the data center with the NAS and the research center with equipment to enumerate genome sequence are in different university campus buildings. To store one genome information in the NAS, 25% of traffic overhead is generated.

You have been brought in as a network design consultant to help the Genome4U project and the management team has asked you to help them organize their requirements.

They would appreciate your analysis to answer the following questions:

1. List the major user communities.

2. List project technical goals. Specify expected tradeoffs.

3. Calculate data storage requirements in the table below ( (Hint: do not forget RAID 6 waste of disk space)

Parameter

By December 2012

Next each month

Storage size

 

 

Number of NAS servers

 

 

4. Estimate additional bandwidth requirements between data and research center buildings in kbps. Assume that a Month is 20 work days and equipment works 10 hours per day. (Show all your calculations) 

5. Can you determine the relationship between the storage size, number of genomes, number of users and network capacity requirements? If possible express this as an equation.

 

6. Characterize the network traffic in terms of flow, load, behavior, and QoS requirements. You will not be able to precisely characterize the traffic but provide some theories about it and document the types of tests you would conduct to prove your theories right or wrong. 

Reference no: EM131109103

Questions Cloud

Who does not gain when a tariff is imposed : Who does not gain when a tariff is imposed? Although political arguments strongly favor free trade, most decisions affecting international trade are made in the economic arena.
Determine the heat received from the hot source : Determine the heat received from the hot source. Determine the thermal efficiency of the power plant. Determine the Carnot efficiency associated with the present power plant and compare it with the previous result.
Find the probability that this adult spends : (A) Find the probability that this adult spends less than 2.5 hours per week on the computer. (B) Find the probability that this adult spends between 2.2 hours and 6.5 hours on their home computer per week.
How the values of a, b and c are accessed in r''s print state : Consider the following C-like program that allows subprograms to nest. Show the sequence of frames, with static links, on the stack when r(16) is executed assuming we start execution (as usual) with a call to main(). Explain how the values of a, b..
Network design and analysis : Genome4U is a scientific research project at a large university in the United States. Genome4U has recently started a large-scale project to sequence the genomes of 250,000 volunteers with a goal of creating a set of publicly accessible databases wit..
What was the average per capita consumption : What was the average per capita consumption of commercially produced fresh vegetables in the country between 1980 and 2000 - which year was the per capita consumption closest to the average per capita consumption between 1980 and 2000?
Prepare the portion of the income statement : Prepare the portion of the income statement, starting with "Income before income taxes," for 2011.
Overbooking policy of the airline : The cost of this free round-trip ticket averages $250. Super Discount considers the cost of flying the plane from JFK to LAX a sunk cost. We would like to get insights for the overbooking policy of the airline. whats cost of overstocking?
Which type of unemployment is the most difficult to cure : Which type of unemployment is the most difficult to cure? The Humphrey-Hawkins Act's target rates for unemployment and inflation were reached by their target date of 1983.

Reviews

Write a Review

Computer Networking Questions & Answers

  Diane the consultant summary of case

Case study:Diane the consultant Summary of case : Construct a diagram using Rationale to map the arguments about a moral claim that you have identified in the article/case study:

  Assignment on domain design for security worksheet

Assignment on Domain Design for Security Worksheet, Research and examine a domain model for security that is different from the one you previously developed for this course. Assume that recent compromises of sensitive information require security e..

  Display the valve stored in a num by derefecing it

Display the valve stored in a num by derefecing it

  Osi model-switching systemsnetwork-channel processors

A switch is a Data Link layer device, which means it's able to look into the packets that pass through it to examine a critical piece of Data Link layer information: the MAC address. With this information in hand, a switch can keep track of which ..

  Service provided by transport layer and the network layer

Does Figure 6-1 illustrate a connectionless transport layer demultiplexing or connection-oriented transport layer demultiplexing? Explain your answer.

  Design a modern network for a private high school

You are required to design the network you would recommend and how it would be configured - Design a Modern Network for a Private High School.

  Explain function without the use of a wlan controller

What are the two terms most often used to describe an AP that is able to function without the use of a WLAN controller

  Discuss what actions can be taken by the tcp protocol

Discuss what actions can be taken by the TCP protocol to preserve a connection-oriented packet stream, particularly if there has been packet loss due to the route disruption. What are the consequences to throughput as a result of route breakage du..

  Determine maximum value in ring if there is unique initiator

Design an algorithm that, under the standard set of assumptions (bidirectional links, total reliability, connectivity), determines maximum value in the ring assuming that there is a unique initiator.

  Design and applying a range of appropriate deployment method

Discuss the technologies and security resources that support and are available in network infrastructure management by demonstrating the practical and conceptual usage by preparing the design and applying a range of appropriate deployment method

  Design single table to hold all of the information required

Design a single table to hold all of the information required to store an invoice including this information. Next, apply normalization to reduce this table to third normal form.

  What conditions would you choose to subnet a network

What is the OSI model and why is it important in understanding networking? Under what conditions would you choose to subnet a network

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd