A Simple Distributed Computing Platform, Programming Languages

You are asked to develop a replicator (client) that distributes a large job over a number of computers (a server group) on a single switched LAN (our BU04 lab). In this assignment, a large (simulation) job can be divided into a number of small jobs, each of which can be assigned to one machine in the server group for execution. The results of the small jobs can be merged once all of them have terminated successfully.
System Architecture:
client          server1       server2        server3    ...
  |                |             |              |
  |                |             |              |
  |      LAN       |             |              |
  |----------------|-------------|--------------|-----
The client and servers run Network File System (NFS), so user files are visible under the $HOME directory. You may want to set up the following environment:
• $HOME/replicate.hosts: a list of (server) hostnames that participate in the simulation.
• $HOME/replicate_out: the directory that stores the execution results of the small jobs.
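As a minimal sketch of that setup step, the client could read the host list and create the output directory as shown here. The assignment does not specify the file format, so one hostname per line with optional '#' comments is an assumption, and the function names are hypothetical.

```python
from pathlib import Path
from typing import List, Tuple


def load_hosts(hosts_file: Path) -> List[str]:
    """Return the hostnames listed in hosts_file.

    Assumed format (not given in the assignment): one hostname per line,
    blank lines ignored, anything after '#' treated as a comment.
    """
    hosts = []
    for line in hosts_file.read_text().splitlines():
        name = line.split("#", 1)[0].strip()
        if name:
            hosts.append(name)
    return hosts


def prepare_environment(home: Path) -> Tuple[List[str], Path]:
    """Create $HOME/replicate_out if needed and load $HOME/replicate.hosts."""
    out_dir = home / "replicate_out"
    out_dir.mkdir(exist_ok=True)
    return load_hosts(home / "replicate.hosts"), out_dir
```

Calling prepare_environment(Path.home()) would then give the replicator its server list and the directory for result files.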
The simulation program "hyper_link" (binary) is provided. In this assignment, you don't need to know or care what "hyper_link" does; it is simply a compute-intensive (CPU-demanding) simulator. The command-line arguments of "hyper_link" are job# 100000 999 1 2 2 100, where the job number determines the number of small jobs in your simulation. To allow the client to run a large job, the job# should be given as a tuple: start, end, and step. For example, the command "hyper_link 1 100 1 100000 999 1 2 2 100" yields 100 small jobs, with job# running from 1 to 100. Each small job prints its output to the screen (see the example below) when it finishes successfully. Your code needs to redirect that output to a file and save it in $HOME/replicate_out. For example,
./hyper_link 1 100000 999 1 2 2 100
produces screen output that looks like:
1:100000:999:2:0.5:1.124571e+00:7.744712e-04:6.245814e-01:6.957320e-04:0.000000e+00:0.000000e+00.
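The expansion of the (start, end, step) tuple into small jobs, and the redirection of each job's output, might be sketched as follows. The result file naming (job_N.out), the function names, the treatment of exit status 0 as success, and the check that the first output field equals the job number are all assumptions for illustration; the assignment fixes only the hyper_link arguments and the output directory.

```python
import subprocess
from pathlib import Path
from typing import List

# Fixed trailing arguments, taken from the assignment text.
FIXED_ARGS = ["100000", "999", "1", "2", "2", "100"]


def expand_jobs(start: int, end: int, step: int) -> List[int]:
    """Turn the (start, end, step) tuple into job numbers, end inclusive."""
    if step <= 0:
        raise ValueError("step must be positive")
    return list(range(start, end + 1, step))


def small_job_command(job_no: int, binary: str = "./hyper_link") -> List[str]:
    """Build the argument vector for one small job."""
    return [binary, str(job_no), *FIXED_ARGS]


def output_path(out_dir: Path, job_no: int) -> Path:
    """Result file for one job; the naming scheme is hypothetical."""
    return out_dir / f"job_{job_no}.out"


def run_small_job(job_no: int, out_dir: Path, binary: str = "./hyper_link") -> bool:
    """Run one job with stdout redirected to its result file.

    Assumes (not stated in the assignment) that exit status 0 means success.
    """
    with open(output_path(out_dir, job_no), "w") as out:
        result = subprocess.run(small_job_command(job_no, binary), stdout=out)
    return result.returncode == 0


def first_field_matches(job_no: int, output: str) -> bool:
    """Heuristic sanity check: the sample output starts with the job number."""
    return output.strip().split(":")[0] == str(job_no)
```

With these helpers, the example tuple 1, 100, 1 expands to job numbers 1 through 100, and each job's command matches the "./hyper_link 1 100000 999 1 2 2 100" form shown above.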
Requirements:
1. The communications between the replicator and servers are achieved through remote procedure calls.
2. A user interface is required for the replicator to control the servers. A command-line interface is acceptable. A (working) graphical user interface (GUI) will impress the instructor and earn up to 20 bonus credits. Your interface should support at least the following operations:
o show the current CPU load of a certain server (if the server is active).
o show the current server status (active or inactive).
o stop a certain server.
o restart a certain server.
o show the CPU load (with the hi-threshold marked) in real time (GUI only).
3. The replicator has to make sure all small jobs are successfully finished.
o If a server crashes (or becomes unresponsive), its running (unfinished) job will be killed and rescheduled for execution.
o If a server's CPU load exceeds the preset higher threshold, the replicator stops the server (and therefore kills the job).
o The replicator should keep polling the CPU load of the stopped server. Once the load drops below the lower threshold (a preset value), the server should be reactivated to run jobs.
o The replicator can also stop any server (through the user interface) if needed. When this happens, the unfinished job will be killed.
o If a job terminates abnormally (e.g., by being killed), the replicator has to reschedule it for execution later.
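The rules in requirement 3 amount to a small control policy: two thresholds with a gap between them, plus a queue of jobs that must be rerun. The sketch below models only that policy. The threshold values, class names, and method names are hypothetical, and the RPC calls that would actually poll load and start or kill jobs are deliberately left out.

```python
from collections import deque
from dataclasses import dataclass
from typing import Optional

HI_THRESHOLD = 0.80  # hypothetical preset values; the assignment gives none
LO_THRESHOLD = 0.30


@dataclass
class Server:
    name: str
    active: bool = True
    manual_stop: bool = False   # stopped by the user, not by load
    job: Optional[int] = None   # job number currently running, if any


class Replicator:
    def __init__(self, hosts, jobs):
        self.servers = {h: Server(h) for h in hosts}
        self.pending = deque(jobs)
        self.done = set()

    def stop(self, name, manual=False):
        """Stop a server; its unfinished job goes back on the queue."""
        s = self.servers[name]
        s.active = False
        s.manual_stop = s.manual_stop or manual
        if s.job is not None:
            self.pending.append(s.job)
            s.job = None

    def restart(self, name):
        """User-requested restart."""
        s = self.servers[name]
        s.active, s.manual_stop = True, False

    def on_load(self, name, load):
        """Apply one polled load sample; None means the server did not respond."""
        s = self.servers[name]
        if load is None or (s.active and load > HI_THRESHOLD):
            self.stop(name)
        elif not s.active and not s.manual_stop and load < LO_THRESHOLD:
            s.active = True

    def on_job_exit(self, name, ok):
        """Record a finished job, or requeue it after an abnormal exit."""
        s = self.servers[name]
        job, s.job = s.job, None
        if job is None:
            return
        if ok:
            self.done.add(job)
        else:
            self.pending.append(job)

    def dispatch(self):
        """Assign pending jobs to idle active servers; returns (host, job) pairs."""
        started = []
        for s in self.servers.values():
            if s.active and s.job is None and self.pending:
                s.job = self.pending.popleft()
                started.append((s.name, s.job))
        return started

    def finished(self):
        return not self.pending and all(s.job is None for s in self.servers.values())
```

Keeping the lower threshold well below the higher one means a server whose load hovers near the limit is not repeatedly stopped and restarted. In this sketch a server stopped by the user stays stopped until restart is called, which is one possible reading of the requirements.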
Submission:
Create a folder named after your Linux account, and copy all source code files/directories (preferably divided into two subdirectories, client and server) into it. Make sure you have sshed to grail.cba.csuohio.edu, and then type:

$ turnin -c cis620w -p proj2 your_folder
Posted Date: 4/4/2012 11:14:38 AM | Location : United States






