A Simple Distributed Computing Platform , Programming Languages

You are asked to develop a replicator (client) that distributes a large job over a number of computers (a server group) on a single switched LAN (our BU04 lab). In this assignment, a large (simulation) job can be divided into a number of small jobs, each of which can be assigned to one machine from the server group for execution. The execution results of the small jobs can be merged once all of them successfully terminate.
System Architecture:
client server1 server2 server3 ...
| | | |
| | | |
| LAN | | |
|------------------|-------------|--------------|-----
The client and servers are running Network File System (NFS) so that user files are visible at $HOME directory. You may want to set up the following environment:
• $HOME/replicate.hosts: a list of (server) hostnames which participate in the simulation.
• $HOME/replicate_out: the directory that stores the small job execution result.
The simulation program "hyper_link" (binary) is provided. In this assignment, you don''''t need to know or care what "hyper_link" does, and actually it is a computing intensive (CPU demanding) simulator. The command line arguments of "hyper_link" are job# 100000 999 1 2 2 100, where the job number determines the number of small jobs in your simulation. To allow the client to run a large job, the job# should be given in a tuple: start, end, and step. For example, the command "hyper_link 1 100 1 100000 999 1 2 2 100" yields 100 small jobs with the job# starting from 1 to 100. Each small job produces a screen output (see example below) at the end (if finished successfully). Your code needs to redirect the output to a file and save it in $HOME/replicate_out. For example,
./hyper_link 1 100000 999 1 2 2 100
will produce a screen output looks like:
1:100000:999:2:0.5:1.124571e+00:7.744712e-04:6.245814e-01:6.957320e-04:0.000000e+00:0.000000e+00.
Requirements:
1. The communications between the replicator and servers are achieved through remote procedure calls.
2. A user interface is required for the replicator to control the server. A command line interface will be acceptable. A (working) graphic user interface (GUI) will impress the instructor and earn up to 20 bonus credits. Your interface should at least support the following operations.
o show the current CPU load of a certain server (if the server is active).
o show the current server status (active or inactive).
o stop a certain server.
o restart a certain server.
o show the CPU load (with the mark of hi-threshold) in the real time (for GUI interface only).
3. The replicator has to make sure all small jobs are successfully finished.
o If a server crashes (or not responsive), the running job (not finished yet) will be killed and rescheduled for execution.
o If a server CPU load exceed the preset threshold (the higher threshold), the replicator stops the server (and therefore kills the job).
o The replicate should keep polling the CPU load of the stopped server. Once the load becomes lower than the lower threshold (a preset value), the server should be reactivated to run the jobs.
o The replicator can also stop any server (through user interface) if needed. Once happened, the unfinished job will be killed.
o If a job terminates abnormally (e.g., being killed), the replicator has to reschedule the job execution later.
Submission:
Create a folder with the name of your Linux account, and copy all source code (better be divided into client and server two subdirectories) files/directories into the created folder. Make sure you have sshed to grail.cba.csuohio.edu, and then type: .

$ turnin -c cis620w -p proj2 your_folder
Posted Date: 4/4/2012 11:14:38 AM | Location : United States







Related Discussions:- A Simple Distributed Computing Platform , Assignment Help, Ask Question on A Simple Distributed Computing Platform , Get Answer, Expert's Help, A Simple Distributed Computing Platform Discussions

Write discussion on A Simple Distributed Computing Platform
Your posts are moderated
Related Questions
Develop (i) a context diagram and (ii) a use case diagram to model the following functionalities of NetFlix. Netflix is an online video/DVD renting company. A customer desiring

How do you print triangles using for loops and if statements

depth estimation from block matching algorithm motion estimation

Of the 6,325 kindergarten students who participated in the study, almost half or 3,052 were eligible for a free lunch program. The categorical variable sesk (1 == free lunch, 2 = n

Go to the following link and please do Problem 6. This is in C language. http://wikisend.com/download/304748/hw%206.pdf The code I need must be in C language and not C++ . Thank

A manager at the local Walmart needs to determine how many workers to hire to cover a 24-hour period. Each worker must work 9 consecutive hours but may only start work at the begin

Companies divide their subtasks of game's development. For large complex games, the development team can vary in size from 20 to 100 or more members. The most represented are artis

Tamagochi were all the rage in the 90's as a small toy that had limited functionality but modelled a pet. The "owner" could do the following • Feed the pet • Heal the pet

Expertsmind brings you unique solution in java assignments I.O streams The Java.io packages contains nearly every category you might ever need to execute feedback and resu