Clustered pipeline architecture

Assignment Help Computer Engineering
Reference no: EM131434968

Clustered Pipeline Architecture:

One of the benefits of clustered pipeline architecture is that we have smaller register files in each cluster. Smaller register files translates into faster access to registers. Additionally, splitting the physical register file into multiple smaller pieces allows for more registers to be accessed in parallel for minimal additional power and area overhead. For architectures with high degree of parallelism (8 way superscalar as an example), many registers will be accessed at the same time, thus it is important to reduce RF (Register File) access time. One unwanted side effect is the extra communication overhead between clusters due to possible intercluster dependencies.

1769_Figure.jpg

For more information about this architecture and its benefits and drawbacks you can read :

https://www.hpl.hp.com/techreports/98/HPL-98-204.pdf

Review (Dispatch Bound VS Issue Bound):

Dispatch Bound: In dispatch bound, the issue queue contains reservation stations (Memory elements that will hold the value of dependents once ready). For example, when an instruction is decoded, its dependents are read from the register file when ready, and when the instruction moves to the functional unit, the values are also sent with the instruction.

If the dependants are not ready, these dependants will be forwarded to the issue queue and stored in the reservation entities once the producing instruction completes.

Question 1

MOVC R1 #5
MOVC R2 #10
MOVC R3 #15
MOVC R4 #91
MOVC R5 #20 ADD R1 R1 #100 ADD R6 R1 R2 MULT R7 R6 R2 LOAD R8 R5 #500 DIV R9 R8 R4
STORE R7 #100 #200
STORE R9 #200 #300

Assumption: You have an instruction "CLUSTER #NO" that directs all subsequent instructions to the cluster determined by the instruction operand. For example, the following instruction sequence will send the ADD to cluster 1 and MULT to cluster 2.

Cluster #1 ADD R1 R1 R1
Cluster #2 MULT R2 R2 R3

Please re-order the provided instruction sequence utilizing the cluster command to minimize intercluster communication.

Question 2

Please justify your answer to question 1

Question 3

For the purpose of this question, you are allowed to add new data structures or modify existing structures inside the decode stage. Also assume that each cluster has limited capacity in terms of the number of instructions it can hold at a time. Propose any necessary changes so that dependent instructions, as much as possible, are executed in the same cluster. Briefly describe logic changes to the decode stage.

Note: Assume FRAT belongs to the decode stage.

Question 4

While it is true that executing all dependent instructions on one cluster will reduce intercluster dependency, this may limit parallelism and utilization of all clusters. Please suggest a smarter mechanism that would achieve an acceptable tradeoff reducing intercluster dependencies and maintaining good level of parallelism.

Question 5

Describe one scenario where dispatch-bound would more efficient than dispatch-bound.

Question 6

This question is similar to variation 2 for the architecture described in slide Lecture Slides 3, slide number 104. However, in this new proposed variation, each Rob entry holds the a copy of the instruction result.

Given the following:
1. N_R: Number of physical registers
2. N_S: Number of reservation entities
3. W_Matrix -- Its entries are ready for you to use by the decode stage
4. RAT -- RAT entries will always point to physical register. It will never point to architectural register.
5. Each RoB Entry contains (Destination Physical Register Number, Destination Architectural Register Number, A memory to hold the result of the instruction once ready). Assume Variables : Ph_NO, AR_NO, INST_VALUE respectively.
6. Free_List: A list of free physical registers
7. There is no Renamed[] vector.

At the decode stage, upon allocating a new register and performing renaming, there is a chance of freeing a physical register. Assuming the destination register number is R_Dest, write a pseudocode that would free a physical register when necessary.

Question 7

During instruction execution, there are other places where a physical might be freed. Briefly describe where else we need to put freeing logic and explain why.

Reference no: EM131434968

Questions Cloud

Major advantages to an organization using maturity model : Explain to a fellow student the major benefits of project management oversight to an organization. What are three major advantages to an organization using the maturity model?
Should sids weight be a factor in how bill decides hiring : In one of our readings, Prof. Davidson's 2002 article "Leveraging differences for organizational excellence: managing diversity differently" proposes a new diversity model on "what needs to be in place in order to leverage difference, not merely ..
Differences between the minimum pay regulations : Discuss the main differences between the minimum pay regulations in the United States and China's practices. How do these differences affect companies ability to compete with other companies worldwide?
How a profitable company could be running short of cash : Write a brief paragraph on how a profitable company could be running short of cash. It can be a general discussion, or you can mention specifics about Cartwright in your discussion.
Clustered pipeline architecture : During instruction execution, there are other places where a physical might be freed. Briefly describe where else we need to put freeing logic and explain why
Undergraduate degree in business administration : You recently completed your undergraduate degree in Business Administration, majoring in Finance, at University. You are now working at PPL Corporation (https://www.pplweb.com), at their corporate headquarters, in Allentown, PA. Your first assignm..
How many times were data collected in this study : Find an article in a social science journal that reports the results of actual research. (Be sure it is not an article that only reviews and summarizes other research.) Answer the following questions about the article and the research. How many ti..
Discuss the characteristics of transformational leadership : Discuss and give examples demonstrating how your chosen leader exhibits the following fivecharacteristics of transformational leadership (see chapter 10, sec. 5 Contemporary Perspectives in your text for reference); namely, Intellectually Stimulat..
How much power to federal administrative agencies : Does the the federal government delegate too much power to federal administrative agencies? Do the courts defer too much to Congress in its delegation of power to agencies? Support your answer with factual evidence and cases you've found.

Reviews

Write a Review

Computer Engineering Questions & Answers

  Question1-lu decomposition given a lower triangular matrix

question1-lu decomposition given a lower triangular matrix l write down a program to solve linear system lx b. count

  Design an application that finds all 4 digit numbers

make an application that finds all 4 digit numbers (those would be the numbers in the range 1000-9999) whose value is equal to the sum of each digit raised to the fourth power. MEANINGFUL output should be sent to the screen.

  Make a web page that contains two selection lists

Pick your favorite sport and search the internet for current roster of players for five teams. design a web page that contains two selection lists: one that displays a drop-down menu of team names and the other a multi-line selection list that dis..

  Obtain a histogram of the image

Obtain a histogram of the image, and using the fact that the quantized image should be as good an approximation as possible for the original, design 1-, 2-, and 3-bit quantizers for this image.

  Implementing the code

Write down a complete program which declares an integer variable, reads a value from the keyboard into that variable. Write down a program which determines the mileage reimbursement for the salesperson at a rate of $.35 per mile.

  Create a jquery mobile app to record your daily activities

Create a JQuery mobile app to record your daily activities - Only the activities of the current user are displayed. Users can select an activity from the listview and see the activity details.

  Create a data model that site needs to support its business

Create a data model that the site needs to support its business process. Include entities to show what types of information the site needs.

  Briefly explain the difference mesh, bus, ring, and star top

Briefly explain the difference, including advantages and disadvantages Ethernet, Token Ring, FDDI, and Wireless.

  Explain porters five forces model

Describe for Bill what a Supply Chain Management (SCM) solution would look like for his ABC Pizza business and how his business could benefit from implementing an SCM solution.

  Find speed of execution of application programs

As computer word size gets larger and larger, there is a law of diminishing returns; the speed of execution of application programs does not increase and may, in fact, decrease. define why this can occur.

  In a 5 bit crc error detecting scheme suppose that we

in a 5 bit crc error detecting scheme assume we choose the generating bit sequence as 11011.a encode the bit stream

  Questionpart iassume you wanted to implement a control unit

questionpart iassume you wanted to implement a control unit for a computer that you are designing.would you use a

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd