Briefly describe logic changes to the decode stage

Reference no: EM131438046

Homework - Clustered Pipeline Architecture

One of the benefits of a clustered pipeline architecture is that each cluster has a smaller register file. Smaller register files translate into faster register access. Additionally, splitting the physical register file into multiple smaller pieces allows more registers to be accessed in parallel for minimal additional power and area overhead. For architectures with a high degree of parallelism (an 8-way superscalar, for example), many registers are accessed at the same time, so it is important to reduce register file (RF) access time. One unwanted side effect is the extra communication overhead between clusters caused by intercluster dependencies.

[Figure: 994_Figure.png]

For more information about this architecture and its benefits and drawbacks you can read: https://www.hpl.hp.com/techreports/98/HPL-98-204.pdf

Review (Dispatch Bound vs. Issue Bound):

Dispatch Bound: In a dispatch-bound design, the issue queue contains reservation stations (memory elements that hold the values of an instruction's source operands once they are ready). For example, when an instruction is decoded, any source operands that are ready are read from the register file, and when the instruction moves to the functional unit, the values are sent along with it.

If the source operands are not ready, they are forwarded to the issue queue and stored in the reservation entities once the producing instruction completes.

Question 1 -

MOVC R1 #5
MOVC R2 #10
MOVC R3 #15
MOVC R4 #91
MOVC R5 #20
ADD R1 R1 #100
ADD R6 R1 R2
MULT R7 R6 R2
LOAD R8 R5 #500
DIV R9 R8 R4
STORE R7 #100 #200
STORE R9 #200 #300

Assumption: You have an instruction "CLUSTER #NO" that directs all subsequent instructions to the cluster determined by the instruction operand. For example, the following instruction sequence will send the ADD to cluster 1 and the MULT to cluster 2.

Cluster #1
ADD R1 R1 R1
Cluster #2
MULT R2 R2 R3

Please re-order the provided instruction sequence utilizing the cluster command to minimize intercluster communication.
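One way to approach the re-ordering is to first find which instructions are linked by read-after-write dependencies, since instructions in the same chain are the ones that would otherwise need intercluster communication. The following is a minimal sketch, not a model answer. It assumes that STORE has no destination register and reads every register operand, and that every other opcode writes its first operand. The names `parse`, `dependency_groups`, and `PROGRAM` are hypothetical helpers introduced for illustration.

```python
from collections import defaultdict

# The instruction sequence from Question 1, one instruction per line.
PROGRAM = """\
MOVC R1 #5
MOVC R2 #10
MOVC R3 #15
MOVC R4 #91
MOVC R5 #20
ADD R1 R1 #100
ADD R6 R1 R2
MULT R7 R6 R2
LOAD R8 R5 #500
DIV R9 R8 R4
STORE R7 #100 #200
STORE R9 #200 #300
"""


def parse(line):
    """Return (opcode, destination register or None, source registers).

    Assumption: STORE only reads registers; all other opcodes write
    their first operand and read the remaining register operands.
    """
    op, *operands = line.split()
    regs = [o for o in operands if o.startswith("R")]
    if op == "STORE":
        return op, None, regs
    return op, regs[0], regs[1:]


def dependency_groups(program):
    """Group instruction indices connected by read-after-write dependencies."""
    lines = [l for l in program.splitlines() if l.strip()]
    parent = list(range(len(lines)))  # union-find forest over instructions

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path halving
            i = parent[i]
        return i

    last_writer = {}  # register name -> index of its most recent producer
    for i, line in enumerate(lines):
        _, dest, sources = parse(line)
        for reg in sources:
            if reg in last_writer:
                # Merge the consumer with the chain of its producer.
                parent[find(i)] = find(last_writer[reg])
        if dest is not None:
            last_writer[dest] = i

    groups = defaultdict(list)
    for i in range(len(lines)):
        groups[find(i)].append(i)
    return sorted(groups.values())


if __name__ == "__main__":
    program_lines = PROGRAM.splitlines()
    for group in dependency_groups(PROGRAM):
        print([program_lines[i] for i in group])
```

Under these operand assumptions the sketch reports two dependency chains (one ending in STORE R7, one ending in STORE R9) plus MOVC R3, whose result is never read. Each chain is a natural candidate for a single CLUSTER block.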

Question 2 -

Please justify your answer to question 1.

Question 3 -

For the purpose of this question, you are allowed to add new data structures or modify existing structures inside the decode stage. Also assume that each cluster has limited capacity in terms of the number of instructions it can hold at a time. Propose any necessary changes so that dependent instructions, as much as possible, are executed in the same cluster. Briefly describe logic changes to the decode stage.

Note: Assume the FRAT belongs to the decode stage.
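As an illustration of the kind of decode-stage change Question 3 is asking about, the sketch below keeps one extra field per register recording the cluster that its latest producer was sent to, plus an occupancy counter per cluster. It is only one possible design under stated assumptions, not the expected answer. The class `SteeringDecoder` and its fields `producer_cluster`, `occupancy`, and `capacity` are hypothetical names; in hardware the cluster field could sit alongside each FRAT entry.

```python
from collections import Counter


class SteeringDecoder:
    """Toy model of dependence-based cluster steering at decode."""

    def __init__(self, num_clusters, capacity):
        self.capacity = capacity               # max in-flight instructions per cluster
        self.occupancy = [0] * num_clusters    # current in-flight count per cluster
        self.producer_cluster = {}             # register -> cluster of its latest producer

    def steer(self, dest, sources):
        """Pick a cluster for one instruction, or None if every cluster is full."""
        # Each source with a known producer votes for that producer's cluster.
        votes = Counter(
            self.producer_cluster[s] for s in sources if s in self.producer_cluster
        )
        # Try clusters holding producers first (most votes first) ...
        candidates = [c for c, _ in votes.most_common()]
        # ... then the remaining clusters, least occupied first.
        candidates += sorted(
            (c for c in range(len(self.occupancy)) if c not in votes),
            key=lambda c: self.occupancy[c],
        )
        for c in candidates:
            if self.occupancy[c] < self.capacity:
                self.occupancy[c] += 1
                if dest is not None:
                    self.producer_cluster[dest] = c
                return c
        return None  # no free slot anywhere: decode would stall

    def release(self, cluster):
        """Free one slot when an instruction leaves the given cluster."""
        self.occupancy[cluster] -= 1


if __name__ == "__main__":
    decoder = SteeringDecoder(num_clusters=2, capacity=2)
    print(decoder.steer("R1", []))      # no producers known: least-occupied cluster
    print(decoder.steer("R6", ["R1"]))  # follows the producer of R1
```

The capacity check is what distinguishes this from pure dependence steering: when the producer's cluster is full, the instruction falls back to another cluster and pays the intercluster communication cost instead of stalling.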

Question 4 -

While it is true that executing all dependent instructions on one cluster will reduce intercluster dependencies, this may limit parallelism and the utilization of all clusters. Please suggest a smarter mechanism that would achieve an acceptable tradeoff between reducing intercluster dependencies and maintaining a good level of parallelism.
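The tradeoff described in Question 4 can be made concrete with a simple threshold rule: follow the producer's cluster unless that cluster is much busier than the least-loaded one. This is a sketch of one possible heuristic, not a definitive mechanism. The function `choose_cluster` and the tuning parameter `max_imbalance` are hypothetical and do not come from the assignment.

```python
def choose_cluster(preferred, occupancy, max_imbalance):
    """Choose a cluster, trading dependence locality against load balance.

    preferred     -- cluster holding the instruction's producer, or None
    occupancy     -- list of in-flight instruction counts, one per cluster
    max_imbalance -- hypothetical threshold: how many more instructions the
                     preferred cluster may hold than the least-loaded one
    """
    least_loaded = min(range(len(occupancy)), key=occupancy.__getitem__)
    if preferred is None:
        # Independent instruction: use it to even out the load.
        return least_loaded
    if occupancy[preferred] - occupancy[least_loaded] > max_imbalance:
        # Preferred cluster is too far ahead: accept one intercluster
        # transfer to keep the other cluster busy.
        return least_loaded
    return preferred


if __name__ == "__main__":
    print(choose_cluster(0, [3, 1], max_imbalance=2))
    print(choose_cluster(0, [4, 1], max_imbalance=2))
```

A small `max_imbalance` favors parallelism at the cost of more intercluster forwarding, while a large value approaches the pure "keep dependents together" policy.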

Question 5 -

Describe one scenario where dispatch-bound would be more efficient than issue-bound.

Question 6 -

This question is similar to variation 2 for the architecture described in Lecture Slides 3, slide number 104. However, in this new proposed variation, each ROB entry holds a copy of the instruction result. Given the following:

1. N_R: Number of physical registers
2. N_S: Number of reservation entities
3. W_Matrix: Its entries are ready for the decode stage to use
4. RAT: RAT entries will always point to a physical register, never to an architectural register
5. Each ROB entry contains (destination physical register number, destination architectural register number, a memory to hold the result of the instruction once ready). Assume the variables Ph_NO, AR_NO, INST_VALUE respectively.
6. Free_List: A list of free physical registers
7. There is no Renamed[] vector.

At the decode stage, upon allocating a new register and performing renaming, there is a chance of freeing a physical register. Assuming the destination register number is R_Dest, write pseudocode that would free a physical register when necessary.

Question 7 -

During instruction execution, there are other places where a physical register might be freed. Briefly describe where else we need to put freeing logic and explain why.

CS 520: Computer Architecture and Organization, Spring 2017 Homework.

3/23/2017 7:39:16 AM

This is the final version of the assignment. All further clarifications will appear as announcements on Blackboard. A Turnitin assignment will be created on My Courses for this assignment. Please *do not* submit scanned copies of handwritten assignments. All assignments must be submitted as Word or PDF files or in any format accepted by Turnitin. Also, please abide by the number of lines suggested by each question.
