What is the speedup in throughput for each of these improvements?

Reference no: EM13218085

Assume a GPU architecture that contains 10 SIMD processors. Each SIMD instruction has a width of 32, and each SIMD processor contains 8 lanes for single-precision arithmetic and load/store instructions, meaning that each non-diverged SIMD instruction can produce 32 results every 4 cycles. Assume a kernel with divergent branches that cause, on average, 80% of threads to be active. Assume that 70% of all SIMD instructions executed are single-precision arithmetic and 20% are load/store. Since not all memory latencies are covered, assume an average SIMD instruction issue rate of 0.85. Assume that the GPU has a clock speed of 1.5 GHz.

Questions:

(1) Compute the throughput, in GFLOP/sec, for this kernel on this GPU.

(2) Assume that you have the following choices:

(1) Increasing the number of single-precision lanes to 16

(2) Increasing the number of SIMD processors to 15 (assume this change doesn't affect any other performance metrics and that the code scales to the additional processors)

(3) Adding a cache that will effectively reduce memory latency by 40%, which will increase the instruction issue rate to 0.95

What is the speedup in throughput for each of these improvements?
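One possible way to set up the arithmetic is sketched below in Python. It is not an official solution, and it rests on assumptions that the problem statement does not spell out: each single-precision arithmetic instruction is counted as one floating-point operation per active thread (no fused multiply-add), a 32-wide instruction on N lanes retires N results per cycle (so 8 lanes give 32 results every 4 cycles and 16 lanes give 32 results every 2 cycles), and the factors for divergence, instruction mix, and issue rate simply multiply. The function name and parameter names are made up for illustration.

```python
def throughput_gflops(simd_processors=10, lanes=8, clock_ghz=1.5,
                      active_fraction=0.80, sp_fraction=0.70, issue_rate=0.85):
    """Estimate single-precision throughput in GFLOP/s.

    Assumption: one FLOP per active thread per arithmetic instruction,
    and all efficiency factors combine by multiplication.
    """
    # Peak results per nanosecond: clock (GHz) x processors x lanes.
    peak = clock_ghz * simd_processors * lanes
    # Scale by active threads, arithmetic share of instructions, and issue rate.
    return peak * active_fraction * sp_fraction * issue_rate


baseline = throughput_gflops()

# The three proposed improvements, each applied on its own.
options = {
    "16 single-precision lanes": throughput_gflops(lanes=16),
    "15 SIMD processors": throughput_gflops(simd_processors=15),
    "cache (issue rate 0.95)": throughput_gflops(issue_rate=0.95),
}

# Speedup is the improved throughput divided by the baseline throughput.
speedups = {name: value / baseline for name, value in options.items()}

print(f"baseline: {baseline:.2f} GFLOP/s")
for name, value in options.items():
    print(f"{name}: {value:.2f} GFLOP/s, speedup {speedups[name]:.2f}x")
```

Under these assumptions the baseline comes out to about 57.12 GFLOP/s. Doubling the lanes gives a speedup of 2, going from 10 to 15 processors gives 1.5, and raising the issue rate from 0.85 to 0.95 gives roughly 1.12. If the extra lanes in the first choice are read as helping only arithmetic instructions while load/store instructions still take 4 cycles, the first speedup would be smaller than 2.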

 






All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd