Question

In: Computer Science

Consider the design of a processor, with a max instruction length of 600 ps. The propagation...

Consider the design of a processor, with a max instruction length of 600 ps. The propagation delay to load a register is 25 ps

What is the minimum clock cycle time, the instruction latency and CPU throughput using serial execution?

What is the minimum clock cycle time, the instruction latency and CPU throughput using a pipelined execution with 8 equal stages?

Consider a design which used n equal stages. What is the minimum clock cycle time, the instruction latency and CPU throughput expressed as a function of n? (You may wish to check that your generalization agrees with your results from parts (a) and (b), i.e., by substituting n = 1, 8.)

Solutions

Expert Solution

In case of serial execution, minimum clock cycle time = maximum instruction length = 600 ps

Instruction latency = time taken to finish a instruction = 600 ps

CPU throughput = number of instructions processed per second = 1/(time taken to finish a instruction) = 1/(600*10-12) = 1.67*109 instructions per second

When pipeline is divided into 8 equal states, then time taken by each stage = 600/8 = 75 ps

Also since propagation delay to load a register to pass the result from one stage of pipeline to next stage of pipeline is 25 ps . Hence in stage pipeline, this delay will occur 7 times.

Minimum clock cycle time = (maximum instruction length)/number of stages + time taken to load a register = 600/8 + 25 = 100 ps

Hence instruction latency = Total instruction length + 7* time taken to load register = 600 + 7*25 = 775 ps

Since in pipeline, each instruction effectively cause additional one clock cycle , hence CPU thoughput = 1/(clock cycle time) = 1/(100*10-12) = 10*109 instructions per second

If there are n stages in pipeline, then there will be (n-1) number of register load and hence delay of 25*(n-1) ns.

Minimum clock cycle time = (maximum instruction length)/n + delay due to load register = (600/n + 25) ns

So instruction latency = maximum instruction length + delay due to register load = 600 + 25*(n-1) ns

CPU throughput = 1/((600/n+25)*10-12) instructions per second

We can put n=8 to see our result match. However for n=1 since without pipeline, there will be no register load delay, hence serial execution will give different result.

Please comment for any clarification.


Related Solutions

At the initial stage of each instruction cycle, the processor fetches an instruction from memory. The...
At the initial stage of each instruction cycle, the processor fetches an instruction from memory. The processor interprets the instruction and performs the required action. Provide illustration of the process with a thorough explanation on the rudimentary instruction cycles the processor follows to execute fundamental instructions. Discuss how this processor interacts with the cache memory to give output speedily.
Assume a non-pipelined processor takes 100 ns to process an instruction. The same instruction can be...
Assume a non-pipelined processor takes 100 ns to process an instruction. The same instruction can be executed in a 5-stage pipelined processor, where each stage takes 20 ns. (i) What is the minimum number of instructions for which the speedup achieved by the pipelined processor compared to the non-pipelined processor is at least 4? (ii) The instruction takes the same time (100 ns) to execute whether on the pipelined processor or on the non-pipelined processor. So why do we say...
In a computer instruction format, the instruction length is 11 bits and the size of an...
In a computer instruction format, the instruction length is 11 bits and the size of an address field is 4 bits. Is it possible to have 5 two-address instructions 45 one-address instructions 32 zero-address instructions using the specified format? Justify your answer.
In a computer instruction format, the instruction length is 12 bits and the size of an...
In a computer instruction format, the instruction length is 12 bits and the size of an address field is 5 bits. The system architect has already designed three 2-address instructions and thirty one 1-address instructions. How many 0-address instructions can still be possibly accommodated?
describe vertical design of highways (vertical curves, types, length calculation, etc.).(max 1 page)
describe vertical design of highways (vertical curves, types, length calculation, etc.).(max 1 page)
Instruction to assigned to each processor. ARM Cortex-A12 Draw neatly architectural or block diagram of the...
Instruction to assigned to each processor. ARM Cortex-A12 Draw neatly architectural or block diagram of the selected processor. Identify the following listed below features and explain briefly. Pipelining and stages Memory or Cache Fetch Unit and Decode units Instruction issue policy Register Renaming Reorder Buffer Reservation Station Branch Prediction Execution Units Processor speed Number of Cores Number of Threads 36 Levels of cache and separate or shared cache or instruction and data cache Hyper-Threading or TLP
How does pipelining a processor affect instruction latency and throughput? Compare a basic 5 stage pipelined...
How does pipelining a processor affect instruction latency and throughput? Compare a basic 5 stage pipelined processor to a non-pipelined processor, show equations if necessary
Consider the following segment table: Segment Base Length 0 219 600 1 2300 14 2 90...
Consider the following segment table: Segment Base Length 0 219 600 1 2300 14 2 90 100 3 1327 580 4 1952 96 What are the physical addresses for the following logical addresses? 0,430 1,10 2,500 3,400 4,112
Consider pulse propagation in a dispersive, isotropic medium with a refractive index given as a function...
Consider pulse propagation in a dispersive, isotropic medium with a refractive index given as a function of vacuum wavelength lmda0 by: n(lmda0) = nc + a (lmda0/lmdac )^2 Here nc, a, and lmdac are fixed parameters describing the medium. a) Derive an expression for the phase velocity of radiation in this medium, and evaluate your answer for vacuum wavelength lmda0 = lmdac . b) Derive an expression for the velocity of a pulse propagating in this medium, and evaluate your...
Design and write a verilog code and testbench for a 16-bit RISC MIPS Processor on vivado...
Design and write a verilog code and testbench for a 16-bit RISC MIPS Processor on vivado and show waveform.
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT