Question

In: Computer Science

Consider the design of a processor, with a max instruction length of 600 ps. The propagation...

Consider the design of a processor, with a max instruction length of 600 ps. The propagation delay to load a register is 25 ps

What is the minimum clock cycle time, the instruction latency and CPU throughput using serial execution?

What is the minimum clock cycle time, the instruction latency and CPU throughput using a pipelined execution with 8 equal stages?

Consider a design which used n equal stages. What is the minimum clock cycle time, the instruction latency and CPU throughput expressed as a function of n? (You may wish to check that your generalization agrees with your results from parts (a) and (b), i.e., by substituting n = 1, 8.)

Solutions

Expert Solution

In case of serial execution, minimum clock cycle time = maximum instruction length = 600 ps

Instruction latency = time taken to finish a instruction = 600 ps

CPU throughput = number of instructions processed per second = 1/(time taken to finish a instruction) = 1/(600*10-12) = 1.67*109 instructions per second

When pipeline is divided into 8 equal states, then time taken by each stage = 600/8 = 75 ps

Also since propagation delay to load a register to pass the result from one stage of pipeline to next stage of pipeline is 25 ps . Hence in stage pipeline, this delay will occur 7 times.

Minimum clock cycle time = (maximum instruction length)/number of stages + time taken to load a register = 600/8 + 25 = 100 ps

Hence instruction latency = Total instruction length + 7* time taken to load register = 600 + 7*25 = 775 ps

Since in pipeline, each instruction effectively cause additional one clock cycle , hence CPU thoughput = 1/(clock cycle time) = 1/(100*10-12) = 10*109 instructions per second

If there are n stages in pipeline, then there will be (n-1) number of register load and hence delay of 25*(n-1) ns.

Minimum clock cycle time = (maximum instruction length)/n + delay due to load register = (600/n + 25) ns

So instruction latency = maximum instruction length + delay due to register load = 600 + 25*(n-1) ns

CPU throughput = 1/((600/n+25)*10-12) instructions per second

We can put n=8 to see our result match. However for n=1 since without pipeline, there will be no register load delay, hence serial execution will give different result.

Please comment for any clarification.


Related Solutions

A RISC processor that uses the five-stage instruction fetch and execution design is driven by a...
A RISC processor that uses the five-stage instruction fetch and execution design is driven by a 1-GHz clock. Instruction statistics in a large program are as follows: Branch                                                20% Load                                                    30% Store                                                    10% Computational instructions                40% Please answer the following questions. 1-Assume 80% of the memory access operations are in the cache, and 20% are in the main memory. A cache miss has a 3-cycle penalty. What is the instruction throughput for non-pipelined execution? 2- Assume there are...
At the initial stage of each instruction cycle, the processor fetches an instruction from memory. The...
At the initial stage of each instruction cycle, the processor fetches an instruction from memory. The processor interprets the instruction and performs the required action. Provide illustration of the process with a thorough explanation on the rudimentary instruction cycles the processor follows to execute fundamental instructions. Discuss how this processor interacts with the cache memory to give output speedily.
Assume a non-pipelined processor takes 100 ns to process an instruction. The same instruction can be...
Assume a non-pipelined processor takes 100 ns to process an instruction. The same instruction can be executed in a 5-stage pipelined processor, where each stage takes 20 ns. (i) What is the minimum number of instructions for which the speedup achieved by the pipelined processor compared to the non-pipelined processor is at least 4? (ii) The instruction takes the same time (100 ns) to execute whether on the pipelined processor or on the non-pipelined processor. So why do we say...
In a computer instruction format, the instruction length is 11 bits and the size of an...
In a computer instruction format, the instruction length is 11 bits and the size of an address field is 4 bits. Is it possible to have 5 two-address instructions 45 one-address instructions 32 zero-address instructions using the specified format? Justify your answer.
In a computer instruction format, the instruction length is 12 bits and the size of an...
In a computer instruction format, the instruction length is 12 bits and the size of an address field is 5 bits. The system architect has already designed three 2-address instructions and thirty one 1-address instructions. How many 0-address instructions can still be possibly accommodated?
In a computer instruction format, the instruction length is 16 bits and the size of an...
In a computer instruction format, the instruction length is 16 bits and the size of an address field is 4 bits. Is it possible to have: 15 instructions with 3 addresses, 14 instructions with 2 addresses, 31 instructions with one address, and 16 instructions with zero addresses, using this format? Justify your answer.
describe vertical design of highways (vertical curves, types, length calculation, etc.).(max 1 page)
describe vertical design of highways (vertical curves, types, length calculation, etc.).(max 1 page)
Instruction to assigned to each processor. ARM Cortex-A12 Draw neatly architectural or block diagram of the...
Instruction to assigned to each processor. ARM Cortex-A12 Draw neatly architectural or block diagram of the selected processor. Identify the following listed below features and explain briefly. Pipelining and stages Memory or Cache Fetch Unit and Decode units Instruction issue policy Register Renaming Reorder Buffer Reservation Station Branch Prediction Execution Units Processor speed Number of Cores Number of Threads 36 Levels of cache and separate or shared cache or instruction and data cache Hyper-Threading or TLP
How does pipelining a processor affect instruction latency and throughput? Compare a basic 5 stage pipelined...
How does pipelining a processor affect instruction latency and throughput? Compare a basic 5 stage pipelined processor to a non-pipelined processor, show equations if necessary
I am struggling to write a python script that calculates the average length ,max length shortest...
I am struggling to write a python script that calculates the average length ,max length shortest length and overall average length of of a snake and ladder game. I use python 3 The length is the number the dice is rolled until the max is reached We need a python script that calculates the number of moves it will take for a single player to reach 100. We have to simulate a snake and ladder game using python language. So...
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT