GATE CSE 2013 | Question: 45

Question

GATE CSE 2013 | Question: 45

Kriss Singh asked Sep 5, 2014 • edited Jun 21, 2021 by Lakshman Bhaiya

47,887 views

Consider an instruction pipeline with five stages without any branch prediction:

Fetch Instruction (FI), Decode Instruction (DI), Fetch Operand (FO), Execute Instruction (EI) and Write Operand (WO). The stage delays for FI, DI, FO, EI and WO are $\text{5 ns, 7 ns, 10 ns, 8 ns and 6 ns},$ respectively. There are intermediate storage buffers after each stage and the delay of each buffer is $1\ \text{ns}.$ A program consisting of $12$ instructions $\text{I1, I2, I3,}\ldots,\text{ I12}$ is executed in this pipelined processor. Instruction $\text{I4}$ is the only branch instruction and its branch target is $\text{I9}.$ If the branch is taken during the execution of this program, the time (in ns) needed to complete the program is

$132$
$165$
$176$
$328$

Kriss Singh asked Sep 5, 2014 • edited Jun 21, 2021 by Lakshman Bhaiya

Kriss Singh

47.9k views

See all

Show 7 previous comments

10 Answers

Best answer

After pipelining we have to adjust the stage delays such that no stage will be waiting for another to ensure smooth pipelining (continuous flow). Since we can not easily decrease the stage delay, we can increase all the stage delays to the maximum delay possible. So, here maximum delay is $10$ ns. Buffer delay given is $1$ ns. So, each stage takes $11$ ns in total.

FI of $\text{I9}$ can start only after the EI of $\text{I4}.$ So, the total execution time will be
$$15 \times 11 = 165$$
$$\small \begin{array}{|c|c|c|c|c|c|c|c|c|c|c|c|c|c|c|c|} \hline
&\bf{t_1}&\bf{t_2}&\bf{t_3}&\bf{t_4}&\bf{t_5}&\bf{t_6}&\bf{t_7}&\bf{t_8}&\bf{t_9}&\bf{t_{10}}&\bf{t_{11}}&\bf{t_{12}}&\bf{t_{13}}&\bf{t_{14}}&\bf{t_{15}}\\
\hline
\textbf{I1}&\text{FI}&\text{DI}&\text{FO}&\text{EI}&\text{WO}\\
\textbf{I2}&&\text{FI}&\text{DI}&\text{FO}&\text{EI}&\text{WO}\\
\textbf{I3}&&&\text{FI}&\text{DI}&\text{FO}
&\text{EI}&\text{WO}\\
\textbf{I4}&&&&\text{FI}&\text{DI}&\text{FO}&\text{EI}&\text{WO}\\
&&&&&\color{red}{\text{stall}}\\
&&&&&&\color{red}{\text{stall}}\\
&&&&&&&\color{red}{\text{stall}}\\
\textbf{I9}&&&&&&&&\text{FI}&\text{DI}&\text{FO}&\text{EI}&\text{WO}\\
\textbf{I10}&&&&&&&&&\text{FI}&\text{DI}&\text{FO}&\text{EI}&\text{WO}\\
\textbf{I11}&&&&&&&&&&\text{FI}&\text{DI}&\text{FO}&\text{EI}&\text{WO}\\
\textbf{I12}&&&&&&&&&&&\text{FI}&\text{DI}&\text{FO}&\text{EI}&\text{WO}\\
\hline\end{array}$$

Correct Answer: $B$

gatecse answered Sep 6, 2014 • edited Jun 21, 2021 by Lakshman Bhaiya

gatecse

See all

Bhushan Laware

commented Dec 27, 2017

Why we are not using the stage delays for FI, DI, FO, EI and WO are 5 ns, 7 ns, 10 ns, 8 ns and 6 ns, respectively and taking only highest one?

At least for first Instruction we suppose to use it if its given.

first instruction will take 5+7+10+8+6=36 and 36+5 (buffer delay)= 41 ns and remaining instruction will take (upto 4 only) 11 ns * 3= 33. thus total time after 4th instruction execution 41+33 =74 ns.

similary, form 9 to 12 th instruction it will take 41 ns (for 9th) + 33 ns (from 10 to 12th)=74ns.

and thus total time required for execution 74 + 74 =148 ns (No option is given)

What I am saying here is that if they are giving time for each stage then why we are not using it and just solve it by assuming only highest delay.

Page:

1
2
3
next »

@Ritik Jain RJ because it is the best case, that we come to know about branch decision in EX stage — codingo1234, Dec 2, 2018
Always count for branch instruction separately. first instruction=5 Clock, now remaining (I2 I3 I10 I11 I12 I4)=6 clock and at last branch instruction will take 4 clock.

Total=(5+6+4)*11=165 ns — Nitesh Singh 2, Dec 31, 2018
Because during execution only we came to know where we have to branch. During execution effective address for the next instruction will be calculated and loaded into PC — Pirtpal Singh, Sep 10, 2021
in the question it is not mentioned that whether it is a synchronous or asynchronous pipeline , how do we conclude this is a synchronous pipeline ??

i read all the cmnts but still need help ? — Pranay Datta 1, Sep 6, 2014
Actually to do with that- that is allow different stage delays with pipelining processor must use different clocks- which is not something common- I don't know if any exist. And doing so doesn't add any advantage also as the final output depends on the slowest stage anyway- it doesn't really make sense to do some stage faster as the slowest stage is there as bottle neck. So, for simplicity (I guess) all stages are done using the same clock.

http://web.cs.iastate.edu/~prabhu/Tutorial/PIPELINE/pipe_title.html — Arjun, Sep 6, 2014
Actually i`m bit confused with this computer architecture ,

So much confusion with this . write through write back dma pipeline anyways thanx :) — Pranay Datta 1, Sep 6, 2014
Thing here is we don't know what is actually practical there in architecture- we are in abstract world and concrete world is kind of vague. So, we believe certain things given in book and understand them. If you happen to work in Intel or AMD, you will see how different things are - whatever we learn in book are outdated long back but they are still the basics. As far as GATE questions are concerned they usually come from the most common stuffs given in standard books- if they ask otherwise as has happened sometimes - just be happy that everyone will be wasting time on it, and you can solve other questions :P — Arjun, Sep 6, 2014
For this kind of questions should we always draw space time diagram to solve?

Can't we use any direct formula?

Bit confused about when to use [1 + stall frequency*stall cycle]*d formula?

Please comment. — Ram Sharma1, Sep 6, 2014
Why operand forwarding is not applied..? Only when explicitly given in question to take into account operand forwarding then only it is applied.Is it so...? — GateMaster Prime, Jan 10, 2015
It is not mentioned as WHEN IS THE TARGET ADDRESS OF INSTRUCTION evaluated. Generally it is done in DI stage but here you have done that in Execution phase of I4. Why so . Is it implicitly assumed here .if yes why ?? — Sandeep_Uniyal, Jan 14, 2015
small doubt ... in the question it is not mentioned that whether it is a synchronous or asynchronous pipeline... so why are we taking the cycle time as 11ns (max stage delay) ? — Danish, Jan 24, 2015
It is mentioned in the question itself
Consider an instruction pipeline with five stages "without any branch prediction" i.e. target address is available after completion of instruction. — GateMaster Prime, Jan 25, 2015
yes sir, it cannot be applied. I had this doubt earlier. But now, i got this. :) — GateMaster Prime, Jan 25, 2015
"Without any branch prediction" should mean that target address is availabe only after the completion of the instruction. ie. after WO stage. Why isnt that considered here? — Swatish Satheesan, Jan 29, 2015
Why should the WO stage be completed? After Execution stage, target is known- this is not prediction. In branch prediction, based on the history of the branch, further instructions will be fetched. During successful prediction, there will be no branch overhead. On a prediction failure, the pipeline must be flushed. — Arjun, Jan 29, 2015
we should have considered stalls due to stage delays itself right?

when EI and FO execute in parallel the pipeline stalls for 2ns. Same for EI and WO. — ankitrokdeonsns, Jan 31, 2015
if EI stage is being executed in 1 instruction and FO in the next one parallely since delay of FO is 2ns more than EI pipeline will stall for 2ns — ankitrokdeonsns, Jan 31, 2015
Why is the maximum delay being considered in Clk8, Clk 9, Clk 10 and Clk 15? Also why is it considered in Clk1, Clk2, Clk3?
{Since in those clock cycles, EI stage is not functioning, hence the time for those clock cycles should be taken as the maximum delay of the stages which are active in those cycles, isn't it?} — saurabhrk, Feb 2, 2015
It is not mentioned as WHEN IS THE TARGET ADDRESS OF INSTRUCTION evaluated. Generally it is done in DI stage but here you have done that in Execution phase of I4. Why so . Is it implicitly assumed here .if yes why ?? ans : pipeline with five stages "without any branch prediction" i.e. target address is available after completion of instruction. sir it means that I9 should be fetched after EI stage of I4 instruction.?? plzz clear my doubt.. — focus _GATE, Jul 19, 2015
JZ LOOP
ADD X, Y

Here we can know whether we have to do a Jump or not only after the Execute stage of the JZ (Jump on Zero) instruction. Similar is the case for any conditional branch instruction. — Arjun, Jul 19, 2015
k sir, means in DI stage we can know only wt type of operation are involved in the instruction..? but in case of conditional branch instruction . BRANCh is know to be in EI stage only.?? am i right.?? — focus _GATE, Jul 19, 2015
In the given ans there would be stall for I8 too,plz clarify why only 3 stalls,.Plz explain someone. — Gate Mm, Dec 10, 2015
Instruction number 12 WO will be at T15 which is not shown due to insufficient space

so total number of clock =15 — sourav., Jan 2, 2016
When the 1^st instruction is in FI stage, why are we considering that it is taking 11 ns of time and not 7 ns, because FI is taking 7 ns? Do we assume a time of 11 ns for all the stages even though an instruction is not being executed in the stage of 11 ns? — Gaurav Sharma, Feb 4, 2016
How to know when the branch address is resolved whether during Execute phase or Decode phase since it is not mentioned explicity in the question?? — sushmita, Oct 18, 2016
@Gaurav yes, once pipelined all stages take same time or else synchronization of stages won't work.

@sushmita I assumed conditional branch since it is mentioned "branch was taken during execution" as for an uncoditional branch it will be taken everytime. And for conditional branch, condition will be evaluated only during EI stage. — Arjun, Oct 18, 2016
@Arjun Sir
Why we have to wait for EX stage in conditional branch ?
Why can't we move branch decision logic to ID stage only in Conditional branches also ?
please see side 11 here http://courses.cs.vt.edu/cs2506/Spring2013/Notes/L13.BranchPrediction.pdf — Sachin Mittal 1, Dec 10, 2016
@Arjun sir

Why we r not taking branch instruction in the EX stage as it is mentioned in the ques branch is taken during the execution state. — Devyani, Dec 16, 2016
Sir, Why u are not showing about Instructions no. 5,6,7,8 what about these? what happen about these.. — aman.anand, Dec 28, 2016
Instruction I4 is the only branch instruction and its branch target is I9. If the branch is taken during the execution of this program — Arjun, Dec 29, 2016
if the question was like that the branch is NOT TAKEN would the execution finish at I8 or we will go till the I12 instruction? — Pankaj Joshi, Dec 30, 2016
Total instruction executed will be 1 to 4 and then 9 to 12.Whihc is total 8.Now because of I4 is the branch and its target will be available in 4th stage.So 3 more instructions(NOP) will be fetched.Which means total 11 instructions to be executed by pipeline.

Now we know Time taken by pipeline will be k+n-1 cycles.which is 5+11-1=15 cycles

Cycle time is 11 ns ,so total time would be 11*15=165 — rahul sharma 5, May 28, 2017
Sir,

Can't we push I9 after decoding the instruction I4.Why should I wait till EI of I4? — prayas, Jun 16, 2017
Putting aside the options, what about the delay due to branch? wont 1.25nsec be added to 165nsec due to pipeline stall? Mr. @Arjun? — habedo007, Jul 11, 2017
@Arjun vetran

Sir, In question there is not mention about operator forwarding.but still we fetch I9 in T8.I think it should be fetch in T9. — Nils, Aug 11, 2017
Pls Explain I8 are not added stall Cycle if you have any reason ? Briefly Explain Anyone ? — Vijay Dulam, Aug 21, 2017
@ gatecse Veteran:

in the question they did'nt mention in which state of I4 target address is available.

when without branch prediction given in question ,then we assume that target address available at the last stage but then stalls present will be 4.it also fetched I8 also. — Sourabh Kumar, Sep 8, 2017
Can somebody clarify what if the instruction is unconditional jump?? — parulk, Oct 8, 2017
@arjun sir how you saying branch address availiable after EI stage..? — Shubham Shukla 6, Oct 15, 2017
@Nils

Fetch will happen in T8. Even if there is a RAW hazard between I4 and I9, I9 will get the updated data at decode phase of the instruction. — rishi71662data4, Dec 8, 2017
Why we are not using the stage delays for FI, DI, FO, EI and WO are 5 ns, 7 ns, 10 ns, 8 ns and 6 ns, respectively and taking only highest one?

At least for first Instruction we suppose to use it if its given.

first instruction will take 5+7+10+8+6=36 and 36+5 (buffer delay)= 41 ns and remaining instruction will take (upto 4 only) 11 ns * 3= 33. thus total time after 4th instruction execution 41+33 =74 ns.

similary, form 9 to 12 th instruction it will take 41 ns (for 9th) + 33 ns (from 10 to 12th)=74ns.

and thus total time required for execution 74 + 74 =148 ns (No option is given)

What I am saying here is that if they are giving time for each stage then why we are not using it and just solve it by assuming only highest delay. — Bhushan Laware, Dec 27, 2017
It is because the highest delay in the pipeline is 10ns. What we are doing is we increase the delay of all other stages to 10ns to ensure continuous flow. now, every stage has a delay of 10ns. Refer the best answer above. He has explained it nicely. — Ananthakrishnan Saji, Dec 30, 2017
Its the clock period which determines when the instruction will move to next hardware in the datapath - Not the execution time of datapath hardwares. I hope you now understand why it is 10+1 = 11 ns. — Harsh Kumar, May 30, 2018
@Swati Rauniyar

we are not doing it like this becoz pipeline's time is decided by its slowest unit!!

if it is still not clear to u, then tell me ill give u an example; — Gate Fever, Nov 5, 2018
HOW TO GET 165 I8 IS NOT COUNTED IN SOLUTION TOTAL CYCLE ARE 16 ->16 *11=176 ANS PLS ELABORATE — rajputved, Dec 11, 2018
after completing the branch instruction you have to come back so then why the answer is b — Ramij, Jan 1, 2019
In conditional branch instructions, condition and target address are resolved at EX' stage and bubbles are fed during its execution to the pipeline. (By default)

But we can also resolve it at ID -stage using more complex pipeline architecture( add simple ALU and computation unit at ID stage) but don't use this architecture unless otherwise stated. — Nitesh Singh 2, Oct 16, 2019
without any branch prediction

then we should consider I5 also ????

If we don't use any method to remove control dependency, then I5 will be executed. — mrinmoyh, Oct 31, 2019
in the question, it has been mentioned that " If the branch is taken during the execution of this program" then why are we not fetching the I9 instruction in the execution stage of I4?

why are we waiting till the completion of the execution phase to start fetching? — kritikasingh, Jul 15, 2020
What would be the solution had the question been asked for with branch prediction? — shashi7893, Sep 2, 2020
why we can’t perform fetch operation of I9 along with execution phase of I4? — arun yadav, Sep 7, 2020
how we are fetching I9 in the 8^th cycle, it is not mentioned. Please explain this point — Abhineet Singh, Nov 5, 2020
Source: Patterson & Hennessy

So, we can definitely move the branch comparison to the OF(ID stage in MIPS) stage and reduce the stall cycles by $1$. But since none of the options matches that, we are not considering this.

I guess for NAT question both answers would be true, because moving the branch comparison to earlier stages of the pipeline require complex circuitry to deal with forwarding and other data hazards. — avistein, Aug 4, 2021
There is no branch prediction used. So how can $I9$ start after the EI of $I4$. It should start after $WO$ right? So we would get $4 stalls$ instead of $3$. Answer should be : $16*11 = 176 ns.$ — Abhrajyoti00, Sep 8, 2022
@Abhrajyoti00 bro we are starting I9 after EI because it is clearly mentioned in question that we have to branch during the execution of the program . — Thadymademe, Nov 30, 2022
@abir_banerjee That’s not exactly correct. The meaning of that sentence is we have a conditional branch instruction like “Jump on Carry” and the condition is true during the execution of the program and thus the branch is taken.

@Abhrajyoti00 Even without a branch predictor can’t we start I9 as soon as we know it is the branch target? Which pipeline feature facilitates this? — gatecse, Nov 30, 2022
@shashi7893 I have been searching for this for a while did u get any answer to this ? — Priyanshu2602y, Aug 25, 2023

amarVashishth · Answer 1 · 2015-10-20T12:12:52+0000

See all

Show 7 previous comments

Prateek K · Answer 2 · 2017-10-24T00:13:52+0000

Clock Time = max stage delay + Buffer Delay

= 10+1= 11ns

I1 - Finish at 5th clock

I2 - Finish at 6th clock

I3 - Finish at 7th clock

I4- Finish at 8th clock

Due to branching at I4 pipelining halts and starts after EI stage of I4 and performs FI of I9 at 8th clock.

I9 - Finish at 12th clock

I10 - Finish at 13th clock

I11- Finish at 14th clock

I12 - Finish at 15 clock

Total time to complete program = 11*15= 165 ns

Short Trick :

Branching descion will be taken in Execute Instruction (EI) phase (4th phase) so there will be 3 stalls

first I1 will complete in 5 cycles + (I2,I3,I4,3 STALLS,I9,I10,I11,I12) WILL TAKE ONE-ONE CYCLE=15 CYCLE

cycle time=(largest cycle time + buffer delay)=10+1=11

Execution time =11*15=165 — Dharmendra Lodhi, Sep 9, 2018

Çșȇ ʛấẗẻ · Answer 3 · 2016-12-21T07:14:45+0000

clock cycle time= max. stage delay+ buffer delay

         = 10+1

         = 11 ns

clock cycle =15

execution time= 11*15=165

tags	tag:apple
author	user:martin
title	title:apple
content	content:apple
exclude	-tag:apple
force match	+apple
views	views:100
score	score:10
answers	answers:2
is accepted	isaccepted:true
is closed	isclosed:true

GATE CSE 2013 | Question: 45

Please log in or register to add a comment.

Please log in or register to answer this question.

10 Answers

Please log in or register to add a comment.

Please log in or register to add a comment.

Please log in or register to add a comment.

Please log in or register to add a comment.

Related questions

10 10 Comments reply

Please log in or register to add a comment.

Please log in or register to answer this question.

10 Answers

66 66 Comments reply

Please log in or register to add a comment.

10 10 Comments reply

Please log in or register to add a comment.

2 2 Comments reply

Please log in or register to add a comment.

0 reply

Please log in or register to add a comment.

Related questions

10 10 Comments

66 66 Comments

10 10 Comments

2 2 Comments

0