GATE CSE 2021 Set 1 | Question: 23

Arjun asked Feb 18, 2021 • retagged Nov 30, 2022 by Lakshman Bhaiya

12,964 views

See all

2 Answers

Best answer

$P(A>10) = \frac{10}{15} = \frac{2}{3}$
$P(B=18) = \frac{1}{20}$
$P(A>10 \land B=18) = \frac{2}{3}\times\frac{1}{20} = \frac{1}{30}$

$P(A>10 \lor B=18) = P(A>10) + P(B=18) – P(A>10 \land B=18)$

$\qquad = \frac{2}{3} + \frac{1}{20} – \frac{1}{30} = \frac{40 + 3 – 2}{60} = \frac{41}{60}$

$\text{Estimated number of tuples} = \frac{41}{60}\times1200 = 820$

The above answer is TRUE for SQL SELECT but not for Relational Algebra as by theory relational algebra operates on a set which means all the elements must be distinct. Since we have $15$ distinct possible values for $A$ and $20$ distinct possible values for $B,$ in strict relational algebra we’ll get

$\text{Estimated number of tuples} = \frac{41}{60}\times (15 \times 20) = 205.$

Official Answer: $205$ OR $820.$

zxy123 answered Feb 18, 2021 • selected Jun 13, 2021 by Arjun

zxy123

See all

Show 9 previous comments

See all

HitechGa

commented Feb 27, 2021

I see many people are confused with this question.

Let me clarify a bit.

Let’s read the question once again:

A relation $r(A,B)$ in a relational database has $1200$ tuples. The attribute $A$ has integer values ranging from $6$ to $20$, and the attribute $B$ has integer values ranging from $1$ to $20$. Assume that the attributes $A$ and $B$ are independently distributed.

The estimated number of tuples in the output of $σ_{(A>10)∨(B=18)}(r)$

is ____________.

Note the had the term been “relational model” instead of “relational database”, then we could have argued that the table is built using the classical set theory concept.

But since they have used the term “relational database” we could think that they have talking about a specific implementation. So there is no harm in considering the table as an SQL table. As such we can say that duplicates are allowed in the table.

We can further confirm this as follows:

$$E[A=a, B=b]=P(A=a,B=b). 1200 = \frac{1}{15} \times \frac{1}{20} \times 1200 = 8 \neq 1$$

So as such we are sure that the actual database has duplicate tuples. And the relational algebra query as such can be thought of as follows:

$\text{SELECT *}$

$\text{FROM r}$

$\text{WHERE A>10 AND B=18}$

tags	tag:apple
author	user:martin
title	title:apple
content	content:apple
exclude	-tag:apple
force match	+apple
views	views:100
score	score:10
answers	answers:2
is accepted	isaccepted:true
is closed	isclosed:true

GATE CSE 2021 Set 1 | Question: 23

Please log in or register to add a comment.

Please log in or register to answer this question.

2 Answers

Please log in or register to add a comment.

Please log in or register to add a comment.

Related questions

1 1 comment reply

Please log in or register to add a comment.

Please log in or register to answer this question.

2 Answers

12 12 Comments reply

Please log in or register to add a comment.

23 23 Comments reply

Please log in or register to add a comment.

Related questions

1 1 comment

12 12 Comments

23 23 Comments