The Gateway to Computer Science Excellence

+24 votes

Consider a selection of the form $\sigma_{A\leq 100} (r)$, where $r$ is a relation with $1000$ tuples. Assume that the attribute values for $A$ among the tuples are uniformly distributed in the interval $[0, 500].$ Which one of the following options is the best estimate of the number of tuples returned by the given selection query ?

- $50$
- $100$
- $150$
- $200$

0

I think actual answer should be 202 because it is $σ_{A≤100}(r)$ not $σ_{A <100}(r)$

ping @Krish__, @rahul sharma 5, @Red_devil, @Shivam Chauhan, @Tuhin Dutta, @Anu007, @Ashwin Kulkarni @reena_kandari and @srestha ji.

+1

It should be $\frac{101}{501}*\small1000 = 201.59$ to be precise. And since it is closer to 202 than 201, I'd go for 202.

+1

Why is everyone simply ignoring the fact that relational algebra doesn't return repeat values? it will give distinct values only right, the we will get only 100 tuples and 101 if 0 is included.

0

@shaz No where it is given A is primary key...so it can be repeat..

example-

A B

1 0

1 1

2 1

2 2

Etc.(if A is not ck)

+26 votes

Best answer

$\sigma_{A \leq 100}(r)$

$r$ has $1000$ tuples

Values for A among the tuples are uniformly distributed in the interval $[0, 500].$ This can be split to $5$ mutually exclusive (non-overlapping) and exhaustive (no other intervals) intervals of same width of $100$ $([0-100], [101-200], [201-300], [301-400], [401-500],$ $0$ makes the first interval larger - this must be a typo in question) and we can assume all of them have same number of values due to Uniform distribution. So, number of tuples with A value in first interval should be

$\frac{\text{Total no. of tuples}}{5} = 1000/5 = 200$

Correct Answer: $D$

$r$ has $1000$ tuples

Values for A among the tuples are uniformly distributed in the interval $[0, 500].$ This can be split to $5$ mutually exclusive (non-overlapping) and exhaustive (no other intervals) intervals of same width of $100$ $([0-100], [101-200], [201-300], [301-400], [401-500],$ $0$ makes the first interval larger - this must be a typo in question) and we can assume all of them have same number of values due to Uniform distribution. So, number of tuples with A value in first interval should be

$\frac{\text{Total no. of tuples}}{5} = 1000/5 = 200$

Correct Answer: $D$

–1

each tupple can choose number from 0 to 500.. So the probability of choosing a number from 0 to 100 for each tupple is 101 / 501..

So number of tupple return is = expected number of tupple choose number from 0 to 100

= 101/501 + 101/501..... 1000 times

around 200 tupples..

(D)

So number of tupple return is = expected number of tupple choose number from 0 to 100

= 101/501 + 101/501..... 1000 times

around 200 tupples..

(D)

+1

Numbers between 0 - 500 so number less than equal to 100 is 101(0- 100) . so answer should be 202.

why we take 1- 500 since closed bracket used so 0 and 500 are inclusive.

why we take 1- 500 since closed bracket used so 0 and 500 are inclusive.

+3

yes, but that must be a typo. And if we include 0, answer must be 101/501*1000. Also we cant assume values are integers.

0

how did you spilt [0-500]? why only 5 intervals ? why cant we have more intervals? didn't get it at all. please help!

+2

@Arjun Sir selection of A <= 100 means only 100 tuples will be returned right? Because selection operator removes duplicates??

0

didn't get it at all. please help!

0

since in question given...we have only nos. 0 to 500 to distribute among 1000 tuples for attribute A "UNIFORMLY". So if we use each element from 0 to 500 twice, we will have 1002 element which is suffice for distributing among 1000 tuples. we will use any of the 499 elements only and two times each. so, in this way we'll have 1000 values for A of relation r, distributed uniformly on 0 to 500.

so now when we'll select all the A values such that a<=100, we will get min 200(if 0 to 499 selected) and max 202(if 1 to 500 selected) such tuples.

so both the answers are correct, better go with the options now, whichever is given select it. if both given the select 202. bcz its the max possibility.

so now when we'll select all the A values such that a<=100, we will get min 200(if 0 to 499 selected) and max 202(if 1 to 500 selected) such tuples.

so both the answers are correct, better go with the options now, whichever is given select it. if both given the select 202. bcz its the max possibility.

+2

we've numbers [0,500] & tuples 1000. Now we've to uniformly or equally distribute this 501 numbers into 1000 tuples.

i.e each number will get, 1000/501 = 1.996 tuples.

i.e each number will get 2 tuples.( here floating point comes because of typo,if the number range would have [1,500] then we'll get perfectly 2 tuples for each number).

So the RA expression will select 101*2 = 202 tuples technically.

But here answer should be taken as 200 because of typo.

+1

but tell me this

but in relational algebra, in the final table ,the tuples are distinct,so if each number is repeated two times....even through number of distinct tuples should be 100

but in relational algebra, in the final table ,the tuples are distinct,so if each number is repeated two times....even through number of distinct tuples should be 100

+2

0

What if we choose interval like [0-50],[51-100].....[950-1000] the we would be having 10 intervals of size 50 each and each interval like others can contain a attribute value twice, then in this case we get vatue as 100 instead of 200 As

Total number of tuple /10 =1000/10 =100

@Abhinav Rana Sir

0

Can u tell me one thing, if we can accommodate <=100 values , then why do we need [0,500] interval to accommodate it or vice versa??

0

why do we need [0,500] interval to accommodate it or vice versa

what is the meaning of this line ???

here A attribute can take value in the range (0 - 500), & these 0 - 500 values will be distributed in 1000 places,that too uniformly(equal frequency of each values).

0

total￼total we have 1000

Now we splitting [0-500]---> 0-250, 251-500

Combinely we have 0-250,251-500,0-1000 in the 200 tuple is <=100

0

Since we have 1000 tuples and values between 0 and 500.So we can distribute 0-499 initially.So by that time 500 tuples would have been completed.

Now the next value is 500 which is 501 st tuple.Now if we try to wrap around and repeat same interval again,since we have 499 tuples remained with values possible from 0 to 499.

So for values A __<__ 100 we can have initial 101 + final 101 =202 tuples in total.

So the best estimate matches 200 tuples?

Can we consider such a distribution?

+15 votes

**option D**

total numbers are 1000 and they have said that it is uniformly distrubuted between [0,500] it means every number is 2 times thats the only way we can distribute it uniformaly and as per our condition A<=100 at max 100 tuples can be there and and every one can be repeated 2 times so it sums up to 200 hence it is answer

0

+2

but in relational alzebra, in the final table ,the tuples are distinct,so if each number is repeated two times....even through number of distinct tuples should be 100

- All categories
- General Aptitude 1.9k
- Engineering Mathematics 7.5k
- Digital Logic 2.9k
- Programming and DS 4.9k
- Algorithms 4.3k
- Theory of Computation 6.2k
- Compiler Design 2.1k
- Databases 4.1k
- CO and Architecture 3.4k
- Computer Networks 4.1k
- Non GATE 1.5k
- Others 1.5k
- Admissions 595
- Exam Queries 576
- Tier 1 Placement Questions 23
- Job Queries 72
- Projects 17

50,645 questions

56,596 answers

195,822 comments

102,062 users