edited by
1,597 views
0 votes
0 votes

You are reviewing four papers submitted to a conference on machine learning for medical expert systems. All the four papers validate their superiority on a standard benchmarking cancer dataset, which has only $5 \%$ of positive cancer cases. Which of the experimental settings is acceptable to you?

paper i) We evaluated the performance of our model through a $5$-fold cross-validation process and report an accuracy of $93 \%$.

paper ii) The area under the $\text{ROC}$ curve on a single left-out test set of our model is around $0.8$, which is the highest among all the different approaches.

paper iii) We computed the average area under the $\text{ROC}$ curve through $5$-fold cross-validation and found it to be around $0.75$ - the highest among all the approaches.

paper iv) The accuracy on a single left-out test set of our model is $95 \%$, which is the highest among all the different approaches.

  1. $\text{paper i}$ 
  2. $\text{paper i and paper iv}$
  3. $\text{paper ii and paper iv}$
  4. $\text{paper iii}$

     

edited by

2 Answers

0 votes
0 votes
I think the answer is D.

paper i and iv use accuracy which is not an appropriate measure given the imbalance in the data set. For ex. we could use a model classifying every point in the data set as not cancerous and get an accuracy of 95% (Tp = 95, FP =5, Tn = 0, FN =0)

Doubtful here but I thin, Paper ii uses ROC although it uses ROC for leave one out. Not sure how ROC might be made to work given only one point is being predicted.

Related questions

0 votes
0 votes
2 answers
1
admin asked Oct 21, 2023
3,882 views
Given $3$ literals $\text{A, B}$, and $\text{C}$, how many models are there for the sentence $\text{A $\vee$ $\neg$ B $\vee$ C}$ ?
1 votes
1 votes
2 answers
2
admin asked Oct 21, 2023
3,439 views
Which of the following first-order logic sentence matches closest with the sentence "All students are not equal"?$\forall x \exists y[\operatorname{student}(x) \wedge \op...
0 votes
0 votes
1 answer
3
admin asked Oct 21, 2023
1,608 views
The mean of the observations of the first $50$ observations of a process is $12$. If the $51$ $\text{st}$ observation is $18$, then, the mean of the first $51$ observatio...