Question# 01

Solution: (30pts)

25d8de40-37fb-4b0f-9cb2-6a6cd1ef0d81-image.png ede1d1e9-d26a-4d8d-a0c1-33cff47bd534-image.png 989a7ce0-af65-4517-b530-2bc54857d6a8-image.png 37664063-8e69-4fe4-8c90-e1231f48619c-image.png 977918e2-e47c-4ee4-a9ce-b5b845e9458e-image.png 3767bb79-0b8a-49a1-88a1-4185bcc9b461-image.png 54785684-6294-47cb-a260-f925551ff1c1-image.png ]]>

Data Mining (CS725)

Assignment # 01 Total Points: 50

Please read the instructions before stepping in to the assignment solution. Please try to complete and upload your assignment on or before due date. No assignment will be considered after due date through email.

Instructions:

Please apply/ perform all the steps.

Copying from other students or any other source will be

marked as ZERO.

Screenshots of assignment solution will not be considered.

Assignment solution must be typed.

Screenshots of Mathematical Equation will lead to marks deduction.

No assignment will be entertained through email.

Be specific and to the point in assignment solution. No assignment after due date will be accepted.

Due Date: January 16th, 2020

1

Fall 2019 Data Mining

Virtual university of Pakistan

Question# 01

Consider the following dataset mentioned below:

(10pts)

Actual | Predicted | ||
---|---|---|---|

Yes | No | ||

Yes | 70 | 30 | |

No | 40 | 60 |

Now calculate the following:

i. Sensitivity

ii. Specificity

iii. Error rate

Question# 02

(40pts)

Consider the following class-labeled training tuples from the weather forecast dataset mentioned below:

a) Calculate the Gain for every mentioned attribute and identify which attribute will be chosen for splitting?

b) Compute the Gini index for the overall collection of training dataset