Solution Manual DM:
Question# 01
Solution: (30pts)
[Solution given as seven images in the original; not reproduced here.]


Data Mining
Virtual University of Pakistan
Assignment # 01
Total Points: 40
Data Mining (CS725)
Please read the instructions before starting the assignment solution. Please try to complete and upload your assignment on or before the due date. No assignment sent by email after the due date will be considered.
Instructions:
Please apply/perform all the steps.
Copying from other students or any other source will be marked as ZERO.
Screenshots of the assignment solution will not be considered.
The assignment solution must be typed.
Screenshots of mathematical equations will lead to a deduction of marks.
No assignment will be entertained through email.
Be specific and to the point in the assignment solution. No assignment will be accepted after the due date.
Due Date: November 15th, 2019
Question# 01 (30pts)
Consider an undergraduate class having four core subjects as follows:
Computer programming
Data structures
Analysis of algorithms
Digital logic design
As per university policy, a student has to score minimum marks as follows to clear the semester.
[Table of marks given as an image in the original; not reproduced here.]
a) Apply min-max normalization, taking the maximum as 1 and the minimum as 0.
b) Compute the z-score and discuss every calculated z-score result.
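As a rough illustration of the two normalizations asked for in parts a) and b), the sketch below applies them to a made-up list of marks. The marks table in the original is an image, so the values in `marks` and the function names `min_max_normalize` and `z_scores` are hypothetical. The z-score here uses the population standard deviation, which is an assumption; a course may expect the sample standard deviation instead.

```python
from statistics import mean, pstdev


def min_max_normalize(values, new_min=0.0, new_max=1.0):
    """Rescale values linearly so min(values) -> new_min and max(values) -> new_max."""
    lo, hi = min(values), max(values)
    if hi == lo:
        raise ValueError("all values are equal; min-max normalization is undefined")
    return [(v - lo) / (hi - lo) * (new_max - new_min) + new_min for v in values]


def z_scores(values):
    """Return (v - mean) / std for each value, using the population std (assumption)."""
    mu = mean(values)
    sigma = pstdev(values)
    if sigma == 0:
        raise ValueError("standard deviation is zero; z-scores are undefined")
    return [(v - mu) / sigma for v in values]


# Hypothetical marks, NOT taken from the assignment's table.
marks = [40, 50, 60, 70]

normalized = min_max_normalize(marks)
standardized = z_scores(marks)

print(normalized)
print([round(z, 3) for z in standardized])
```

When discussing the results, a negative z-score indicates a value below the mean and a positive one a value above it, measured in units of standard deviation.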
Question# 02 (10pts)
a) Compute the cosine similarity for the below mentioned documents:
D1= Ali loves me more than Akram loves me
D2= Shahzain likes me more than Ali loves me.
b) Consider the university students records mentioned below and compute Euclidean distance:
[Student records table given as an image in the original; not reproduced here.]
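The two measures in this question can be sketched as follows. For part a) the sketch assumes raw term counts, case-insensitive matching, and no stemming or stop-word removal (so "loves" and "likes" are different terms); other weightings would give a different value. For part b) the student records are an image in the original, so `student_a` and `student_b` are hypothetical two-attribute records. The helper names are illustrative only.

```python
import math
import re
from collections import Counter


def term_counts(text):
    """Lowercase the text and count alphabetic tokens (assumed tokenization)."""
    return Counter(re.findall(r"[a-z]+", text.lower()))


def cosine_similarity(a, b):
    """Cosine of the angle between two term-count vectors stored as Counters."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    return dot / (norm_a * norm_b)


def euclidean_distance(p, q):
    """Straight-line distance between two equal-length numeric records."""
    if len(p) != len(q):
        raise ValueError("records must have the same number of attributes")
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(p, q)))


# Part a): the two documents from the question.
d1 = term_counts("Ali loves me more than Akram loves me")
d2 = term_counts("Shahzain likes me more than Ali loves me.")
similarity = cosine_similarity(d1, d2)
print(round(similarity, 4))  # 0.8216, i.e. 9 / (sqrt(12) * sqrt(10))

# Part b): hypothetical records, NOT taken from the assignment's table.
student_a = (3.0, 4.0)
student_b = (0.0, 0.0)
distance = euclidean_distance(student_a, student_b)
print(distance)  # 5.0
```

Under these assumptions the shared terms are Ali, loves, me, more, and than, giving a dot product of 9 and vector lengths of sqrt(12) and sqrt(10).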
Fall 2019 Data Mining Virtual University of Pakistan
Data Mining (CS725)
Assignment # 01 Total Points: 50
Please read the instructions before starting the assignment solution. Please try to complete and upload your assignment on or before the due date. No assignment sent by email after the due date will be considered.
Instructions:
Please apply/perform all the steps.
Copying from other students or any other source will be marked as ZERO.
Screenshots of the assignment solution will not be considered.
The assignment solution must be typed.
Screenshots of mathematical equations will lead to a deduction of marks.
No assignment will be entertained through email.
Be specific and to the point in the assignment solution. No assignment will be accepted after the due date.
Due Date: January 16th, 2020
Question# 01 (10pts)
Consider the dataset below:
              Predicted Yes   Predicted No
Actual Yes    70              30
Actual No     40              60
Now calculate the following:
i. Sensitivity
ii. Specificity
iii. Error rate
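The three measures can be checked with a short sketch like the one below. It assumes the rows of the table are the actual classes, the columns are the predicted classes, and "Yes" is the positive class; the flattened table in the original does not make the orientation fully certain, so these assignments of TP, FN, FP, and TN are an assumption.

```python
# Counts read from the table, assuming rows = actual, columns = predicted,
# and "Yes" = positive class.
tp, fn = 70, 30  # actual Yes: predicted Yes / predicted No
fp, tn = 40, 60  # actual No:  predicted Yes / predicted No


def sensitivity(tp, fn):
    """True positive rate: share of actual positives predicted as positive."""
    return tp / (tp + fn)


def specificity(tn, fp):
    """True negative rate: share of actual negatives predicted as negative."""
    return tn / (tn + fp)


def error_rate(tp, fn, fp, tn):
    """Share of all tuples that were misclassified."""
    return (fp + fn) / (tp + fn + fp + tn)


sens = sensitivity(tp, fn)
spec = specificity(tn, fp)
err = error_rate(tp, fn, fp, tn)

print(sens)  # 0.7
print(spec)  # 0.6
print(err)   # 0.35
```

If the table is read the other way round (rows as predicted), FP and FN swap, which changes sensitivity and specificity but leaves the error rate at 0.35.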
Question# 02 (40pts)
Consider the following class-labeled training tuples from the weather forecast dataset mentioned below:
[Training tuples table given as an image in the original; not reproduced here.]
a) Calculate the Gain for every mentioned attribute and identify which attribute will be chosen for splitting.
b) Compute the Gini index for the overall collection of training dataset.
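A minimal sketch of both calculations follows, reading "Gain" as entropy-based information gain (an assumption). The training tuples in the original are an image, so the four rows below, the attribute names `outlook` and `windy`, and the class attribute `play` are all hypothetical stand-ins chosen to keep the arithmetic easy to follow.

```python
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy (base 2) of a list of class labels."""
    total = len(labels)
    return -sum((n / total) * math.log2(n / total) for n in Counter(labels).values())


def gini_index(labels):
    """Gini impurity: 1 minus the sum of squared class proportions."""
    total = len(labels)
    return 1.0 - sum((n / total) ** 2 for n in Counter(labels).values())


def information_gain(rows, attribute, target):
    """Entropy of the whole set minus the weighted entropy after splitting on attribute."""
    labels = [row[target] for row in rows]
    partitions = {}
    for row in rows:
        partitions.setdefault(row[attribute], []).append(row[target])
    remainder = sum(len(part) / len(rows) * entropy(part) for part in partitions.values())
    return entropy(labels) - remainder


# Hypothetical tuples, NOT the assignment's weather table.
rows = [
    {"outlook": "sunny", "windy": "yes", "play": "no"},
    {"outlook": "sunny", "windy": "no", "play": "yes"},
    {"outlook": "rainy", "windy": "yes", "play": "no"},
    {"outlook": "rainy", "windy": "no", "play": "yes"},
]

gains = {a: information_gain(rows, a, "play") for a in ("outlook", "windy")}
best_attribute = max(gains, key=gains.get)  # attribute with the highest gain
overall_gini = gini_index([row["play"] for row in rows])

print(gains)
print(best_attribute)
print(overall_gini)
```

In this toy data `windy` separates the classes perfectly and `outlook` not at all, so `windy` would be chosen for splitting; with the real table the same functions apply once its rows are typed in.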