Virtual university of Pakistan

Assignment # 01

Total Points: 40

Data Mining (CS725)

Please read the instructions before stepping in to the assignment solution. Please try to complete and upload your assignment on or before due date. No assignment will be considered after due date through email.

Instructions:

Please apply/ perform all the steps.

Copying from other students or any other source will be

marked as ZERO.

Screenshots of assignment solution will not be considered.

Assignment solution must be typed.

Screenshots of Mathematical Equation will lead to marks deduction.

No assignment will be entertained through email.

Be specific and to the point in assignment solution. No assignment after due date will be accepted.

Due Date: November 15th, 2019

Data Mining Virtual university of Pakistan

Question# 01

Consider an undergraduate class having four core subjects as follows:

Computer programming

Data structures

Analysis of algorithms

Digital logic design.

(30pts)

As per university policy, a student has to score minimum marks as follows to clear the semester.

a) Min-max normalization by considering maximum as 1 and minimum as 0 b) Z-score and discuss every calculated z-score result.

Question# 02 (10pts)

a) Compute the cosine similarity for the below mentioned documents:

D1= Ali loves me more than Akram loves me

D2= Shahzain likes me more than Ali loves me.

Data Mining Virtual university of Pakistan

b) Consider the university students records mentioned below and compute Euclidean distance:

Question# 01

Solution: (30pts)

]]>