import pandas and load training set and get only 2's and 3's

inspecting the data

first colume is the label (Y), the rest of columns are variables (X)

load test set and use 2's and 3's and separate X and Y

import model fit the training data

use the linear regression model to predict and calculate the accuracy

import k nearest model loop through all k's and calculate accuracy