Data Mining Assignment

Part 1 – General Data Cleaning – 25points

Use the provided faux_data.csv dataset. This file contains first name, last name, employee ID, gender, address, dollar, data, and comment data that needs to be cleaned. Follow the steps below to clean the dataset:

R code:

rm(list = ls(all = T))

#step 1

library(readxl)

df = read.csv(choose.files(),stringsAsFactors = FALSE)

head(df)

 

Instruction Files
Basic features
  • Free title page and bibliography
  • Unlimited revisions
  • Plagiarism-free guarantee
  • Money-back guarantee
  • 24/7 support
On-demand options
  • Writer’s samples
  • Part-by-part delivery
  • Overnight delivery
  • Copies of used sources
  • Expert Proofreading
Paper format
  • 275 words per page
  • 12 pt Arial/Times New Roman
  • Double line spacing
  • Any citation style (APA, MLA, Chicago/Turabian, Harvard)