Background: This course is all about data visualization. However, we must first have some understanding of the dataset that we are using to create the visualizations.
Assignment:
- Use RStudio to generate a Word document with a basic data analysis of the following dataset: dataset_price_personal_computers.csv
- Use the following video to learn the simple commands to conduct basic data analysis with RStudio: Basic Data Analysis in RStudio.mp4
Questions/Requests:
- Create summary statistics for the dataset. (provide a screen shot)
- Create a correlation matrix for the dataset. (provide a screen shot) (Hint: A transformation may be needed)
- What is the Min, Max, Median, and Mean of the Price? (provide a screen shot)
- What are the correlation values among Price, Ram, and Ads? (provide a screen shot)
- Create a subset of the dataset with only Price, CD, and Premium. (provide a screen shot)
- Create a subset of the dataset with only Price, HD, and Ram where Price is greater than or equal to $1750. (provide a screen shot)
- What percentage of the computers sold were Premium? (provide a screen shot) (Hint: Categorical analysis)
- How many Premium computers with CDs were sold? (provide a screen shot) (Hint: Contingency table analysis)
- How many Premium computers with CDs priced over $2000 were sold? (provide a screen shot) (Hint: Conditional table analysis)
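The requests above map onto a handful of base R commands. The following is a minimal sketch, not the method shown in the course video. It assumes lowercase column names (price, hd, ram, cd, premium, ads) and "yes"/"no" values for cd and premium; the actual CSV may differ, so check names(computers) first. The small data frame is invented purely so the sketch runs on its own and does not reflect the real dataset.

```r
# Toy stand-in for the real file; every value here is invented for illustration.
# With the real data you would load it instead, for example:
#   computers <- read.csv("dataset_price_personal_computers.csv")
# Column names and the "yes"/"no" coding are assumptions about that file.
computers <- data.frame(
  price   = c(1499, 1795, 1595, 1849, 3295, 2099),
  hd      = c(80, 85, 170, 170, 340, 210),
  ram     = c(4, 2, 4, 8, 16, 8),
  cd      = c("no", "no", "no", "yes", "yes", "yes"),
  premium = c("yes", "yes", "yes", "no", "yes", "yes"),
  ads     = c(94, 94, 110, 120, 130, 150)
)

# Summary statistics for every column.
summary(computers)

# cor() needs numeric input, so recode the yes/no columns as 1/0 first.
computers_num <- transform(
  computers,
  cd      = as.integer(cd == "yes"),
  premium = as.integer(premium == "yes")
)
cor(computers_num)

# Min, max, median, and mean of price (summary() also reports the quartiles).
summary(computers$price)

# Correlations among price, ram, and ads only.
price_ram_ads_cor <- cor(computers[, c("price", "ram", "ads")])
price_ram_ads_cor

# Subset with only price, cd, and premium.
price_cd_premium <- subset(computers, select = c(price, cd, premium))

# Subset with price, hd, and ram where price is at least 1750.
expensive <- subset(computers, price >= 1750, select = c(price, hd, ram))

# Categorical analysis: percentage of premium vs. non-premium computers.
premium_pct <- prop.table(table(computers$premium)) * 100
premium_pct

# Contingency table: premium status against CD drive.
premium_by_cd <- table(premium = computers$premium, cd = computers$cd)
premium_by_cd

# Conditional table: the same cross-tabulation, restricted to price > 2000.
premium_cd_over_2000 <- with(subset(computers, price > 2000),
                             table(premium = premium, cd = cd))
premium_cd_over_2000
```

In the contingency and conditional tables, the count asked for is the cell where both premium and cd are "yes". Note that "over $2000" is coded as a strict inequality (price > 2000), while the earlier subset uses price >= 1750 to match "greater than or equal to".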
Your document should use an easy-to-read font in MS Word (other word processors are fine to use, but save the file in MS Word format). Your cover page should contain the following: Title, Student’s name, University’s name, Course name, Course number, Professor’s name, and Date.
Submit your assignment on or before the due date.