Background: This course covers data science and big data analytics, including data visualization. This assignment requires an understanding of the dataset used to create the visualizations.

Additional Resources:

Use the following videos to learn the basic commands for data analysis with R/RStudio:

➢ Introduction to Data Science with R – Data Analysis Part 1 –

➢ Introduction to Data Science with R – Data Analysis Part 2 –

➢ Introduction to Data Science with R – Data Analysis Part 3 –

Assignment Requirements:

1. Use R/RStudio to generate a Word document containing a basic data analysis of the attached dataset:

dataset_price_personal_computers.csv

2. Create summary statistics for the dataset. (provide a screenshot)

3. Create a correlation matrix for the dataset. (provide a screenshot) (Hint: A transformation may be needed)

4. What is the Min, Max, Median, and Mean of the Price? (provide a screenshot)

5. What are the correlation values between Price, Ram, and Ads? (provide a screenshot)

6. Create a subset of the dataset with only Price, CD, and Premium. (provide a screenshot)

7. Create a subset of the dataset with only Price, HD, and Ram where Price is greater than or equal to $1750. (provide a screenshot)

8. What percentage of Premium computers were sold? (provide a screenshot) (Hint: Categorical analysis)

9. How many Premium computers with CDs were sold? (provide a screenshot) (Hint: Contingency table analysis)

10. How many Premium computers with CDs priced over $2000 were sold? (provide a screenshot) (Hint: Conditional table analysis)
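
The kinds of base R commands that requirements 2 through 10 call for can be sketched as follows. This is a minimal illustration only, not the expected solution: the data frame is a small made-up stand-in for the attached CSV, and the lowercase column names (price, hd, ram, cd, premium, ads) and the yes/no coding of cd and premium are assumptions. Check names(pc) and str(pc) against the real file before reusing any of it.

```r
# Hypothetical stand-in for dataset_price_personal_computers.csv (made-up values).
# With the real file, this block would likely become:
#   pc <- read.csv("dataset_price_personal_computers.csv", stringsAsFactors = TRUE)
# Column names and the yes/no coding below are assumptions.
pc <- data.frame(
  price   = c(1499, 1795, 1595, 2295, 2575, 1849),
  hd      = c(80, 85, 170, 210, 340, 210),
  ram     = c(4, 2, 4, 8, 16, 8),
  cd      = c("no", "no", "yes", "yes", "yes", "no"),
  premium = c("yes", "yes", "no", "yes", "yes", "no"),
  ads     = c(94, 94, 94, 120, 120, 150),
  stringsAsFactors = TRUE
)

# Requirements 2 and 4: summary statistics (min, max, median, mean)
print(summary(pc))
price_stats <- summary(pc$price)
print(price_stats)

# Requirements 3 and 5: cor() needs numeric columns, so the yes/no
# columns are recoded to 1/0 first (the "transform" in the hint)
pc_num <- transform(pc,
  cd      = as.numeric(cd == "yes"),
  premium = as.numeric(premium == "yes")
)
cor_all <- cor(pc_num)
cor_sel <- cor(pc_num[, c("price", "ram", "ads")])
print(cor_sel)

# Requirement 6: keep only selected columns
sub_cols <- pc[, c("price", "cd", "premium")]

# Requirement 7: filter rows by a condition and keep selected columns
sub_rows <- subset(pc, price >= 1750, select = c(price, hd, ram))
print(sub_rows)

# Requirement 8: categorical analysis, share of each premium level in percent
premium_pct <- prop.table(table(pc$premium)) * 100
print(premium_pct)

# Requirement 9: contingency table of premium by cd
prem_cd <- table(premium = pc$premium, cd = pc$cd)
print(prem_cd)

# Requirement 10: the same table restricted to rows with price over 2000
prem_cd_2000 <- with(subset(pc, price > 2000),
                     table(premium = premium, cd = cd))
print(prem_cd_2000)
```

In the two tables, the cell at row "yes" and column "yes" holds the count of premium computers with a CD, for example prem_cd["yes", "yes"]. Reading the text columns as factors keeps both levels in the conditional table even when the filter leaves only one of them.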

Deliverable:

• About 6-8 pages. A maximum of two submissions is allowed.

• Follow standard APA guidelines, including an abstract, conclusion, and a reference page with a minimum of three references.

• Include a cover page containing the following: Title, Student’s name, University’s name, Course name, Course number, Professor’s name, and Date.

• A clickable, auto-generated Table of Contents is required.

• Figures and tables should be labeled according to APA guidelines.