Module #7 Assignment: Visualizing Distributions in R

 # Load required libraries

# -------------------------------

library(ggplot2)


# -------------------------------

# Step 1: Load and Inspect Dataset

# -------------------------------

data("mtcars")


# View first few rows

head(mtcars)


# Check structure

str(mtcars)


# -------------------------------

# Step 2: Histogram of MPG

# -------------------------------

hist_mpg <- ggplot(mtcars, aes(x = mpg)) +

  geom_histogram(binwidth = 2, fill = "steelblue", color = "black") +

  labs(title = "Distribution of Miles per Gallon (mpg)",

       x = "Miles per Gallon",

       y = "Count") +

  theme_minimal()


# Display plot

print(hist_mpg)


# Save plot as image

ggsave("hist_mpg.png", plot = hist_mpg, width = 6, height = 4, dpi = 300)


# -------------------------------

# Step 3: Density Plot of Horsepower by Cylinder

# -------------------------------

density_hp <- ggplot(mtcars, aes(x = hp, fill = factor(cyl))) +

  geom_density(alpha = 0.5) +

  labs(title = "Density of Horsepower by Cylinder Count",

       x = "Horsepower",

       y = "Density",

       fill = "Cylinders") +

  theme_minimal()


# Display plot

print(density_hp)


# Save plot as image

ggsave("density_hp.png", plot = density_hp, width = 6, height = 4, dpi = 300)


# -------------------------------

# Step 4: Faceted Scatter Plot of MPG vs Horsepower

# -------------------------------

scatter_mpg_hp <- ggplot(mtcars, aes(x = hp, y = mpg)) +

  geom_point(color = "darkgreen") +

  facet_wrap(~cyl) +

  labs(title = "MPG vs Horsepower by Cylinder Count",

       x = "Horsepower",

       y = "Miles per Gallon") +

  theme_minimal()


# Display plot

print(scatter_mpg_hp)


# Save plot as image

ggsave("scatter_mpg_hp.png", plot = scatter_mpg_hp, width = 6, height = 4, dpi = 300)


Results: 





For this assignment, I used the mtcars dataset to explore the distributions of miles per gallon (mpg) and horsepower (hp). The histogram of mpg showed that most cars cluster around 15–25 mpg, while a few cars exceed 30 mpg. The density plot of horsepower grouped by cylinder count revealed that cars with more cylinders tend to have higher horsepower, with overlapping distributions for 4- and 6-cylinder cars. The faceted scatter plots allowed me to compare mpg vs horsepower across cylinder groups, highlighting how engine size impacts fuel efficiency.

In designing these visualizations, I followed Few’s and Yau’s recommendations by using aligned axes for comparisons, meaningful color to differentiate groups, and clean grid lines to support interpretation. I avoided unnecessary visual embellishments, such as 3D effects or distracting backgrounds. I agree that many common visualizations fail to communicate the structure of distributions clearly, and by focusing on clarity and comparison, these plots effectively reveal patterns in the data.

Comments

Popular posts from this blog

Final Project LIS4317

Module 11 assignment