RStudio Conf 2022 ggplot2 图形设计：极坐标太空任务可视化解析

2025-06-02 02:22:34作者：卓炯娓

项目背景

本文基于 RStudio Conf 2022 中关于 ggplot2 图形设计的研讨会材料，重点解析如何创建极坐标太空任务可视化图表。该图表展示了太空探索人员在太空中的累计停留时间，以及他们首次和最后一次执行太空任务的年份。

数据准备

首先需要准备太空任务数据，包括：

library(tidyverse)

# 读取太空探索数据
df_astro <- read_csv('space_explorers.csv')

# 数据处理
df_missions <- df_astro %>%
  group_by(name) %>%
  summarize(
    hours = sum(hours_mission),
    year = min(year_of_mission),
    max_year = max(year_of_mission)
  ) %>%
  ungroup() %>%
  mutate(year = -year) %>%
  arrange(year) %>%
  mutate(id = row_number())

数据处理步骤包括：

按探索人员姓名分组
计算每位探索人员的总太空停留时间
记录首次和最后一次任务年份
对年份取负值以便后续可视化
为每位探索人员分配唯一ID

基础可视化构建

核心图层设计

g1 <- ggplot(df_missions, aes(x = id, y = hours, color = hours)) +
  geom_linerange(aes(ymin = 0, ymax = hours, alpha = hours), size = .25) +
  geom_point(aes(y = 0), shape = 15, size = .1, color = "#808080") +
  geom_point(aes(y = hours, size = hours))

关键几何对象：

geom_linerange() - 创建从基线到数据点的垂直线段
geom_point() - 在基线位置添加灰色方块标记
geom_point() - 在数据点位置添加彩色气泡

极坐标转换

g1 <- g1 + coord_polar(theta = "y", start = 0, clip = "off")

使用coord_polar()将直角坐标系转换为极坐标系，参数theta = "y"表示使用y轴作为角度轴。

比例尺调整

g1 <- g1 +
  scale_x_continuous(limits = c(-300, NA), expand = c(0, 0)) +
  scale_y_continuous(limits = c(0, 23000), expand = c(0, 0)) +
  scale_color_distiller(palette = "YlGnBu", direction = -1) +
  scale_size(range = c(.001, 3)) +
  scale_alpha(range = c(.33, .95))

比例尺设置包括：

x轴和y轴的范围和扩展
颜色渐变使用YlGnBu调色板
气泡大小范围
透明度范围

主题与标签优化

主题设置

g1 <- g1 +
  theme_void() +
  theme(
    plot.background = element_rect(fill = "black"),
    plot.margin = margin(-70, -70, -70, -70),
    legend.position = "none"
  )

使用theme_void()移除所有默认主题元素，并设置黑色背景和负边距以最大化绘图区域。

添加标签

# 准备标签数据
df_labs <- df_missions %>%
  filter(year %in% -c(1961, 197:201*10, 2019)) %>%
  group_by(year) %>%
  filter(id == min(id))

df_max <- df_missions %>%
  arrange(-hours) %>%
  slice(1) %>%
  mutate(
    first_name = str_remove(name, ".*, "),
    last_name = str_remove(name, "(?<=),.*"),
    label = paste("Between", abs(year), "and", max_year, ",\n", 
                 first_name, last_name, "has spent\n", 
                 format(hours, big.mark = ','), "hours in space.\nThat's roughly", 
                 round(hours / 24, 0), "days!")
  )

# 添加标签
g2 <- g1 +
  geom_text(
    data = df_labs, aes(y = 0, label = abs(year)),
    family = "Lato", fontface = "bold", color = "#808080",
    size = 4.5, hjust = 1.2
  ) +
  geom_text(
    data = df_max, aes(label = label)),
    family = "Lato", size = 3.9, vjust = -.35
  )

标题和说明文字

g2 <- g2 +
  annotate(
    geom = "text", x = -300, y = 0, label = "Travelling to\nOuter Space",
    family = "Boska", fontface = "bold", lineheight = .9,
    size = 20, color = "white", hjust = .57, vjust = .45, alpha = .25
  ) +
  annotate(
    geom = "text", x = -300, y = 0, label = "Travelling to\nOuter Space",
    family = "Boska", fontface = "bold", lineheight = .85,
    size = 20, color = "white", hjust = .55, vjust = .4
  ) +
  labs(caption = "Cumulative time in outer space...") +
  theme(
    plot.caption = element_text(
      family = "Lato",
      size = 15, color = "#808080", hjust = .5,
      margin = margin(-100, 0, 100, 0)
    )
  )

高级扩展技巧

使用扩展包增强视觉效果：

# 使用ggforce和ggblur扩展包
g_ext <- ggplot(df_missions, aes(x = id, y = hours, color = hours)) +
  ggforce::geom_link(aes(xend = id, yend = 0, alpha = hours), size = .25, n = 300) +
  ggblur::geom_point_blur(aes(size = hours, blur_size = hours), blur_steps = 25) +
  scico::scale_color_scico(palette = "buda") +
  ggblur::scale_blur_size_continuous(range = c(.5, 10), guide = "none")