Legend for summary statistics in ggplot2

Posted 2019-06-26 19:50

Here is the code for the plot:

library(ggplot2)
df <- data.frame(gp = factor(rep(letters[1:3], each = 10)), y = rnorm(30))
library(plyr)
ds <- ddply(df, .(gp), summarise, mean = mean(y), sd = sd(y))
ggplot(df, aes(x = gp, y = y)) +
   geom_point() +
   geom_point(data = ds, aes(y = mean), colour = 'red', size = 3)

I would like this plot to have a legend that identifies the data values and the mean values, something like this:

Black point = Data
Red point   = Mean.

Any pointers on how to get the desired result would be highly appreciated. Thanks.

Answer 1:

Use a manual scale, in your case scale_colour_manual. Then, inside the aes() call of each geom, map colour to one of the values defined in that scale:

ggplot(df, aes(x = gp, y = y)) +
  geom_point(aes(colour="data")) +
  geom_point(data = ds, aes(y = mean, colour = "mean"), size = 3) +
  scale_colour_manual("Legend", values=c("mean"="red", "data"="black"))
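
As a variation on the same idea, the means can be computed by ggplot2 itself instead of being precomputed in ds. The sketch below assumes ggplot2 3.3.0 or later, where stat_summary() accepts a fun argument (older versions used fun.y). The set.seed() call and the object name p are additions for illustration and are not part of the original answer.

```r
library(ggplot2)

set.seed(1)  # assumption: fixed seed so the example is reproducible
df <- data.frame(gp = factor(rep(letters[1:3], each = 10)), y = rnorm(30))

# Mapping colour to a constant string inside aes() creates a legend entry;
# scale_colour_manual() then assigns the actual colour to each label.
p <- ggplot(df, aes(x = gp, y = y)) +
  geom_point(aes(colour = "data")) +
  # stat_summary() computes the per-group mean of y and draws it as a point
  stat_summary(aes(colour = "mean"), fun = mean, geom = "point", size = 3) +
  scale_colour_manual("Legend", values = c("mean" = "red", "data" = "black"))

print(p)
```

This keeps the legend logic identical to the answer above and only changes where the means come from, so it may be convenient when ds is not needed elsewhere.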



Answer 2:

You can combine the means and the data in a single data.frame, then set colour and size by a column that is a factor taking the value data or mean:

library(reshape2)

# in long format
dsl <- melt(ds, value.name = 'y')
# add variable column to df data.frame
df[['variable']] <- 'data'
# combine
all_data <- rbind(df,dsl)

# drop sd rows
data_w_mean <- subset(all_data, variable != 'sd', drop = TRUE)

# create vectors for use with scale_..._manual
colour_scales <- setNames(c('black','red'),c('data','mean'))
size_scales <- setNames(c(1,3),c('data','mean') )

ggplot(data_w_mean, aes(x = gp, y = y)) +
  geom_point(aes(colour = variable, size = variable)) +
  scale_colour_manual(name = 'Type', values = colour_scales) +
  scale_size_manual(name = 'Type', values = size_scales)

Alternatively, you can skip the merge and instead include the variable column in both data sets:

dsl_mean <- subset(dsl, variable != 'sd', drop = TRUE)
ggplot(df, aes(x = gp, y = y, colour = variable, size = variable)) +
  geom_point() +
  geom_point(data = dsl_mean) +
  scale_colour_manual(name = 'Type', values = colour_scales) +
  scale_size_manual(name = 'Type', values = size_scales)

which gives the same result.
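
For comparison, here is a minimal sketch of the combined-data approach built with base R only, without plyr or reshape2. It swaps ddply() and melt() for aggregate(), so it is a substitute technique rather than the code from the answer. The names means and p, and the set.seed() call, are assumptions added for illustration.

```r
library(ggplot2)

set.seed(1)  # assumption: fixed seed so the example is reproducible
df <- data.frame(gp = factor(rep(letters[1:3], each = 10)), y = rnorm(30))

# Per-group means with base R; the result has columns gp and y
means <- aggregate(y ~ gp, data = df, FUN = mean)

# Tag each data set with its role, then stack them into one data.frame
df$variable <- "data"
means$variable <- "mean"
data_w_mean <- rbind(df, means)

# Named vectors for use with scale_..._manual
colour_scales <- c(data = "black", mean = "red")
size_scales <- c(data = 1, mean = 3)

p <- ggplot(data_w_mean, aes(x = gp, y = y)) +
  geom_point(aes(colour = variable, size = variable)) +
  scale_colour_manual(name = "Type", values = colour_scales) +
  scale_size_manual(name = "Type", values = size_scales)

print(p)
```

Because both scales share the name Type and the same labels, ggplot2 should merge them into a single legend, as in the plots above.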



Source: legend for summary statistics in ggplot2
Tags: r ggplot2