R 散点图可视化:如何将自定义多个基因着色为散点图?

R scatter plot visualization: how to color custom- multiple genes into scatter plot?

我尝试使用 R 使用基因表达数据制作基本散点图。

#import data: 

oldmice <- read.table("oldmice.txt", header = TRUE)
youngmice <- read.table("youngmice.txt", header = TRUE)

导入的数据包含:两个导入数据的格式相同,但 MGE 具有不同的值。

 gene   MGE
Sox17   -6.74193774617653   
Mrpl15  -0.212567471203473  
Lypla1  -0.711251006455475  
and so on.. 

制作基本的火山图使用:youngmice$MGE vs oldmice$MGE

plot(oldmice$MGE, youngmice$MGE, main="old vs young mice!!",
     xlab="oldmice$MGE ", ylab="youngmice$MGE ", pch=19)

我的问题是如何将multiple_gene_lists中的"genes"着色为oldmice$MGE、youngmice$MGE? (应该将 multiple_gene_lists 中唯一的 multiple_gene_list 标记为 oldmice$MGE、youngmice$MGE)

这是我的 multiple_gene_list

multiple_gene_list <- read.table("multiple_gene_list.txt", header = TRUE)
multiple_gene_list  <- as.vector(multiple_gene_list )

multiple_gene_list 包含:

gene
Six6
Arl2
Tmem74B
Rab9B
Rasgef1B
Ccne1
Apln
Spag7
C17Orf59
Krtap4-4

而我的目标是只在 oldmice$MGE、youngmice$MGE 中标记 multiple_gene_list。 我也尝试了以下代码但失败了!

with(subset(ASC_oldmice_exprs, ASC_oldmice_exprs$gene %in%  multiple_gene_list$gene), points(ASC_youngmice_exprs$MGE, pch=20, col="red"))

谢谢!

我们来获取一些数据:

multiple_gene_list =structure(list(gene = structure(c(8L, 2L, 10L, 6L, 7L, 4L, 1L, 
9L, 3L, 5L), .Label = c("Apln", "Arl2", "C17Orf59", "Ccne1", 
"Krtap4-4", "Rab9B", "Rasgef1B", "Six6", "Spag7", "Tmem74B"), 
class = "factor")), class = "data.frame", row.names = c(NA, 
-10L))

set.seed(111)

oldmice = data.frame(
gene=c("Six6","Arl2","Tmem74B",letters[1:10]),
MGE=runif(13))

youngmice = data.frame(
gene=c("Six6","Arl2","Tmem74B",letters[1:10]),
MGE=runif(13))

有 3 个重叠,我们定义颜色如下:

COLS = ifelse(oldmice$gene %in%  multiple_gene_list$gene,
"turquoise","orange")

和情节:

plot(oldmice$MGE, youngmice$MGE, main="old vs young mice!!",
     xlab="oldmice$MGE ", ylab="youngmice$MGE ", pch=19,col=COLS)

sel = oldmice$gene %in%  multiple_gene_list$gene
text(x=oldmice$MGE[sel]+0.01,
y=youngmice$MGE[sel]+0.01,
oldmice$gene[sel])