R 散点图可视化:如何将自定义多个基因着色为散点图?
R scatter plot visualization: how to color custom- multiple genes into scatter plot?
我尝试使用 R 使用基因表达数据制作基本散点图。
#import data:
oldmice <- read.table("oldmice.txt", header = TRUE)
youngmice <- read.table("youngmice.txt", header = TRUE)
导入的数据包含:两个导入数据的格式相同,但 MGE 具有不同的值。
gene MGE
Sox17 -6.74193774617653
Mrpl15 -0.212567471203473
Lypla1 -0.711251006455475
and so on..
制作基本的火山图使用:youngmice$MGE vs oldmice$MGE
plot(oldmice$MGE, youngmice$MGE, main="old vs young mice!!",
xlab="oldmice$MGE ", ylab="youngmice$MGE ", pch=19)
我的问题是如何将multiple_gene_lists中的"genes"着色为oldmice$MGE、youngmice$MGE? (应该将 multiple_gene_lists 中唯一的 multiple_gene_list 标记为 oldmice$MGE、youngmice$MGE)
这是我的 multiple_gene_list
multiple_gene_list <- read.table("multiple_gene_list.txt", header = TRUE)
multiple_gene_list <- as.vector(multiple_gene_list )
multiple_gene_list 包含:
gene
Six6
Arl2
Tmem74B
Rab9B
Rasgef1B
Ccne1
Apln
Spag7
C17Orf59
Krtap4-4
而我的目标是只在 oldmice$MGE、youngmice$MGE 中标记 multiple_gene_list。
我也尝试了以下代码但失败了!
with(subset(ASC_oldmice_exprs, ASC_oldmice_exprs$gene %in% multiple_gene_list$gene), points(ASC_youngmice_exprs$MGE, pch=20, col="red"))
谢谢!
我们来获取一些数据:
multiple_gene_list =structure(list(gene = structure(c(8L, 2L, 10L, 6L, 7L, 4L, 1L,
9L, 3L, 5L), .Label = c("Apln", "Arl2", "C17Orf59", "Ccne1",
"Krtap4-4", "Rab9B", "Rasgef1B", "Six6", "Spag7", "Tmem74B"),
class = "factor")), class = "data.frame", row.names = c(NA,
-10L))
set.seed(111)
oldmice = data.frame(
gene=c("Six6","Arl2","Tmem74B",letters[1:10]),
MGE=runif(13))
youngmice = data.frame(
gene=c("Six6","Arl2","Tmem74B",letters[1:10]),
MGE=runif(13))
有 3 个重叠,我们定义颜色如下:
COLS = ifelse(oldmice$gene %in% multiple_gene_list$gene,
"turquoise","orange")
和情节:
plot(oldmice$MGE, youngmice$MGE, main="old vs young mice!!",
xlab="oldmice$MGE ", ylab="youngmice$MGE ", pch=19,col=COLS)
sel = oldmice$gene %in% multiple_gene_list$gene
text(x=oldmice$MGE[sel]+0.01,
y=youngmice$MGE[sel]+0.01,
oldmice$gene[sel])
我尝试使用 R 使用基因表达数据制作基本散点图。
#import data:
oldmice <- read.table("oldmice.txt", header = TRUE)
youngmice <- read.table("youngmice.txt", header = TRUE)
导入的数据包含:两个导入数据的格式相同,但 MGE 具有不同的值。
gene MGE
Sox17 -6.74193774617653
Mrpl15 -0.212567471203473
Lypla1 -0.711251006455475
and so on..
制作基本的火山图使用:youngmice$MGE vs oldmice$MGE
plot(oldmice$MGE, youngmice$MGE, main="old vs young mice!!",
xlab="oldmice$MGE ", ylab="youngmice$MGE ", pch=19)
我的问题是如何将multiple_gene_lists中的"genes"着色为oldmice$MGE、youngmice$MGE? (应该将 multiple_gene_lists 中唯一的 multiple_gene_list 标记为 oldmice$MGE、youngmice$MGE)
这是我的 multiple_gene_list
multiple_gene_list <- read.table("multiple_gene_list.txt", header = TRUE)
multiple_gene_list <- as.vector(multiple_gene_list )
multiple_gene_list 包含:
gene
Six6
Arl2
Tmem74B
Rab9B
Rasgef1B
Ccne1
Apln
Spag7
C17Orf59
Krtap4-4
而我的目标是只在 oldmice$MGE、youngmice$MGE 中标记 multiple_gene_list。 我也尝试了以下代码但失败了!
with(subset(ASC_oldmice_exprs, ASC_oldmice_exprs$gene %in% multiple_gene_list$gene), points(ASC_youngmice_exprs$MGE, pch=20, col="red"))
谢谢!
我们来获取一些数据:
multiple_gene_list =structure(list(gene = structure(c(8L, 2L, 10L, 6L, 7L, 4L, 1L,
9L, 3L, 5L), .Label = c("Apln", "Arl2", "C17Orf59", "Ccne1",
"Krtap4-4", "Rab9B", "Rasgef1B", "Six6", "Spag7", "Tmem74B"),
class = "factor")), class = "data.frame", row.names = c(NA,
-10L))
set.seed(111)
oldmice = data.frame(
gene=c("Six6","Arl2","Tmem74B",letters[1:10]),
MGE=runif(13))
youngmice = data.frame(
gene=c("Six6","Arl2","Tmem74B",letters[1:10]),
MGE=runif(13))
有 3 个重叠,我们定义颜色如下:
COLS = ifelse(oldmice$gene %in% multiple_gene_list$gene,
"turquoise","orange")
和情节:
plot(oldmice$MGE, youngmice$MGE, main="old vs young mice!!",
xlab="oldmice$MGE ", ylab="youngmice$MGE ", pch=19,col=COLS)
sel = oldmice$gene %in% multiple_gene_list$gene
text(x=oldmice$MGE[sel]+0.01,
y=youngmice$MGE[sel]+0.01,
oldmice$gene[sel])