使用 R shiny 应用程序创建箱线图时出现问题

Problem in creating boxplots with R shiny app

我是 Shiny 应用程序 R 的新手。我正在尝试在 Shiny R 应用程序中为某些数据集制作简单的箱线图。

这里我展示了文件中的一些示例数据df.csv。数据如下所示。显示以下数据的dput

structure(list(Samples = structure(1:10, .Label = c("Sample1", 
"Sample10", "Sample2", "Sample3", "Sample4", "Sample5", "Sample6", 
"Sample7", "Sample8", "Sample9"), class = "factor"), Type = structure(c(2L, 
1L, 2L, 2L, 2L, 1L, 2L, 2L, 1L, 1L), .Label = c("Normal", "Tumor"
), class = "factor"), A1BG = c(0, 0.01869105, 0.026705782, 0.016576987, 
0, 0.007636787, 0.015756547, 0.00609601, 0.115575528, 0.04717536
), A1BG.AS1 = c(0, 0.096652515, 0.086710002, 0.04683499, 0.188283185, 
0.104318353, 0.102735593, 0.100064808, 0.04717536, 0.159745808
), A1CF = c(1.616942802, 1.367084444, 1.101855892, 1.3823884, 
0.631627098, 2.407159505, 1.687449785, 1.229844138, 0.87989414, 
0.642785868), A2M = c(3.357654845, 3.149165846, 3.654774122, 
2.851143092, 2.952601867, 4.002335454, 4.123949457, 3.691343955, 
3.553064673, 3.425443559), A2M.AS1 = c(0.217308191, 0.08268571, 
0.297320544, 0.101579093, 0.020102613, 0.35578965, 0.288014115, 
0.145352771, 0.043808388, 0.104677012), A2ML1 = c(0, 0.017949113, 
0.00984907, 0.002289616, 0, 0.002100359, 0.032146138, 0.052275569, 
0.537892142, 0), A2ML1.AS1 = c(0.631627098, 0.04717536, 1.229844138, 
0, 4.002335454, 0, 1.229844138, 1.229844138, 0.04717536, 0)), row.names = c(NA, 
-10L), class = "data.frame")

根据以上信息,我正在尝试制作一个闪亮的应用程序。我的代码如下所示:

library(shiny)

ui <- fluidPage(
  sidebarLayout(
    sidebarPanel(
      selectInput("thegene", "Gene", choices = c("A2M", "A1CF", "A2MP1"), selected = "A2M"),
      radioButtons("colour","Colour of histogram",choices=c("red","green","blue"),selected="red"),
      width = 3
    ),
    mainPanel(
      plotOutput("boxplot"),
      width = 9
    )
  )
)

server <- function(input, output) {

  df <- read.csv("df.csv")

  library(reshape2)
  library(ggplot2)
  library(ggpubr)
  library(EnvStats)

  df.m <- melt(df, c("Samples", "Type"))

  output$boxplot <- renderPlot({
    ggplot(data=df.m, aes(x = Type, y = value, fill=variable)) +
      geom_boxplot() +
      theme_bw(base_size = 14) + xlab("") + ylab("Expression logFPKM") +
      theme(axis.text=element_text(size=15, face = "bold", color = "black"),
            axis.title=element_text(size=15, face = "bold", color = "black"),
            strip.text = element_text(size=15, face = "bold", color = "black")) +
      stat_compare_means(method = "t.test", size=5) + stat_n_text()
  })

}

# Run the application 
shinyApp(ui = ui, server = server)

因此,我重塑了信息,然后尝试制作一个应用程序来为肿瘤(6 个样本)和正常(4 个样本)之间的每个基因创建一个箱线图。

我没有看到任何错误,但我也没有得到想要的结果。我上面代码的输出如下所示:

1)每个Type下面箱线图中的样本数是错误的。

2) 对于基因的选择,我在那里只能看到三个基因。我在那里没有看到其他基因。如何检查其他基因?

3) 直方图的颜色也不起作用。

感谢任何帮助。谢谢。

试试这个。

我做了一些改动,你可以保留一些,而另一些则反转。

  1. 我没有ggpubrEnvStats,所以我删除了一些绘图摘要。
  2. 我定义了静态数据,您应该 return 到您的 read.csv 解决方案。
  3. 我在服务器声明中添加了 session,如果您想以编程方式更新任何输入,则需要这样做。
  4. 我有一个低效的反应块,它只是 return 所有原始数据;就目前而言,这是反惯用语,但添加它只是为了演示 updateSelectInput if/when 源数据更改的正确使用。仅当您的数据动态变化(例如,用户上传数据或数据库查询)时才需要这样做,否则 alldat() 应该只是 df.m(并且您的输入应该静态定义)。
  5. 我更新了颜色单选按钮的使用。
library(shiny)
library(reshape2)
library(ggplot2)
library(ggpubr)
library(EnvStats)

df <- structure(list(Samples = structure(1:10, .Label = c("Sample1", 
"Sample10", "Sample2", "Sample3", "Sample4", "Sample5", "Sample6", 
"Sample7", "Sample8", "Sample9"), class = "factor"), Type = structure(c(2L, 
1L, 2L, 2L, 2L, 1L, 2L, 2L, 1L, 1L), .Label = c("Normal", "Tumor"
), class = "factor"), A1BG = c(0, 0.01869105, 0.026705782, 0.016576987, 
0, 0.007636787, 0.015756547, 0.00609601, 0.115575528, 0.04717536
), A1BG.AS1 = c(0, 0.096652515, 0.086710002, 0.04683499, 0.188283185, 
0.104318353, 0.102735593, 0.100064808, 0.04717536, 0.159745808
), A1CF = c(1.616942802, 1.367084444, 1.101855892, 1.3823884, 
0.631627098, 2.407159505, 1.687449785, 1.229844138, 0.87989414, 
0.642785868), A2M = c(3.357654845, 3.149165846, 3.654774122, 
2.851143092, 2.952601867, 4.002335454, 4.123949457, 3.691343955, 
3.553064673, 3.425443559), A2M.AS1 = c(0.217308191, 0.08268571, 
0.297320544, 0.101579093, 0.020102613, 0.35578965, 0.288014115, 
0.145352771, 0.043808388, 0.104677012), A2ML1 = c(0, 0.017949113, 
0.00984907, 0.002289616, 0, 0.002100359, 0.032146138, 0.052275569, 
0.537892142, 0), A2ML1.AS1 = c(0.631627098, 0.04717536, 1.229844138, 
0, 4.002335454, 0, 1.229844138, 1.229844138, 0.04717536, 0)), row.names = c(NA, 
-10L), class = "data.frame")
df.m <- reshape2::melt(df, c("Samples", "Type"))

ui <- fluidPage(
  sidebarLayout(
    sidebarPanel(
      selectInput("thegene", "Gene", choices = c("A2M", "A1CF", "A2MP1"), selected = "A2M"),
      radioButtons("colour","Colour of histogram",choices=c("red","green","blue"),selected="red"),
      width = 3
    ),
    mainPanel(
      plotOutput("boxplot"),
      width = 9
    )
  )
)

server <- function(input, output, session) {

  alldat <- reactive({
    # this is not an efficient use of a reactive block: since it does
    # not depend on any dynamic data, it will fire only once, so if
    # your data is static then this might be a touch overkill ... but
    # the premise is that your `df.m` is data that can change based on
    # updating it (e.g., DB query) or user-uploaded data (e.g., CSV
    # upload)
    choices <- unique(df.m$variable)
    selected <- isolate(input$thegene)
    if (!selected %in% choices) selected <- choices[1]
    updateSelectInput(session, "thegene", choices = choices, selected = selected)
    df.m
  })

  dat <- reactive({
    x <- alldat()
    x[ x$variable == input$thegene,,drop=FALSE]
  })

  output$boxplot <- renderPlot({
    ggplot(data = dat(), aes(x = Type, y = value, fill = variable)) +
      geom_boxplot() +
      theme_bw(base_size = 14) + xlab("") + ylab("Expression logFPKM") +
      theme(axis.text=element_text(size=15, face = "bold", color = "black"),
            axis.title=element_text(size=15, face = "bold", color = "black"),
            strip.text = element_text(size=15, face = "bold", color = "black")) +
      scale_fill_manual(values = input$colour)
  })

}

# Run the application 
shinyApp(ui = ui, server = server)

一些notes/opinions:

  • 当由于过滤或用户提供的修饰符而有动态数据时,我发现有一个反应块很好只是filtering/modifying,这样修改后的数据可以在多个依赖的反应块中使用,ergo my dat <- reactive(...)
  • 更重要的是,我发现许多不太好的闪亮应用程序试图在单个反应块中做太多事情;当我看到很多事情发生时,我倾向于认为 (a) 将反应块拆分成更小的块,尤其是当代码在多个块中重复时; and/or (b) 编写完成大部分工作的外部函数,以便闪亮的应用程序本身看起来更紧凑。声明函数名称可以使 readability/maintainability 更容易(并且可以进行单元测试!)。
  • 没有对此添加任何保护措施;一种这样的保护措施(尽管此应用程序不会立即显示)是使用 req() 来确保输入在启动期间具有 "stabilized"。对于较大的应用程序,人们可能会注意到一些反应块在(例如)input$thegene 具有有效值之前触发,这可能导致一些 plots/tables 闪烁。
  • 当输入 select 很快就会变成 over-written/updated 时,我通常会选择 choices="(initializing)" 或类似的东西;在这种情况下,具有合理的默认选择是有意义的,只要这些选择很可能或肯定会出现在真实数据中。