R 中缺少 date/value 的 'Interpolation'？

Question

我有一个这样的数据框：

Month         CumulativeSum
2019-02-01    40
2019-03-01    70
2019-04-01    80
2019-07-01    100
2019-08-01    120

问题是5月和6月什么都没有发生，所以没有数据。在条形图中绘制它会导致 x 轴上出现一些空的 space。有没有办法像这样使用最后一个已知值“填充”缺失的位置？:

Month         CumulativeSum
2019-02-01    40
2019-03-01    70
2019-04-01    80
**2019-05-01    80**  <--
**2019-06-01    80**  <--
2019-07-01    100
2019-08-01    120

Answer 1

我们可以使用complete

library(dplyr)
library(tidyr)
df1 %>%
  complete(Month = seq(min(Month), max(Month), by = '1 month')) %>%
  fill(CumulativeSum)

-输出

# A tibble: 7 x 2
#  Month      CumulativeSum
#  <date>             <int>
#1 2019-02-01            40
#2 2019-03-01            70
#3 2019-04-01            80
#4 2019-05-01            80
#5 2019-06-01            80
#6 2019-07-01           100
#7 2019-08-01           120

数据

df1 <- structure(list(Month = structure(c(17928, 17956, 17987, 18078, 
18109), class = "Date"), CumulativeSum = c(40L, 70L, 80L, 100L, 
120L)), row.names = c(NA, -5L), class = "data.frame")

Answer 2

这是使用 cummax

的基础 R 选项

transform(
  data.frame(
    Month = seq(min(df$Month), max(df$Month), by = "1 month"),
    CumulativeSum = -Inf
  ),
  CumulativeSum = cummax(replace(CumulativeSum, Month %in% df$Month, df$CumulativeSum))
)

这给出了

       Month CumulativeSum
1 2019-02-01            40
2 2019-03-01            70
3 2019-04-01            80
4 2019-05-01            80
5 2019-06-01            80
6 2019-07-01           100
7 2019-08-01           120

R 中缺少 date/value 的 'Interpolation'？

'Interpolation' of a missing date/value in R?

interpolation

r

dataframe

dplyr

数据