将时间戳转换为 R 中的频率分档时间序列？

Question

我根据对特定服务的请求选择了一些分散的时间戳数据。此数据涵盖大约 3.5-4 年的针对此服务的请求。

我希望将这种可变间隔时间戳的选择转换为 R 中的频率分级时间序列。

我如何将这些时间戳转换为频率分档时间序列，例如 "between 1 and 1:15PM on this day, there were 7 requests, and between 1:15 and 1:30PM there were 2, and between 1:30 and 1:45, there were 0"，确保也有一个什么都没有的分档？

数据只是来自数据库转储的时间戳向量，所有格式为：“”2014-02-17 13:10:46”。只是一个包含约 200 万个对象的大向量.

Answer 1

您可以使用工具来处理来自 xts 和 zoo 的时间序列数据。请注意，您将需要一些人工 'data':

library(xts)
set.seed(42)
ts.index <- ISOdatetime(2018, 1, 8, 8:9, sample(60, 10), 0)
ts <- xts(rep(1, length(ts.index)), ts.index)
aggregate(ts, time(ts) - as.numeric(time(ts)) %% 900, length, regular = TRUE)
#>                      
#> 2018-01-08 08:15:00 1
#> 2018-01-08 08:30:00 3
#> 2018-01-08 08:45:00 1
#> 2018-01-08 09:00:00 1
#> 2018-01-08 09:15:00 1
#> 2018-01-08 09:45:00 3

编辑：如果你想包含没有观测值的 bins，你可以转换为严格规则的 ts 对象并将插入的 NA 值替换为零：

raw <- aggregate(ts, time(ts) - as.numeric(time(ts)) %% 900, length, regular = TRUE)
as.xts(na.fill(as.ts(raw), 0), dateFormat = "POSIXct")
#>                     zoo(coredata(x), tt)
#> 2018-01-08 08:15:00                    1
#> 2018-01-08 08:30:00                    3
#> 2018-01-08 08:45:00                    1
#> 2018-01-08 09:00:00                    1
#> 2018-01-08 09:15:00                    1
#> 2018-01-08 09:30:00                    0
#> 2018-01-08 09:45:00                    3

编辑 2: 它也适用于提供的示例数据：

library(xts)
data <- c(1228917812, 1245038910, 1245986979, 1268750482, 1281615510, 1292561113)
class(data) = c("POSIXct", "POSIXt")
attr(data, "tzone") <- "UTC"
dput(data)
#> structure(c(1228917812, 1245038910, 1245986979, 1268750482, 1281615510, 
#> 1292561113), class = c("POSIXct", "POSIXt"), tzone = "UTC")
ts <- xts(rep(1, length(data)), data)
raw <- aggregate(ts, time(ts) - as.numeric(time(ts)) %% 900, length, regular = TRUE)
head(as.xts(na.fill(as.ts(raw), 0), dateFormat = "POSIXct"))
#>                     zoo(coredata(x), tt)
#> 2008-12-10 15:00:00                    1
#> 2008-12-10 15:15:00                    0
#> 2008-12-10 15:30:00                    0
#> 2008-12-10 15:45:00                    0
#> 2008-12-10 16:00:00                    0
#> 2008-12-10 16:15:00                    0

将时间戳转换为 R 中的频率分档时间序列？

Convert timestamps to frequency-binned timeseries in R?

r

time-series

aggregation