无法在 MongoDB 中使用子查询创建查询

Question

我有一个看起来像这样的数据集：

{
  "id": "02741544",
  "items": [{
    "item": "A"
  }]
}, {
  "id": "02472691",
  "items": [{
    "item": "A"
  }, {
    "item": "B"
  }, {
    "item": "C"
  }]
}, {

  "id": "01316523",
  "items": [{
    "item": "A"
  }, {
    "item": "B"
  }]
}, {
  "id": "01316526",
  "items": [{
    "item": "A"
  }, {
    "item": "B"
  }]
}, {
  "id": "01316529",
  "items": [{
    "item": "A"
  }, {
    "item": "D"
  }]
},

我正在尝试创建一个查询，它会给我一个如下所示的输出：

{
  "item": "A",
  "ids": [{
    "id": "02741544"

  }, {
    "id": "02472691"

  }, {
    "id": "01316523"

  }, {
    "id": "01316526"

  }, {
    "id": "01316529"

  }]
}, {
  "item": "B",
  "ids": [{
    "id": "02472691"

  }, {
    "id": "01316523"

  }, {
    "id": "01316526"

  }]
}, {
  "item": "C",
  "ids": [{
    "id": "02472691"

  }]
}, {
  "item": "D",
  "ids": [{
    "id": "02472691"

  }]
},

基本上，我试图从对象中的项目数组中获取不同的项目，然后为项目数组中包含该项目的每个对象返回一个 ID 数组。

Answer 1

最好使用 aggregation framework，您需要在其中运行包含以下管道步骤（按给定顺序）的操作：

$unwind - 此初始步骤将展平 items 数组，即它为每个数组条目生成每个文档的副本。这对于在管道中进一步处理文档作为 "denormalised" 文档是必要的，您可以将其聚合为组。
$group - This will group the flattened documents by the item subdocument key and create the ids list by using the $push 累加器运算符。

--更新--

正如@AminJ 在评论中指出的那样，如果 items 可以有重复的项目值并且您不希望结果中有重复的 id，您可以使用 $addToSet instead of $push

以下示例演示了这一点：

db.collection.aggregate([   
    { "$unwind": "$items" },
    {
        "$group": {
            "_id": "$items.item",
            "ids": { 
                "$push": { "id": "$id" } /* or use 
                "$addToSet": { "id": "$id" } if you don't want duplicate ids */                    
            }
        }
    }
])

示例输出

{
    "_id" : "A",
    "ids" : [ 
        { "id" : "02741544" }, 
        { "id" : "02472691" }, 
        { "id" : "01316523" }, 
        { "id" : "01316526" }, 
        { "id" : "01316529" }
    ]
}

/* 2 */
{
    "_id" : "B",
    "ids" : [ 
        { "id" : "02472691" }, 
        { "id" : "01316523" }, 
        { "id" : "01316526" }
    ]
}

/* 3 */
{
    "_id" : "C",
    "ids" : [ 
        { "id" : "02472691" }
    ]
}

/* 4 */
{
    "_id" : "D",
    "ids" : [ 
        { "id" : "01316529" }
    ]
}

aggregate() function is a cursor to the documents produced by the final stage of the aggregation pipeline operation. So if you want the results in an array you can use the cursor's toArray() 方法的结果，returns 一个包含所有文档的数组。

例如：

var pipeline = [    
        { "$unwind": "$items" },
        {
            "$group": {
                "_id": "$items.item",
                "ids": { 
                    "$push": { "id": "$id" } /* or use 
                    "$addToSet": { "id": "$id" } if you don't want duplicate ids */                    
                }
            }
        }
    ],
    results = db.collection.aggregate(pipeline).toArray();

printjson(results);

Answer 2

这是一个使用聚合管道的解决方案：

    db.col.aggregate([
        {
            $unwind: "$items"
        },
        {
            $project: {
                id: 1,
                item: "$items.item"
            }
        },
        {
            $group: {
                _id: "$item",
                ids: {
                    $push: "$id"
                }
            }
        }
    ])

无法在 MongoDB 中使用子查询创建查询

Trouble creating query in MongoDB with subquery

javascript

mongodb

mongodb-query

aggregation-framework

mongodb-aggregation