Numpy 向量化:查找列表和列表列表之间的交集

Numpy vectorization: Find intersection between list and list of lists

我试图找到列表和列表列表之间的交集。这可以通过一个简单的 for 循环轻松解决:

def find_intersec(x,y):
    result = []

    for i in range(len(y)):
        if set(x).intersection(set(y[i])):
            result.append(y[i])

    return(result)

x = [1,2,3,4,5,6]
y = [[1,2,3], [4,5,6], [9,10,11]]



find_intersec(x,y)

如何把上面的改成numpy的矢量化方案?我已经尝试 numpy.intersect1d() 但没有成功。

你可以有这样的功能:

import numpy as np

def find_intersec_vec(x, y):
    y_all = np.concatenate(y)
    y_all_in = np.isin(y_all, x)
    splits = np.cumsum([0] + [len(lst) for lst in y])
    y_in = np.logical_or.reduceat(y_all_in, splits[:-1])
    return [lst for lst, isin in zip(y, y_in) if isin]

测试:

x = [1, 2, 3, 4, 5, 6]
y = [[1, 2, 3], [4, 5], [6, 7], [8, 9, 10, 11]]
print(find_intersec(x, y))
# [[1, 2, 3], [4, 5], [6, 7]]
print(find_intersec_vec(x, y))
# [[1, 2, 3], [4, 5], [6, 7]]

如你所说,可以使用numpy.intersect1d()

import numpy as np

x = [1,2,3,4,5,6]
y = [[1,2,3], [4,5,6], [9,10,11]]

intersec = [np.intersect1d(i, x) for i in y if len(np.intersect1d(i, x)) > 0]

结果:

[array([1, 2, 3]), array([4, 5, 6])]