在 Python 中创建没有 for 循环的内核矩阵

Question

我知道还有其他帖子在问类似的问题，但没找到能回答我的具体问题的帖子。我有以下代码：

def kernel_function(self, x1, x2):
    h = 0.5
    return np.exp(-(np.linalg.norm(x2 - x1)/h)**2)

for i, x1 in enumerate(train_x):
    for j, x2 in enumerate(train_x):
        K[i,j] = self.kernel_function(x1, x2)

其中 x1 和 x2 是形状为 (2,) 的数组。我需要对它进行 vertorize 以提高性能。我看了np.fromfunction、np.outer，但它们似乎不是我要找的...

提前谢谢你。抱歉，如果某处已经有答案！

Answer 1

假设 train_x 具有以下格式：

>>> train_x = np.array(((-.2, -.1), (0, .1), (.2, 0), (.1, -.1)))

执行你的代码你得到：

>>> np.set_printoptions(precision=2)
>>> K
[[1.   0.73 0.51 0.7 ]
 [0.73 1.   0.82 0.82]
 [0.51 0.82 1.   0.92]
 [0.7  0.82 0.92 1.  ]]

你可以重塑 train_x:

>>> train_x_cols = train_x.T.reshape(2, -1, 1)
>>> train_x_rows = train_x.T.reshape(2, 1, -1)

所以，多亏了广播，当你减去它们时你得到了所有的组合：

>>> train_x_rows - train_x_cols
[[[ 0.   0.2  0.4  0.3]
  [-0.2  0.   0.2  0.1]
  [-0.4 -0.2  0.  -0.1]
  [-0.3 -0.1  0.1  0. ]]

 [[ 0.   0.2  0.1  0. ]
  [-0.2  0.  -0.1 -0.2]
  [-0.1  0.1  0.  -0.1]
  [ 0.   0.2  0.1  0. ]]]

并且您可以重写 kernel_function() 以仅计算第一个轴上的范数：

def kernel_function(x1, x2):
    h = 0.5
    return np.exp(-(np.linalg.norm(x2 - x1, axis=0) / h) ** 2)

然后你得到：

>>> kernel_function(train_x_cols, train_x_rows)
[[1.   0.73 0.51 0.7 ]
 [0.73 1.   0.82 0.82]
 [0.51 0.82 1.   0.92]
 [0.7  0.82 0.92 1.  ]]

在 Python 中创建没有 for 循环的内核矩阵

Creating a Kernel matrix without for-loops in Python

python

numpy

vectorization