在非均匀二维网格上插入缺失值

Question

我正在尝试在 Python 中插入二维数组的缺失值。我发现，但是在那种情况下，行和列都是等距的。在我的例子中，我有两个数组

x = [275. 290. 310. 330. 350. 410. 450.]
y = [ 8. 12. 16. 20. 30. 35. 40. 45.]

其中 x 和 y 是表示我的二维数组

所在的列和行节点的网格坐标

c = [[4 6   9 9   9 8 2]
     [1 6   3 7   1 5 4]
     [8 nan 3 nan 2 9 2]
     [8 2   3 4   3 4 7]
     [2 nan 4 nan 6 1 3]
     [4 nan 8 nan 1 7 6]
     [8 nan 6 nan 5 6 5]
     [1 nan 1 nan 3 1 9]]

已定义。

填充缺失值的最佳方法是什么？

Answer 1

scipy包括网格数据的二维插值（还有一些其他插值函数）：

import numpy as np
import pandas as pd
from numpy import nan
from scipy.interpolate import griddata

x = [275, 290, 310, 330, 350, 410, 450]
y = [ 8, 12, 16, 20, 30, 35, 40, 45,]
c = np.array([[ 4.,  6.,  9.,  9.,  9.,  8.,  2.],
       [ 1.,  6.,  3.,  7.,  1.,  5.,  4.],
       [ 8., nan,  3., nan,  2.,  9.,  2.],
       [ 8.,  2.,  3.,  4.,  3.,  4.,  7.],
       [ 2., nan,  4., nan,  6.,  1.,  3.],
       [ 4., nan,  8., nan,  1.,  7.,  6.],
       [ 8., nan,  6., nan,  5.,  6.,  5.],
       [ 1., nan,  1., nan,  3.,  1.,  9.]])

# generate x_coord, y_coord values for each grid point
x_grid, y_grid = np.meshgrid(x,y)

# get known values to set the interpolator
mask = [~np.isnan(c)]
x = x_grid[mask].reshape(-1)
y = y_grid[mask].reshape(-1)
points = np.array([x,y]).T
values = c[mask].reshape(-1) 

# generate interpolated grid data
interp_grid = griddata(points, values, (x_grid, y_grid), method='nearest')

# interp grid:

array([[4., 6., 9., 9., 9., 8., 2.],
       [1., 6., 3., 7., 1., 5., 4.],
       [8., 6., 3., 7., 2., 9., 2.],
       [8., 2., 3., 4., 3., 4., 7.],
       [2., 2., 4., 4., 6., 1., 3.],
       [4., 4., 8., 4., 1., 7., 6.],
       [8., 8., 6., 6., 5., 6., 5.],
       [1., 1., 1., 1., 3., 1., 9.]])

在非均匀二维网格上插入缺失值

Interpolate missing values on non-uniform 2D grid

python

arrays

interpolation