Correcting NaN values/loss for ANN in TensorFlow
I am running a churn model using TensorFlow and I am getting a NaN loss. Reading around, I found that my data may contain some NaN values, which print(np.any(np.isnan(X_test))) confirmed.
I tried using
def standardize(train, test):
    mean = np.mean(train, axis=0)
    std = np.std(train, axis=0) + 0.000001
    X_train = (train - mean) / std
    X_test = (test - mean) / std
    return X_train, X_test
but it still results in NaN values.
In case it helps, here is the full code:
import numpy as np
import matplotlib.pyplot as plt
import pandas as pd
import tensorflow as tf
dataset = pd.read_excel('CHURN DATA.xlsx')
X = dataset.iloc[:, 2:45].values
y = dataset.iloc[:, 45].values
from sklearn.preprocessing import LabelEncoder
le = LabelEncoder()
X[:, 1] = le.fit_transform(X[:,1])
from sklearn.compose import ColumnTransformer
from sklearn.preprocessing import OneHotEncoder
ct = ColumnTransformer(transformers=[('encoder', OneHotEncoder(),[0])], remainder = 'passthrough')
X = np.array(ct.fit_transform(X))
from sklearn.model_selection import train_test_split
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size = 0.2)
from sklearn.preprocessing import StandardScaler
sc = StandardScaler()
X_train = sc.fit_transform(X_train)
X_test = sc.transform(X_test)
ann = tf.keras.models.Sequential()
ann.add(tf.keras.layers.Dense(units = 43, activation = 'relu'))
ann.add(tf.keras.layers.Dense(units = 43, activation = 'relu'))
ann.add(tf.keras.layers.Dense(units = 1, activation = 'sigmoid'))
ann.compile(optimizer = 'adam', loss = 'binary_crossentropy', metrics = ['accuracy'])
ann.fit(X_train, y_train, batch_size = 256, epochs = 50)
You have not replaced the nan values, and your data may also contain some inf and -inf values. You can replace all of them with 0.
For a DataFrame:
X.replace([np.inf, -np.inf], np.nan, inplace=True)
X = X.fillna(0)
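Before filling, it can help to see which columns actually hold the bad values. The following is a minimal sketch of that check, assuming the features are in a pandas DataFrame; the column names `tenure` and `charges` are made up for illustration and are not taken from the question's spreadsheet.

```python
import numpy as np
import pandas as pd

# Hypothetical toy frame standing in for the real feature DataFrame.
df = pd.DataFrame({
    "tenure": [1.0, np.nan, 3.0],
    "charges": [10.0, np.inf, -np.inf],
})

# Only numeric columns can hold inf; isna() also covers NaN in them.
numeric = df.select_dtypes(include="number")
nan_counts = numeric.isna().sum()       # NaN count per column
inf_counts = np.isinf(numeric).sum()    # +inf / -inf count per column
print(nan_counts)
print(inf_counts)

# Same replacement as above, written without inplace so df is left untouched.
cleaned = df.replace([np.inf, -np.inf], np.nan).fillna(0)
print(cleaned)
```

Filling with 0 follows the suggestion above; whether 0 is a sensible stand-in for a missing value depends on the column.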
Or, if your data is in a NumPy array:
X[np.isnan(X)] = 0
X[X == np.inf] = 0
X[X == -np.inf] = 0
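This also explains why the standardize function from the question did not help: the mean and standard deviation of a column that contains NaN or inf are themselves not finite, so the bad values spread to the whole column instead of disappearing. Below is a small sketch of that effect on toy data, assuming a float NumPy array. `clean_array` is a hypothetical helper, not part of any library; it does the same replacement as the three lines above using `np.nan_to_num`. The cast to float is there because `np.isnan` raises a TypeError on object-dtype arrays, which is what `.values` returns for a frame with mixed column types.

```python
import numpy as np

def clean_array(X):
    """Hypothetical helper: float copy of X with NaN, +inf and -inf set to 0."""
    X = np.asarray(X, dtype=float)  # cast first; object arrays break np.isnan
    return np.nan_to_num(X, nan=0.0, posinf=0.0, neginf=0.0)

def standardize(train, test):
    # Same logic as the function in the question.
    mean = np.mean(train, axis=0)
    std = np.std(train, axis=0) + 0.000001
    return (train - mean) / std, (test - mean) / std

# Toy data: one NaN and one inf in the training set.
raw_train = np.array([[1.0, np.nan], [2.0, 4.0], [np.inf, 6.0]])
raw_test = np.array([[3.0, 5.0]])

# Standardizing the raw data: the column statistics are not finite,
# so NaN spreads through the result.
with np.errstate(all="ignore"):  # silence the expected RuntimeWarnings
    dirty_train, dirty_test = standardize(raw_train, raw_test)
print(np.isnan(dirty_train).any())

# Cleaning first, then standardizing, gives finite values everywhere.
clean_train, clean_test = standardize(clean_array(raw_train), clean_array(raw_test))
print(np.isfinite(clean_train).all(), np.isfinite(clean_test).all())
```

In the question's pipeline the cleaning step would go before StandardScaler is fitted, so that the scaler's statistics are computed on finite values.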