输出层的形状总是 (1,)

Question

我正在尝试制作一个简单的 Keras 模型。但是无论我指定什么输出形状，输出层总是形状 (1,) 所以我无法训练我的模型，因为输出层和目标数据形状不匹配。

import keras
from keras.models import Sequential
from keras.layers import InputLayer, LSTM, Dense

# 63 is the number of unique characters
# 128 is the length of a sequence of characters

X = ... # X is an one-hot ndarray; X.shape == (96092, 128, 63)
Y = ... # Y is an one-hot ndarray; Y.shape == (96092, 63)

model = Sequential()
model.add(InputLayer([128, 63]))
model.add(LSTM(96))
model.add(Dense(63))

model.compile(
  optimizer=keras.optimizers.RMSprop(1e-3, decay=1e-5),
  loss=keras.losses.sparse_categorical_crossentropy,
)

model.fit(X, Y) # ValueError: Error when checking target: expected dense_4 to have shape (1,) but got array with shape (63,)

如您所见，输出密集层的形状是 (1,)，但它必须是 (63,) 的形状。我做错了什么？

我正在使用 Google 预装 Keras 的 Colab。

Answer 1

那个错误是说输出层的形状是 63。但是，由于某种原因，它期望 1 .

在这种情况下，它需要 1 的原因是因为您正在使用 sparse_categorical_crossentropy，它需要一个表示输出类别索引的整数。相反，使用 categorical_crossentropy，它期望对输出类别进行单热编码。

输出层的形状总是 (1,)

Output layer's shape is always (1,)

python

shapes

keras

jupyter-notebook