I see that you are padding the inputs to get equal length of 52. But, it seems padding is applied to only character inputs but not to words.
# 0-pads all words
def padding(Sentences):
maxlen = 52
for sentence in Sentences:
char = sentence[2]
for x in char:
maxlen = max(maxlen, len(x))
for i, sentence in enumerate(Sentences):
Sentences[i][2] = pad_sequences(Sentences[i][2], 52, padding='post')
return Sentences
dataset.append([wordIndices, caseIndices, charIndices, labelIndices])
return dataset
I see that you have made batches of inputs with words of equal length. Is this the correct approach?
Can you pls let me know.