I'm getting TypeError: expected string or bytes-like object "
I'm getting TypeError: expected string or bytes-like object "
import nltk
from nltk.stem import PorterStemmer
from nltk.corpus import stopwords
paragraph = ''' State-run Bharat Sanchar Nigam Ltd (BSNL) is readying to pay November salary in another two days, which will be raised from internal accruals and bank loans.'''
sentence = nltk.sent_tokenize(paragraph)
stemmer = PorterStemmer()
for i in range(len(sentence)):
words = nltk.word_tokenize(i)
words = [stemmer.stem(word) for word in words if word not in set(stopwords.words('english'))]
sentence[i] = ' '.join(words)
我在这部分遇到错误
words = nltk.word_tokenize(i)
range()
产生一个可迭代的整数。因此,当您将 i
输入 nltk.word_tokenize()
时,您输入的是一个整数。显然,整数不是字符串类型的。
我个人不知道 nltk.word_tokenize()
应该如何工作,但根据上下文线索,您似乎可能希望在索引 i
处传递句子对象 ] 而不仅仅是索引 i
:
words = nltk.word_tokenize(sentence[i])
import nltk
from nltk.stem import PorterStemmer
from nltk.corpus import stopwords
paragraph = ''' State-run Bharat Sanchar Nigam Ltd (BSNL) is readying to pay November salary in another two days, which will be raised from internal accruals and bank loans.'''
sentence = nltk.sent_tokenize(paragraph)
stemmer = PorterStemmer()
for i in range(len(sentence)):
words = nltk.word_tokenize(i)
words = [stemmer.stem(word) for word in words if word not in set(stopwords.words('english'))]
sentence[i] = ' '.join(words)
我在这部分遇到错误
words = nltk.word_tokenize(i)
range()
产生一个可迭代的整数。因此,当您将 i
输入 nltk.word_tokenize()
时,您输入的是一个整数。显然,整数不是字符串类型的。
我个人不知道 nltk.word_tokenize()
应该如何工作,但根据上下文线索,您似乎可能希望在索引 i
处传递句子对象 ] 而不仅仅是索引 i
:
words = nltk.word_tokenize(sentence[i])