Easy to reproduce:
>>> from nltk import PorterStemmer
>>> stemmer = PorterStemmer()
>>> stemmer.stem('oing')
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
  File "/Users/peterbe/virtualenvs/songsearch/lib/python3.5/site-packages/nltk/stem/porter.py", line 665, in stem
    stem = self._step1b(stem)
  File "/Users/peterbe/virtualenvs/songsearch/lib/python3.5/site-packages/nltk/stem/porter.py", line 376, in _step1b
    lambda stem: (self._measure(stem) == 1 and
  File "/Users/peterbe/virtualenvs/songsearch/lib/python3.5/site-packages/nltk/stem/porter.py", line 258, in _apply_rule_list
    if suffix == '*d' and self._ends_double_consonant(word):
  File "/Users/peterbe/virtualenvs/songsearch/lib/python3.5/site-packages/nltk/stem/porter.py", line 214, in _ends_double_consonant
    word[-1] == word[-2] and
IndexError: string index out of range
>>> import nltk
>>> nltk.__version__
'3.2.2'
I get this error for the word "aed".
from nltk.stem.porter import PorterStemmer
from nltk.corpus import stopwords
stemmer = PorterStemmer()
stemmer.stem('aed')
Error:
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
  File "/Users/peter.hurford/.virtualenvs/rex/lib/python2.7/site-packages/nltk/stem/porter.py", line 665, in stem
    stem = self._step1b(stem)
  File "/Users/peter.hurford/.virtualenvs/rex/lib/python2.7/site-packages/nltk/stem/porter.py", line 376, in _step1b
    lambda stem: (self._measure(stem) == 1 and
  File "/Users/peter.hurford/.virtualenvs/rex/lib/python2.7/site-packages/nltk/stem/porter.py", line 258, in _apply_rule_list
    if suffix == '*d' and self._ends_double_consonant(word):
  File "/Users/peter.hurford/.virtualenvs/rex/lib/python2.7/site-packages/nltk/stem/porter.py", line 214, in _ends_double_consonant
    word[-1] == word[-2] and
IndexError: string index out of range
Installed with:
pip install nltk
python -m nltk.downloader -d
Version:
import nltk
nltk.__version__ # '3.2.2'
Duplicate of https://github.com/nltk/nltk/issues/1581
This bug was introduced in version 3.2.2 and has been fixed in master. You can use develop or version 3.2.1 to get rid of the bug.
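If pinning to 3.2.1 or installing from develop is not an option right away, a small guard around the stemmer call can keep a pipeline running in the meantime. The following is only a sketch: `safe_stem` and `is_affected_version` are hypothetical helper names, not part of NLTK, and falling back to the unstemmed word is an assumption about what is acceptable for your data.

```python
def is_affected_version(version):
    """Return True for the one release this thread reports as broken."""
    # Only 3.2.2 is named in the thread; other versions are not checked here.
    return version.strip() == "3.2.2"


def safe_stem(stem, word):
    """Call stem(word); return the word unchanged if it raises IndexError."""
    try:
        return stem(word)
    except IndexError:
        # The reported failure mode for inputs such as 'oing' or 'aed'.
        return word
```

With NLTK installed, this could be called as `safe_stem(PorterStemmer().stem, 'oing')` and gated on `is_affected_version(nltk.__version__)`. On a fixed release the wrapper simply passes the stemmer's result through.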
Close this as fixed?
So has the problem been resolved?
This should have been resolved by #1582.
>>> import nltk
>>> nltk.__version__
'3.2.5'
>>> from nltk import PorterStemmer
>>> porter = PorterStemmer()
>>> porter.stem('oing')
u'o'