Nltk: https://raw.githubusercontent.com/nltk/nltk_data/gh-pages/index.xml์— ๋ฌธ์ œ๊ฐ€ ์žˆ์Šต๋‹ˆ๋‹ค.

์— ๋งŒ๋“  2017๋…„ 04์›” 14์ผ  ยท  5์ฝ”๋ฉ˜ํŠธ  ยท  ์ถœ์ฒ˜: nltk/nltk

xml ์˜จ๋ผ์ธ ์œ ํšจ์„ฑ ๊ฒ€์‚ฌ๊ธฐ์—์„œ ๋‹ค์Œ๊ณผ ๊ฐ™์ด ๋งํ•ฉ๋‹ˆ๋‹ค.

An error has been found! 
Click on  to jump to the error. In the document, you can point at  with your mouse to see the error message. 
Errors in file xml-schema: 
    23: 144 Attribute name "unzipped_size" associated with an element type "package" must be followed by the ' = ' character.

python3์—์„œ stopwords ๋ฅผ ๋‹ค์šด๋กœ๋“œํ•˜๋ ค๊ณ  ํ•  ๋•Œ

import nltk
nltk.download('stopwords')

์˜ค๋ฅ˜๊ฐ€ ๋ฐœ์ƒํ–ˆ์Šต๋‹ˆ๋‹ค

>>> import nltk
>>> nltk.download('stopwords')

Traceback (most recent call last):
  File "/usr/lib/python3.5/code.py", line 91, in runcode
    exec(code, self.locals)
  File "<input>", line 1, in <module>
  File "/usr/local/lib/python3.5/dist-packages/nltk/downloader.py", line 664, in download
    for msg in self.incr_download(info_or_id, download_dir, force):
  File "/usr/local/lib/python3.5/dist-packages/nltk/downloader.py", line 534, in incr_download
    try: info = self._info_or_id(info_or_id)
  File "/usr/local/lib/python3.5/dist-packages/nltk/downloader.py", line 508, in _info_or_id
    return self.info(info_or_id)
  File "/usr/local/lib/python3.5/dist-packages/nltk/downloader.py", line 875, in info
    self._update_index()
  File "/usr/local/lib/python3.5/dist-packages/nltk/downloader.py", line 825, in _update_index
    ElementTree.parse(compat.urlopen(self._url)).getroot())
  File "/usr/lib/python3.5/xml/etree/ElementTree.py", line 1184, in parse
    tree.parse(source, parser)
  File "/usr/lib/python3.5/xml/etree/ElementTree.py", line 596, in parse
    self._root = parser._parse_whole(source)
xml.etree.ElementTree.ParseError: not well-formed (invalid token): line 23, column 143

๋‹น์‹ ์˜ XML ์—์„œ
<package checksum="6f9c042774b96366c93fd0f9a9adb697" id="dolch" name="Dolch Word List" size="2116" subdir="corpora" unzip="1" unzipped_size"1917" url="https://en.wikipedia.org/wiki/Dolch_word_list" />

unzipped_size"1917"์€ unzipped_size="1917" ์ด์–ด์•ผ ํ•ฉ๋‹ˆ๋‹ค.
๋ˆ„๋ฝ๋œ ๋“ฑํ˜ธ

๊ฐ€์žฅ ์œ ์šฉํ•œ ๋Œ“๊ธ€

์ฃ„์†กํ•ฉ๋‹ˆ๋‹ค. nltk_data ์ชฝ์—์„œ ์ฝ”๋“œ๊ฐ€ ์†์ƒ๋˜์—ˆ์Šต๋‹ˆ๋‹ค. nltk/nltk_data#70์—์„œ ํŒจ์น˜ํ–ˆ์Šต๋‹ˆ๋‹ค.

๋ชจ๋“  5 ๋Œ“๊ธ€

๊ฐ™์€ ์˜ค๋ฅ˜์ž…๋‹ˆ๋‹ค. ์ด์ „ ๋ฒ„์ „์„ ๋‹ค์šด๋กœ๋“œํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๊นŒ?

@svfat 3.2.1 ๋ฒ„์ „์—์„œ ํ•ด๋‹น ์˜ค๋ฅ˜๋ฅผ ํฌ์ฐฉํ•˜๊ณ  3.2.2๋กœ ์—…๊ทธ๋ ˆ์ด๋“œํ–ˆ๋Š”๋ฐ ๋™์ผํ•œ ์˜ค๋ฅ˜๊ฐ€ ๋ฐœ์ƒํ–ˆ์Šต๋‹ˆ๋‹ค.

์ฃ„์†กํ•ฉ๋‹ˆ๋‹ค. nltk_data ์ชฝ์—์„œ ์ฝ”๋“œ๊ฐ€ ์†์ƒ๋˜์—ˆ์Šต๋‹ˆ๋‹ค. nltk/nltk_data#70์—์„œ ํŒจ์น˜ํ–ˆ์Šต๋‹ˆ๋‹ค.

@alvations ๋น ๋ฅธ ์ˆ˜์ • ๊ฐ์‚ฌํ•ฉ๋‹ˆ๋‹ค

@alvations tnx๋Š” ์ด์ œ ์ž‘๋™ํ•ฉ๋‹ˆ๋‹ค.
๋‹ค๊ฐ€์˜ค๋Š” ํœด์ผ

์ด ํŽ˜์ด์ง€๊ฐ€ ๋„์›€์ด ๋˜์—ˆ๋‚˜์š”?
0 / 5 - 0 ๋“ฑ๊ธ‰

๊ด€๋ จ ๋ฌธ์ œ

ndvbd picture ndvbd  ยท  4์ฝ”๋ฉ˜ํŠธ

alvations picture alvations  ยท  4์ฝ”๋ฉ˜ํŠธ

libingnan54321 picture libingnan54321  ยท  3์ฝ”๋ฉ˜ํŠธ

mwess picture mwess  ยท  5์ฝ”๋ฉ˜ํŠธ

DavidNemeskey picture DavidNemeskey  ยท  4์ฝ”๋ฉ˜ํŠธ