>>> nltk.download("all")
[nltk_data] Error loading all: HTTP Error 405: Not allowed.
>>> nltk.version_info
sys.version_info(major=3, minor=5, micro=2, releaselevel='final', serial=0)
I also tried to access https://raw.githubusercontent.com/nltk/nltk_data/gh-pages/packages/corpora/cmudict.zip and got the same HTTP 405 error.
I found the same problem on Stack Overflow: https://stackoverflow.com/questions/45318066/getting-405-while-trying-to-download-nltk-dta
Any comments would be appreciated.
It looks like GitHub is down or is blocking access to the raw content in the repository.
Meanwhile, a temporary solution is something like this:
PATH_TO_NLTK_DATA=/home/username/nltk_data/
wget https://github.com/nltk/nltk_data/archive/gh-pages.zip
unzip gh-pages.zip
mv nltk_data-gh-pages/ $PATH_TO_NLTK_DATA
For now, downloading gh-pages.zip and replacing the nltk_data directory is the working solution.
Until we find another channel for distributing nltk_data, please use the solution above.
Strangely, this only seems to affect the nltk user account. It works fine on a fork: https:
Doing this would also work:
@alvations Thank you very much!
Is there an alternative for command-line downloads like this one?
python -m nltk.downloader -d ./nltk_data punkt
@plaihonen You should be able to use this alternative index by doing something like:
python -m nltk.downloader -u https://pastebin.com/raw/D3TBY4Mj punkt
@rvause Works perfectly. Thank you!
+1. This was a surprise that cost a few hours this morning. For now I have bypassed the nltk download entirely.
GitHub is currently blocking access because "a user is consuming a very large amount of bandwidth requesting files". They have also suggested that we look at a different way of distributing the data packages, such as S3.
Even with the alternative index, has anyone noticed that some packages still do not work?
Specifically, for me the stopwords package gives a 405, but the other packages (brown, wordnet, punkt, etc.) do not.
Yes, I can't download the nltk stopwords either. I get a 405 error when I run > python -m nltk.downloader -u http://nltk.github.com/nltk_data/
Hey, I am trying to run python -m nltk.downloader stopwords but I get a 405 error. Can someone point me in the right direction?
@dfridman1 @prakruthi-karuna Please read the issue above. The workaround is:
python -m nltk.downloader -u https://pastebin.com/raw/D3TBY4Mj all
We have several projects that use this in our CI system. Rather than updating all of them with the -u parameter, is there another way to specify this data? Maybe an environment variable or a config file?
@alvations Your solution appears to have stopped working because the forked version is now forbidden as well. Is anyone currently in contact with GitHub support about this?
>>> import nltk
>>> dler = nltk.downloader.Downloader('https://pastebin.com/raw/D3TBY4Mj')
>>> dler.download('punkt')
[nltk_data] Downloading package punkt to /home/zeryx/nltk_data...
[nltk_data] Error downloading u'punkt' from
[nltk_data] <https://raw.githubusercontent.com/alvations/nltk_data
[nltk_data] /gh-pages/packages/tokenizers/punkt.zip>: HTTP Error
[nltk_data] 403: Forbidden.
False
I have just opened a ticket through their contact page.
GitHub seems to be aware of the problem and working on it. Here is what they told me:
"Sorry for the trouble. We had to block requests to raw.githubusercontent.com URLs for the nltk/nltk_data repo and its forks, because excessive usage was causing problems with the GitHub service. We're working to resolve the issue, but unfortunately we can't allow those requests at the moment."
Yes, I received this as well:
"Hi Liling,
I work on the Support Team at GitHub, and I wanted to let you know that we have had to temporarily block access to files served from raw.githubusercontent.com URLs for the alvations/nltk_data repo. Currently, a user is consuming a very large amount of bandwidth requesting files from that repo, and our only option at this time is to block all requests. We are actively working on ways to mitigate the problem and will follow up when we have an update. Please let us know if you have any questions.
Cheers, Shawna"
@zxiiro See https://stackoverflow.com/questions/3522372/how-to-config-nltk-data-directory-from-code
@ewan-klein nltk.downloader.py will need some rework.
Some suggestions:
It seems we have no choice but to change the data distribution channel.
"Hi Liling,
I wanted to follow up with some additional information. We have been discussing the issue internally, and it is very likely that we won't restore raw access to repos in the nltk/nltk_data fork network for the foreseeable future. The problem is that there are a number of machines calling nltk.download() at a very high frequency. We can't restore raw access until that activity stops. Feel free to share this message with the nltk community. We are hoping that whoever is doing this will be alerted to the problem and stop whatever process is doing it.
Cheers, Jamie"
You would think they could just block those IPs specifically. But perhaps there is more to it than that.
I do have a Docker image that downloads nltk_data, but I was not rebuilding it frequently. I hope I wasn't one of those high-traffic users...
Is there an installation process that does not depend on GitHub?
Perhaps someone misconfigured a script on AWS. @everyone, please help by checking your instances while we look for an alternative way to distribute the data.
"Hi Liling,
I can't share specific numbers, but the requests are coming from a large number of AWS instances. We suspect it may be a script or build process gone wrong. We don't know much beyond that.
Cheers, Jamie"
That's a relief, I don't use AWS.
(relieved)
On the code side, we may also need to change how often the same package gets updated from nltk downloader.py. Otherwise, whichever distribution channel we migrate to, the same service disruption will happen.
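One way to reduce repeated fetches on the caller's side is to download a package only when it is not already installed. The sketch below illustrates that pattern; `ensure_resource` is a hypothetical helper, not part of NLTK, and it takes the lookup and download functions as arguments so that it can be exercised without network access. It relies on `nltk.data.find` raising `LookupError` for a missing resource.

```python
from typing import Callable


def ensure_resource(
    resource_path: str,
    package_id: str,
    find: Callable[[str], object],
    download: Callable[[str], object],
) -> bool:
    """Download package_id only if resource_path is not installed yet.

    Returns True if a download was triggered, False if a local copy was found.
    """
    try:
        find(resource_path)  # expected to raise LookupError when missing
        return False
    except LookupError:
        download(package_id)
        return True


# Usage with NLTK (requires nltk to be installed):
#   import nltk
#   ensure_resource("tokenizers/punkt", "punkt", nltk.data.find, nltk.download)
```

A build script that calls this instead of an unconditional nltk.download() would only hit the server on the first run of a fresh environment.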
Maybe something torrent-based would work?
I don't know what the license is like, but you could make it public on S3: https:
@alvations I had to move the packages under the /home/username/nltk_data/ folder.
export PATH_TO_NLTK_DATA=/home/username/nltk_data/
wget https://github.com/nltk/nltk_data/archive/gh-pages.zip
unzip gh-pages.zip
mv nltk_data-gh-pages $PATH_TO_NLTK_DATA
# add below code
mv $PATH_TO_NLTK_DATA/nltk_data-gh-pages/packages/* $PATH_TO_NLTK_DATA/
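The same "move packages/* up one level" step can be sketched in Python for environments without wget and unzip. This is an illustration under assumptions: `extract_packages` is a hypothetical helper, and the default prefix assumes the archive unpacks to nltk_data-gh-pages/packages/ as in the shell commands above.

```python
import zipfile
from pathlib import Path, PurePosixPath


def extract_packages(archive_path, target_dir,
                     prefix="nltk_data-gh-pages/packages/"):
    """Extract files under `prefix` from the zip into target_dir.

    The prefix is stripped, so .../packages/tokenizers/punkt.zip ends up
    at target_dir/tokenizers/punkt.zip. Returns the list of written paths.
    """
    target = Path(target_dir)
    written = []
    with zipfile.ZipFile(archive_path) as archive:
        for info in archive.infolist():
            if info.is_dir() or not info.filename.startswith(prefix):
                continue
            relative = PurePosixPath(info.filename[len(prefix):])
            if relative.is_absolute() or ".." in relative.parts:
                continue  # skip entries that would escape target_dir
            destination = target.joinpath(*relative.parts)
            destination.parent.mkdir(parents=True, exist_ok=True)
            destination.write_bytes(archive.read(info))
            written.append(destination)
    return written


# Example (paths are placeholders):
#   extract_packages("gh-pages.zip", "/home/username/nltk_data")
```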
Do we have a temporary workaround yet?
@darshanlol @alvations mentioned a solution. If you are trying to build a Docker image, the following worked for me:
ENV PATH_TO_NLTK_DATA $HOME/nltk_data/
RUN apt-get -qq update
RUN apt-get -qq -y install wget
RUN wget https://github.com/nltk/nltk_data/archive/gh-pages.zip
RUN apt-get -y install unzip
RUN unzip gh-pages.zip -d $PATH_TO_NLTK_DATA
# add below code
RUN mv $PATH_TO_NLTK_DATA/nltk_data-gh-pages/packages/* $PATH_TO_NLTK_DATA/
I tried changing the default URL in 'nltk.downloader.py', but I still have the problem.
The suggested workaround no longer works:
python -m nltk.downloader -u https://pastebin.com/raw/D3TBY4Mj all
Currently, this is the only working solution:
PATH_TO_NLTK_DATA=/home/username/nltk_data/
wget https://github.com/nltk/nltk_data/archive/gh-pages.zip
unzip gh-pages.zip
mv nltk_data-gh-pages/ $PATH_TO_NLTK_DATA
As @alvations said, this is the only practical solution:
PATH_TO_NLTK_DATA=/home/username/nltk_data/
wget https://github.com/nltk/nltk_data/archive/gh-pages.zip
unzip gh-pages.zip
mv nltk_data-gh-pages/ $PATH_TO_NLTK_DATA
However, even after downloading everything, I still have problems because the NLTK downloader cannot detect all the downloaded packages. You may need to change the download directory value manually with a command.
Click the link above for the answer.
After some reading to find alternatives, here are some suggestions for solving this problem.
Change nltk_data so that it is pip-installable. (Every new environment would then need a fresh pip install, and we would no longer depend on a physical directory.) We would then need to overhaul the code in downloader.py and all the related corpus reader interfaces in some way.
Possibly the pip limits (on the PyPI side) could stop rogue users or machines that make high-frequency requests.
For this, we would only need to relink the links in index.xml to the appropriate links, after setting up the individual files on a web host.
However, if traffic stays high because some installation or automation script has gone wrong, we will just end up moving from one service provider to another.
Any other suggestions?
Is there a brave soul willing to take this on?
@harigovind511 Yes, you need to either place the downloaded nltk_data folder in one of the standard locations where nltk knows to look, or append to nltk.data.path to tell it where to look. The automatic downloader only looks in a standard location.
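As a small illustration of the second option, the sketch below appends a directory to a search-path list such as nltk.data.path. `add_search_dir` is a hypothetical helper and the directory name is a placeholder; the only NLTK detail assumed is that nltk.data.path is a plain list of directory strings.

```python
import os


def add_search_dir(search_path, data_dir):
    """Append data_dir to a list of search directories unless already present."""
    normalized = os.path.abspath(os.path.expanduser(data_dir))
    if normalized not in search_path:
        search_path.append(normalized)
    return search_path


# Usage with NLTK (requires nltk to be installed):
#   import nltk
#   add_search_dir(nltk.data.path, "/home/username/nltk_data")
```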
Rate limiting, or otherwise dealing with the rogue machines, is probably needed to keep this from rearing its head again. My vote would be for pip, unless there is a problem (or taboo) with large packages there?
Using pip would also take care of manual nltk.download() calls and in-code package management.
It looks like the files are back up? Still, it seems wise to keep exploring alternative distribution mechanisms. In my own organization, we plan to move to internal hosting and check in quarterly.
I would like to understand what $PATH_TO_NLTK_DATA does. Does it configure an alternative local download URL for where NLTK gets its data?
I want to set up a local cache of the NLTK data, so I was wondering whether setting this makes NLTK work offline.
Since the root of the problem is bandwidth abuse, fetching the whole of nltk_data as a workaround seems like a poor idea. @alvations, could you show how resource IDs map to URLs, so that I can wget just the punkt bundle?
The long-term solution, I believe, is to make it less easy for beginning users to fetch the entire data bundle (638 MB compressed when I checked). Instead of arranging (and paying for) more bandwidth to waste on pointless downloads, stop offering "all" as a download option. The documentation should instead show the inattentive scripter how to download the specific resources they need. In the meantime, get out of the habit of writing nltk.download("all") (or the equivalent) as sample or recommended usage, both on Stack Overflow (I'm looking at you, @alvations) and in the downloader docstrings. (For exploring nltk, nltk.download("book") rather than "all" is just as useful and much smaller.)
At present it is difficult to figure out which resource needs to be downloaded. If you install nltk and try nltk.pos_tag(["hello", "friend"]), there is no way to map the error message to a resource ID that you can pass to nltk.download(<resource id>). In such cases, downloading everything is the obvious workaround. If nltk.data.load() or nltk.data.find() could be patched to look up the resource ID in such cases, I think nltk_data usage would drop significantly in the long term.
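To make the resource-ID question concrete, here is a minimal sketch that reads an index document and builds an ID-to-URL table. It assumes the index lists `package` elements carrying `id` and `url` attributes; the sample document and its example.invalid URLs are made up for illustration, and the layout should be checked against the real index.xml before relying on it.

```python
import xml.etree.ElementTree as ET

# Illustrative sample only: element and attribute names are an assumption
# about the index layout, and the URLs are placeholders.
SAMPLE_INDEX = """
<nltk_data>
  <packages>
    <package id="punkt" subdir="tokenizers"
             url="https://example.invalid/packages/tokenizers/punkt.zip"/>
    <package id="stopwords" subdir="corpora"
             url="https://example.invalid/packages/corpora/stopwords.zip"/>
  </packages>
</nltk_data>
"""


def package_urls(index_xml: str) -> dict:
    """Map each package id in the index document to its download URL."""
    root = ET.fromstring(index_xml)
    return {
        package.get("id"): package.get("url")
        for package in root.iter("package")
        if package.get("id") and package.get("url")
    }


urls = package_urls(SAMPLE_INDEX)
print(urls["punkt"])
```

With such a table, a script could fetch a single bundle instead of the whole tree.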
@zxiiro $PATH_TO_NLTK_DATA has no meaning to nltk; it is just a variable in the sample script. The environment variable $NLTK_DATA does have a special meaning. See http://www.nltk.org/data.html
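A sketch of setting that variable from Python follows. `configure_nltk_data_env` is a hypothetical helper and the directory is a placeholder. It assumes, as I understand NLTK's behavior, that NLTK_DATA is read when nltk is imported and may hold several directories separated by the platform's path separator, so it should be set before the import.

```python
import os


def configure_nltk_data_env(directories, environ=os.environ):
    """Set NLTK_DATA to the given directories, joined with os.pathsep."""
    value = os.pathsep.join(os.path.abspath(d) for d in directories)
    environ["NLTK_DATA"] = value
    return value


# Usage (call before `import nltk`; the path is a placeholder):
#   configure_nltk_data_env(["/opt/shared/nltk_data"])
#   import nltk
```

In a CI system the same effect can be had by exporting NLTK_DATA in the job environment, with no code change.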
@alexisdimi Agreed on nltk.download('all'). Sorry, that was a very old answer from my early days. I should have advised against it. I have changed the SO answer to nltk.download('popular') instead: https:
One problem with using wget directly on a package is that it still depends on the raw content on GitHub. During the downtime, the link https://github.com/nltk/nltk_data/blob/gh-pages/packages/tokenizers/punkt.zip also led to the 403/405 error.
So the workaround was to download the whole git tree. In retrospect, that may not have been a good idea.
It looks like the lockout has been lifted, which is great! Now I hope there will be some tickets aimed at preventing similar problems in the future (maybe along the lines I suggested, maybe not).
(By the way, should this issue be marked as "closed" now that downloads are working again?)
@alexisdimi Showing warnings that suggest users download the appropriate models is a good idea.
For those running NLTK in a CI environment, I would like to propose GH-1795, which allows an alternative download URL to be specified. The idea is that you can set up a local copy of nltk_data on a web server (or even python -m http.server) and then have a global variable that overrides the download URL.
This lets a CI system such as Jenkins override the URL without changing each project's local command invocations to include -u.
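The local-mirror idea can be sketched with the standard library alone. The helper below serves a directory over HTTP on the loopback interface and returns the URL of its index.xml; `serve_directory` is a hypothetical name, and the directory is assumed to contain a copy of the nltk_data index and packages. Note that the package URLs inside that index.xml would also have to point at the mirror, since the downloader follows them (the 403 log earlier in this thread shows the alternative index still pointing at raw.githubusercontent.com).

```python
import functools
import http.server
import threading


def serve_directory(directory, port=0):
    """Serve `directory` over HTTP in a background thread.

    Returns (server, index_url). Port 0 lets the OS pick a free port.
    """
    handler = functools.partial(
        http.server.SimpleHTTPRequestHandler, directory=directory
    )
    server = http.server.ThreadingHTTPServer(("127.0.0.1", port), handler)
    thread = threading.Thread(target=server.serve_forever, daemon=True)
    thread.start()
    index_url = "http://127.0.0.1:%d/index.xml" % server.server_address[1]
    return server, index_url


# Usage with NLTK (requires nltk; the mirror path is a placeholder):
#   server, index_url = serve_directory("/srv/nltk_data_mirror")
#   import nltk
#   nltk.downloader.Downloader(index_url).download("punkt")
#   server.shutdown()
```

The Downloader call mirrors the one shown earlier in the thread, with the pastebin index replaced by the local URL.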
Question to GitHub about pip data distribution using releases and pip install:
"Thanks for the support, Jamie!
We are looking at alternatives for hosting nltk_data, and one of them is to host it as repository releases, the way spaCy does: https://github.com/explosion/spacy-models/releases
Could you confirm whether the same block would be applied if similar high-frequency requests were made to repository releases? Or are repository releases handled differently from raw content on GitHub?
Regards,
Liling"
Some updates from the GitHub side:
"Hi Liling,
Using Releases just moves the requests to a different part of our infrastructure. If that volume of bandwidth were to start up again, we would still have to block those requests, even if they were to Releases.
We have tried to think of ways the data packages could remain on GitHub, but honestly there is no good solution. We are just not set up to be a high-volume CDN.
Cheers,
Jamie"
@owaaa / @zxiiro +1 on internal hosting for CI. We are doing this now, and the advantage for EC2/S3 users is that you can place the data (or the subset you need) close to where your machines are built. If you span availability zones, you can simply replicate the bucket where you need it, which also makes you more robust against what is happening outside AWS.
@alvations I like spaCy's "data/model as package" idea a lot, but one consequence is that if you use virtualenv, your environment directories can balloon in size once the packages live inside them. Of course, this buys you fully isolated and auditable data/model versions, which is valuable for a project like spaCy whose models are updated frequently, but it is not a free lunch.