Postgres_load_dataã¹ã¯ãªãããå®è¡ãããšãæåã®3ã€ã®ããŒãã«ãèªã¿èŸŒãŸãããã®åŸã次ã®ã¡ãã»ãŒãžã衚瀺ãããŸãããã¡ã€ã«CHARTEVENTS.csvãçµ±èšã§ããŸããã§ããïŒäžæãªãšã©ãŒã 誰ãããã®ç¶æ³ã«ãããå©ããããšãã§ããŸããïŒ
ãããžã§ã¯ãã®ããŠã³ããŒãããŒãžã«ãããã§ãã¯ãµã ãã¡ã€ã«ã䜿çšããŠã chartevents.csv
ã³ããŒã®æŽåæ§ããã§ãã¯ããŸãããïŒ ãããããããŠã³ããŒããŸãã¯è§£åäžã«ç ŽæããŸããã
ã¯ããã³ãã³ãmd5 checksum_md5_zipped.txtã䜿çšããŸãããããã¹ãŠã®ããŒãã«ã§åé¡ãããŸãã...
ãŸããzipããŒã¿ãè©ŠããŠãpostgres_load_datascript_7zipãå®è¡ããŸããã ãã®å Žåã次ã®ããã«ãªããŸããããŒã¿ã«åŒçšç¬Šã§å²ãŸããŠããªãæ¹è¡ãèŠã€ãããŸããã ãã³ãïŒåŒçšç¬Šã§å²ãŸããCSVãã£ãŒã«ãã䜿çšããŠæ¹è¡ãè¡šããŸãã
md5 checksum_md5_unzipped.txtããã§ãã¯ããŸãããããã¹ãŠåé¡ãããŸããã
å®è¡ããŠããã¹ã¯ãªãããšäœ¿çšããŠããããŒã¿ã®éã«äžäžèŽãããããã«èãããŸãã ç§ã¯ç¢ºèªããŸãïŒ
.csv.gz
ãããŸã§ã¯ããã©ã«ãã»ããã¢ããã®ã¹ã¯ãªãŒã³ã·ã§ãããã·ã¹ãã æ å ±ãå®è¡ããæ£ç¢ºãªã³ãã³ããæ£ç¢ºãªãšã©ãŒã¡ãã»ãŒãžãªã©ã®æ å ±ããªããšããªã¢ãŒãã§ãããã°ããã®ã¯éåžžã«å°é£ã§ãã
ããã«ã¡ã¯ã
ãåçããããšãããããŸãã
è¿œå æ
å ±ãããããšããããã¯éåžžã«åœ¹ã«ç«ã¡ãŸãã ãã¡ã€ã«ããã©ã«ãã«ãªãã®ãšåããããç°¡åã ãšæããŸãã ãã©ã«ãC:/Users/Lejla/Desktop/MIMICIII
ã«CHARTEVENTS.csv
ãã¡ã€ã«ãããããšãå確èªã§ããŸããïŒ
ãã¹ãŠã®å§çž®ãã¡ã€ã«ãæœåºããããšãããããã£ãŒãã€ãã³ãã§å€±æããããã .csv.gz
ãã¡ã€ã«ãããªãå¯èœæ§ããããŸãïŒæœåºããããã¡ã€ã«ã33GBã§ãããã¹ããŒã¹ãäžè¶³ããŠããããšãåå ã§ããå¯èœæ§ããããŸãããã¡ã€ã«ã·ã¹ãã ã¯FAT32ïŒïŒïŒããŸãã¯èª°ãç¥ã£ãŠãããïŒã§ãã ãã®å ŽåãããŒãã¹ã¯ãªãããç·šéããŠã .csv.gz
ããçŽæ¥ããŒãããããšããå§ãããŸãã ããªãã¯çœ®ãæããããšã«ãã£ãŠãããè¡ãããšãã§ããŸãïŒ
\copy CHARTEVENTS from 'CHARTEVENTS.csv' delimiter ',' csv header NULL ''
ãš
\copy CHARTEVENTS from PROGRAM '7z e -so CHARTEVENTS.csv.gz' delimiter ',' csv header NULL ''
åçããããšãããããŸãã ä»åã¯zipãã¡ã€ã«ã䜿ã£ãŠã¹ã¯ãªãããå®è¡ããŠã¿ãŸããã ä»åã¯ä»ã®ãã®ãæã«å
¥ããŸãã
ã¡ãã»ãŒãž...ããããããã¯åœ¹ç«ã€ã§ãããã
ãã£ã¬ã¯ããªã®å 容ã衚瀺ããŠãããããã§ããïŒ
ç§ã¯æ°ã«ããŸãããããã«ç§ã®ãã©ã«ãã®å
容ããããŸã
ããŠã could not stat file "CHARTEVENTS.csv": Unknown error
ã¯å®éã«ã¯PostgreSQL 11ã®ãã°ã§ããå
éšã§ã¯ããã¡ã€ã«ããã£ã¬ã¯ããªã§ã¯ãªãããšã確èªããããã«fstat()
ãåŒã³åºããŸãããæ®å¿µãªããfstat()
ã¯charteventsã®ãããªå€§ããªãã¡ã€ã«ãåŠçã§ããªã32ãããããã°ã©ã ã PostgreSQL 10.5ã䜿çšããŠWindowsã§ãã«ãããã¹ãããŸãããããã®ãšã©ãŒã¯çºçããªãã£ããããããªãæ°ãããšæããŸãã
æåã®åé¿çã¯ããã¡ã€ã«ãå§çž®ãããŸãŸã«ãïŒã€ãŸãã .csv.gz
ãã¡ã€ã«ãšããŠä¿æãïŒã7zipã䜿çšããŠå§çž®ãã¡ã€ã«ããçŽæ¥ããŒã¿ãããŒãããããšã§ãã ãã¹ãã§ã¯ãããã¯ãŸã æ©èœããŠããããã«èŠããŸããã ãããè¡ãæ¹æ³ã«ã€ããŠã¯ã https ïŒ ãŸãã
äžèšã®ç°¡åãªããŒãžã§ã³ã§ã¯ã .csv.gz
ãã¡ã€ã«ãä¿æãã7zipãã€ããªãWindowsç°å¢ãã¹ã«è¿œå ããŠããã postgres_load_data_7zip.sql
ãã¡ã€ã«ãåŒã³åºããŠããŒã¿ãããŒãããŸãã ãã¹ãŠã®åŸã«postgres_checks.sql
ãã¡ã€ã«ã䜿çšããŠããã¹ãŠã®ããŒã¿ãæ£ããããŒãããããšã確èªã§ããŸãã
ç·šéïŒãã®7zipã¢ãããŒãã䜿çšããŠããåŸã®ãšã©ãŒã«ã€ããŠã¯ããªãããŒããããªãã®ãããããŸããã ADMISSIONS.csv.gzãã¡ã€ã«ã ããåããŠã³ããŒãããŠãåããšã©ãŒãçºçãããã©ããã確èªããŠãã ããã ãã¶ããã¹ã¯ãªãããäœããæŽæ°ããå¿ èŠããã7zipã®æ°ããããŒãžã§ã³ããããŸãïŒ
ããã«ã¡ã¯ã
詳现説æããããšãããããŸãã PostgreSQL 10.5ãã€ã³ã¹ããŒã«ããŸããããããã»ã¹ãå®è¡ãããŠããŸãã ãã¹ãŠã®ããŒãã«ãããŒãããã®ã«æéãããããšæããŸããããäžæãªãšã©ãŒãã¯çºçããªããªããŸããã å©ããŠãããŠããããšãã
çŽ æŽãããïŒ
ããŠã
could not stat file "CHARTEVENTS.csv": Unknown error
ã¯å®éã«ã¯PostgreSQL 11ã®ãã°ã§ããå éšã§ã¯ããã¡ã€ã«ããã£ã¬ã¯ããªã§ã¯ãªãããšã確èªããããã«fstat()
ãåŒã³åºããŸãããæ®å¿µãªããfstat()
ã¯charteventsã®ãããªå€§ããªãã¡ã€ã«ãåŠçã§ããªã32ãããããã°ã©ã ã PostgreSQL 10.5ã䜿çšããŠWindowsã§ãã«ãããã¹ãããŸãããããã®ãšã©ãŒã¯çºçããªãã£ããããããªãæ°ãããšæããŸããæåã®åé¿çã¯ããã¡ã€ã«ãå§çž®ãããŸãŸã«ãïŒã€ãŸãã
.csv.gz
ãã¡ã€ã«ãšããŠä¿æãïŒã7zipã䜿çšããŠå§çž®ãã¡ã€ã«ããçŽæ¥ããŒã¿ãããŒãããããšã§ãã ãã¹ãã§ã¯ãããã¯ãŸã æ©èœããŠããããã«èŠããŸããã ãããè¡ãæ¹æ³ã«ã€ããŠã¯ã https ïŒ ãŸããäžèšã®ç°¡åãªããŒãžã§ã³ã§ã¯ã
.csv.gz
ãã¡ã€ã«ãä¿æãã7zipãã€ããªãWindowsç°å¢ãã¹ã«è¿œå ããŠãããpostgres_load_data_7zip.sql
ãã¡ã€ã«ãåŒã³åºããŠããŒã¿ãããŒãããŸãã ãã¹ãŠã®åŸã«postgres_checks.sql
ãã¡ã€ã«ã䜿çšããŠããã¹ãŠã®ããŒã¿ãæ£ããããŒãããããšã確èªã§ããŸããç·šéïŒãã®7zipã¢ãããŒãã䜿çšããŠããåŸã®ãšã©ãŒã«ã€ããŠã¯ããªãããŒããããªãã®ãããããŸããã ADMISSIONS.csv.gzãã¡ã€ã«ã ããåããŠã³ããŒãããŠãåããšã©ãŒãçºçãããã©ããã確èªããŠãã ããã ãã¶ããã¹ã¯ãªãããäœããæŽæ°ããå¿ èŠããã7zipã®æ°ããããŒãžã§ã³ããããŸãïŒ
PostgreSQL10.11ã䜿çšããããšã¯ç§ãå©ããŸãã...ããããšã
è¿œå æ å ±ãããããšããããã¯éåžžã«åœ¹ã«ç«ã¡ãŸãã ãã¡ã€ã«ããã©ã«ãã«ãªãã®ãšåããããç°¡åã ãšæããŸãã ãã©ã«ã
C:/Users/Lejla/Desktop/MIMICIII
ã«CHARTEVENTS.csv
ãã¡ã€ã«ãããããšãå確èªã§ããŸããïŒãã¹ãŠã®å§çž®ãã¡ã€ã«ãæœåºããããšãããããã£ãŒãã€ãã³ãã§å€±æããããã
.csv.gz
ãã¡ã€ã«ãããªãå¯èœæ§ããããŸãïŒæœåºããããã¡ã€ã«ã33GBã§ãããã¹ããŒã¹ãäžè¶³ããŠããããšãåå ã§ããå¯èœæ§ããããŸãããã¡ã€ã«ã·ã¹ãã ã¯FAT32ïŒïŒïŒããŸãã¯èª°ãç¥ã£ãŠãããïŒã§ãã ãã®å ŽåãããŒãã¹ã¯ãªãããç·šéããŠã.csv.gz
ããçŽæ¥ããŒãããããšããå§ãããŸãã ããªãã¯çœ®ãæããããšã«ãã£ãŠãããè¡ãããšãã§ããŸãïŒ
\copy CHARTEVENTS from 'CHARTEVENTS.csv' delimiter ',' csv header NULL ''
ãš
\copy CHARTEVENTS from PROGRAM '7z e -so CHARTEVENTS.csv.gz' delimiter ',' csv header NULL ''
ãããã§ãããã¯ç§ã®ããã«åããïŒ
\ copy my_table_name from program'cmd / c type input_data.csv 'delimiter'ã 'csv header;
11GBãµã€ãºã®ãããªinput_data.csvã
ã倧ããªãã¡ã€ã«ãã³ããŒã§ããªãããšããåé¡ã¯ã11ããŒãžã§ã³ãš12ããŒãžã§ã³ã§çºçããŸãã ãããã10ã®å Žåã¯åé¡ãããŸããã ããŒã¿ãã¡ã€ã«ãå§çž®ããã«ãªãŒããŒã©ã€ãããæ¹æ³ã§ãããPostgresqlããã°ã©ã ãã¡ã€ã«ãv.10ããv 11ããã³12ã«ã¢ãããµãŒã/ã¹ã¯ããããæ¹æ³ã¯ãããŸããïŒ
åé¿çïŒ
ããã°ã©ã 'cmd / c "type xïŒ\ pathto \ file.txt"'ããtïŒcãdïŒãïŒããã¹ã圢åŒïŒã§ã³ããŒããŸãã
-ç§ã®ããŒãºã«ã¯ããªãé
ãã§ãã ããã©ã«ãã®ã³ããŒã³ãã³ãã®é床ãå¿
èŠã§ã
ä»ã®ã³ãã³ãã©ã€ã³ããŒã«ã䜿çšããŠãã¡ã€ã«ãè€æ°ã®ãã¡ã€ã«ã«åå²ããåã
ã®ãã¡ã€ã«ãäžåºŠã«1ã€ãã€ããŒãããããšãæ€èšã§ããŸãã UNIXã·ã¹ãã ã§ã¯ãããã¯split
ã䜿çšããŠå®è¡ã§ããWindowsçšã®GNUcoreutilsãã€ã³ã¹ããŒã«ããŠäœ¿çšã§ããŸãã
ç§ã¯ããªããšåãåé¡ã«ééãããšæããŸãããç§ã¯éåžžã«æ°ããããŒãžã§ã³12ã䜿çšããŠããŸããããã解決ããæ¹æ³ã¯ãããŸããïŒ å§çž®ãã¡ã€ã«ã䜿çšããŸããïŒ
ã¯ããæ£ããæãåºãã°ãå§çž®ãã¡ã€ã«ã¯4 GBæªæºã§ãããå§çž®ããŒãã¹ã¯ãªããïŒ7zãŸãã¯gzipïŒã䜿çšããŠãã®ãšã©ãŒãåé¿ã§ããŸãã
OKãä»ãããã®æ¹æ³ãè©ŠããŠã¿ãŸãããè¿ä¿¡ããããšãããããŸãã
ãããã£ãŠãå§çž®ãŸãã¯åå²ããŸã£ãã䜿çšããã«åé¿çã¯ãããŸãããïŒ 11ã12ãšã³ãžã³çšã®Postgresqlã®COPYã³ãã³ãã®10ããŒãžã§ã³ã®äœ¿çšïŒ
ç§ã瀺ããããã«ïŒ
ããã©ã«ãã®ã³ããŒã³ãã³ãã®é床ãå¿
èŠã§ããã倧ããªãã¡ã€ã«+12ã®ããŒãžã§ã³ã®å Žå
ããã¯ç§ã®ããŒãºã«ãšã£ãŠäžå¯æ¬ ã§ãã
PostgreSQLã¯ãªãŒãã³ãœãŒã¹ãªã®ã§ãèªåã§ä¿®æ£ãè©Šã¿ãŠè²¢ç®ããããšãæè¿ããŸã:)
é¢é£ãããã£ã¹ã«ãã·ã§ã³ã¯æ¬¡ã®ãšããã§ãïŒ https ïŒ
ãã以å€ã®å Žåã¯ããã®ã¹ã¬ããã§ææ¡ãããŠãã3ã€ã®åé¿çããããŸãïŒããŒãžã§ã³ã®å€æŽãå§çž®ãã¡ã€ã«ã®äœ¿çšããã¡ã€ã«ã®è€æ°ã®éšåãžã®åå²ïŒã ä»ã«ãåé¿çããããšæããŸãã
vã10ã®COPYæ©èœã®ã³ãŒãã®åäœéšåã11ãš12ã«ç§»è¡ããã®ã¯æããã§ã¯ãããŸãããïŒ ãŸãã¯ãããŒãã³ãŒãã£ã³ã°ãããŠããããããã¹ãŠã®ãŠãŒã¶ãŒãã¯ã©ãã·ã¥ããŸããïŒ :)
@ghYuraããã¯ã³ãã¥ããã£ã管çãããªãœãŒã¹ã§ãããããã³ãŒãããŒã¹ãæ¹åããããã®ææ¡ãããå Žåã¯ããã«ãªã¯ãšã¹ããäœæããããšããå§ãããŸãã
12.XããŒãžã§ã³ãš13.XããŒãžã§ã³ã®äž¡æ¹ã§CSVãããŒãã«ã«ããŒãããŠãããšãã«ãšã©ãŒãçºçããŸããããPostgreSQLããŒãžã§ã³10.15ã§ã¯é åã®ããã«æ©èœããŸãã ã¿ããªå©ããŠãããŠããããšã:)
æãåèã«ãªãã³ã¡ã³ã
ããŠã
could not stat file "CHARTEVENTS.csv": Unknown error
ã¯å®éã«ã¯PostgreSQL 11ã®ãã°ã§ããå éšã§ã¯ããã¡ã€ã«ããã£ã¬ã¯ããªã§ã¯ãªãããšã確èªããããã«fstat()
ãåŒã³åºããŸãããæ®å¿µãªããfstat()
ã¯charteventsã®ãããªå€§ããªãã¡ã€ã«ãåŠçã§ããªã32ãããããã°ã©ã ã PostgreSQL 10.5ã䜿çšããŠWindowsã§ãã«ãããã¹ãããŸãããããã®ãšã©ãŒã¯çºçããªãã£ããããããªãæ°ãããšæããŸããæåã®åé¿çã¯ããã¡ã€ã«ãå§çž®ãããŸãŸã«ãïŒã€ãŸãã
.csv.gz
ãã¡ã€ã«ãšããŠä¿æãïŒã7zipã䜿çšããŠå§çž®ãã¡ã€ã«ããçŽæ¥ããŒã¿ãããŒãããããšã§ãã ãã¹ãã§ã¯ãããã¯ãŸã æ©èœããŠããããã«èŠããŸããã ãããè¡ãæ¹æ³ã«ã€ããŠã¯ã https ïŒ ãŸããäžèšã®ç°¡åãªããŒãžã§ã³ã§ã¯ã
.csv.gz
ãã¡ã€ã«ãä¿æãã7zipãã€ããªãWindowsç°å¢ãã¹ã«è¿œå ããŠãããpostgres_load_data_7zip.sql
ãã¡ã€ã«ãåŒã³åºããŠããŒã¿ãããŒãããŸãã ãã¹ãŠã®åŸã«postgres_checks.sql
ãã¡ã€ã«ã䜿çšããŠããã¹ãŠã®ããŒã¿ãæ£ããããŒãããããšã確èªã§ããŸããç·šéïŒãã®7zipã¢ãããŒãã䜿çšããŠããåŸã®ãšã©ãŒã«ã€ããŠã¯ããªãããŒããããªãã®ãããããŸããã ADMISSIONS.csv.gzãã¡ã€ã«ã ããåããŠã³ããŒãããŠãåããšã©ãŒãçºçãããã©ããã確èªããŠãã ããã ãã¶ããã¹ã¯ãªãããäœããæŽæ°ããå¿ èŠããã7zipã®æ°ããããŒãžã§ã³ããããŸãïŒ