Pandas: DataFrame.describe๋Š” ๋ฐ์ดํ„ฐ ์„ธํŠธ์— nan์ด ํฌํ•จ๋œ ๊ฒฝ์šฐ ๋ฐฑ๋ถ„์œ„์ˆ˜๋ฅผ ๋ฐ˜ํ™˜ํ•  ์ˆ˜ ์—†์Šต๋‹ˆ๋‹ค.

์— ๋งŒ๋“  2016๋…„ 05์›” 09์ผ  ยท  3์ฝ”๋ฉ˜ํŠธ  ยท  ์ถœ์ฒ˜: pandas-dev/pandas

์ฝ”๋“œ ์ƒ˜ํ”Œ, ๊ฐ€๋Šฅํ•œ ๊ฒฝ์šฐ ๋ณต์‚ฌํ•˜์—ฌ ๋ถ™์—ฌ๋„ฃ๊ธฐ ๊ฐ€๋Šฅํ•œ ์˜ˆ

des_table = df_final_S1415.describe(๋ฐฑ๋ถ„์œ„์ˆ˜=[.05, .25, .5, .75, .95]).T

์˜ˆ์ƒ ์ถœ๋ ฅ

๋ฒ„์ „ 18.0์—์„œ describe ํ•จ์ˆ˜๋Š” ์—ด์— nan์ด ํฌํ•จ๋œ ๊ฒฝ์šฐ ๋ฐฑ๋ถ„์œ„์ˆ˜๋ฅผ ๋ฐ˜ํ™˜ํ•ฉ๋‹ˆ๋‹ค.

pd.show_versions() ์ถœ๋ ฅ

๊ทธ๋Ÿฌ๋‚˜ ๋ฒ„์ „ 18.1์—์„œ ์„ค๋ช… ํ•จ์ˆ˜๋Š” ์—ด์— nan์ด ํฌํ•จ๋œ ๊ฒฝ์šฐ ๋ฐฑ๋ถ„์œ„์ˆ˜๋ฅผ ๋ฐ˜ํ™˜ํ•˜์ง€ ์•Š์Šต๋‹ˆ๋‹ค.

Duplicate

๊ฐ€์žฅ ์œ ์šฉํ•œ ๋Œ“๊ธ€

๋‹ค์Œ์€ ์žฌํ˜„ ๊ฐ€๋Šฅํ•œ ์˜ˆ์ž…๋‹ˆ๋‹ค(์‹ค์ œ ๋ฌธ์ œ๋Š” quantile ๋ฉ”์„œ๋“œ์— ์žˆ์Œ).

In [24]: s = pd.Series(range(5))

In [25]: s.quantile(0.5)
Out[25]: 2.0

In [26]: s[0] = np.nan

In [27]: s.quantile(0.5)
Out[27]: nan

In [28]: pd.__version__
Out[28]: '0.18.1+20.gaf7bdd3'

๋ชจ๋“  3 ๋Œ“๊ธ€

@tade0726 ์žฌํ˜„ ๊ฐ€๋Šฅํ•œ ์˜ˆ๋ฅผ ๋ณด์—ฌ์ฃผ์‹œ๊ฒ ์Šต๋‹ˆ๊นŒ? (๋ฌธ์ œ๋ฅผ ๋ณด์—ฌ์ฃผ๋Š” ๋ฐ์ดํ„ฐ ํ”„๋ ˆ์ž„์„ ๊ตฌ์„ฑํ•˜๋Š” ์ผ๋ถ€ ์ฝ”๋“œ)

๋‹ค์Œ์€ ์žฌํ˜„ ๊ฐ€๋Šฅํ•œ ์˜ˆ์ž…๋‹ˆ๋‹ค(์‹ค์ œ ๋ฌธ์ œ๋Š” quantile ๋ฉ”์„œ๋“œ์— ์žˆ์Œ).

In [24]: s = pd.Series(range(5))

In [25]: s.quantile(0.5)
Out[25]: 2.0

In [26]: s[0] = np.nan

In [27]: s.quantile(0.5)
Out[27]: nan

In [28]: pd.__version__
Out[28]: '0.18.1+20.gaf7bdd3'

๊ทธ๋ฆฌ๊ณ  https://github.com/pydata/pandas/issues/13098 ์˜ ๋ณต์ œ๋ณธ์ž…๋‹ˆ๋‹ค.

์‹ ๊ณ ํ•ด ์ฃผ์…”์„œ ๊ฐ์‚ฌํ•ฉ๋‹ˆ๋‹ค.

์ด ํŽ˜์ด์ง€๊ฐ€ ๋„์›€์ด ๋˜์—ˆ๋‚˜์š”?
0 / 5 - 0 ๋“ฑ๊ธ‰