Xgboost: ์ •๊ทœํ™”๊ฐ€ ํ•„์š”ํ•œ๊ฐ€์š”?

์— ๋งŒ๋“  2015๋…„ 06์›” 17์ผ  ยท  3์ฝ”๋ฉ˜ํŠธ  ยท  ์ถœ์ฒ˜: dmlc/xgboost

xgboost๊ฐ€ ์ด๋ก ์ ์œผ๋กœ ์–ด๋–ป๊ฒŒ ์ž‘๋™ํ•˜๋Š”์ง€ ์ž˜ ๋ชจ๋ฅด๊ฒ ์Šต๋‹ˆ๋‹ค. ๊ทธ๋Ÿฌ๋‚˜ xgboost๋Š” ํŠธ๋ฆฌ ๊ธฐ๋ฐ˜ ๋ถ„๋ฅ˜๊ธฐ์ด๋ฏ€๋กœ ๊ธฐ๋Šฅ์˜ ์ •๊ทœํ™”๊ฐ€ ํ•„์š”ํ•˜์ง€ ์•Š๋‹ค๊ณ  ๊ฐ€์ •ํ•ด๋„ ๋ ๊นŒ์š”?

๊ฐ€์žฅ ์œ ์šฉํ•œ ๋Œ“๊ธ€

์•„๋‹ˆ์š” ๊ธฐ๋Šฅ์„ ์ •๊ทœํ™”ํ•  ํ•„์š”๊ฐ€ ์—†์Šต๋‹ˆ๋‹ค.

๋ชจ๋“  3 ๋Œ“๊ธ€

์•„๋‹ˆ์š” ๊ธฐ๋Šฅ์„ ์ •๊ทœํ™”ํ•  ํ•„์š”๊ฐ€ ์—†์Šต๋‹ˆ๋‹ค.

์›์น™์ ์œผ๋กœ ๋‚˜๋ฌด๋ฅผ ๋ถ€์ŠคํŠธํ•  ๋•Œ ์ •๊ทœํ™”ํ•  ํ•„์š”๊ฐ€ ์—†๋‹ค๋Š” ๊ฒƒ์„ ์ดํ•ดํ•˜๊ณ  ์žˆ๋‹ค๊ณ  ์ƒ๊ฐํ•ฉ๋‹ˆ๋‹ค.

๊ทธ๋Ÿฌ๋‚˜ ํŠนํžˆ ' reg:gamma '๋ฅผ ์‚ฌ์šฉํ•˜์—ฌ ๋Œ€์ƒ y๋ฅผ ์กฐ์ •ํ•  ๋•Œ ์ƒ๋‹นํ•œ ์˜ํ–ฅ์„ ๋ณผ ์ˆ˜ ์žˆ์ง€๋งŒ ' reg:linear '(๊ธฐ๋ณธ๊ฐ’)์— ๋Œ€ํ•ด์„œ๋„ (๋œ ์ •๋„) ์˜ํ–ฅ์„ ๋ฏธ์นฉ๋‹ˆ๋‹ค. ๊ทธ ์ด์œ ๋Š” ๋ฌด์—‡์ž…๋‹ˆ๊นŒ?

Boston Housing ๋ฐ์ดํ„ฐ ์„ธํŠธ์˜ ์˜ˆ:

import numpy as np
import xgboost as xgb
from sklearn.metrics import mean_squared_error
from sklearn.datasets import load_boston

boston = load_boston()
y = boston['target']
X = boston['data']

for scale in np.logspace(-6, 6, 7):
    xgb_model = xgb.XGBRegressor().fit(X, y / scale)
    predictions = xgb_model.predict(X) * scale
    print('{} (scale={})'.format(mean_squared_error(y, predictions), scale))

2.3432734454908335(์ถ•์ฒ™=1e-06)
2.343273977065266(์ถ•์ฒ™=0.0001)
2.3432793874455315(์ถ•์ฒ™=0.01)
2.290595204136888(์ถ•์ฒ™=1.0)
2.528513393507719(์ถ•์ฒ™=100.0)
7.228978353091473(์ถ•์ฒ™=10000.0)
272.29640759874474(์ถ•์ฒ™=1000000.0)

' reg:gamma '๋ฅผ ์‚ฌ์šฉํ•  ๋•Œ y ์Šค์ผ€์ผ๋ง์˜ ์˜ํ–ฅ์€ ์ •๋ง ํฝ๋‹ˆ๋‹ค.

for scale in np.logspace(-6, 6, 7):
    xgb_model = xgb.XGBRegressor(objective='reg:gamma').fit(X, y / scale)
    predictions = xgb_model.predict(X) * scale
    print('{} (scale={})'.format(mean_squared_error(y, predictions), scale))

591.6509503519147(์ถ•์ฒ™=1e-06)
545.8298971540023(์ถ•์ฒ™=0.0001)
37.68688286293508(์ถ•์ฒ™=0.01)
4.039819858716935(์Šค์ผ€์ผ=1.0)
2.505477263590776(์ถ•์ฒ™=100.0)
198.94093800190453(์ถ•์ฒ™=10000.0)
592.1469169959003(์ถ•์ฒ™=1000000.0)

@tqchen Boosted Trees์— ๋Œ€ํ•œ ํ›Œ๋ฅญํ•œ

์ด ํŽ˜์ด์ง€๊ฐ€ ๋„์›€์ด ๋˜์—ˆ๋‚˜์š”?
0 / 5 - 0 ๋“ฑ๊ธ‰