Evalml: Elastic Net Classifier 추정기로 SHAP 테스트 실패

에 만든 2021년 05월 17일 · 4코멘트 · 출처: alteryx/evalml

이 PR 에서는 test_algoritthms.py 파일의 shap 테스트(test_shap)를 통과하기 위해 ENC의 초기화 매개변수를 변경합니다. init 매개변수를 alpha = 0.0001 및 l1_ratio=0.15 로 두면 shap을 계산하는 동안 ZeroDivisionError 가 발생했으며 이는 이것에 연결되었을 가능성

이 문제를 제출하여 테스트가 실패한 이유를 확인하고 이 오류를 방지할 수 있는 좋은 방법을 찾으십시오.

bug

출처

bchen1116

모든 4 댓글

나는 이것이 샤프 문제라고 생각한다. 토론을 위해 https://github.com/slundberg/shap/issues/2000 을 제출했습니다. 단기적으로는 다음과 같이 할 수 있다고 생각합니다.

선형 모델의 경우 KernelExplainer 에 link="identity" 사용
로짓 링크와 함께 LinearExplainer 사용, explainer = shap.LinearExplainer(classifier, X, link=shap.links.logit)

freddyaboulton 에 2021년 05월 18일

👍1

실패한 추가 테스트:

이 테스트를 evalml/tests/model_understanding_tests/prediction_explaination_tests/test_explainers.py 추가하면 메인에서 alpha=0.5, l1_ratio=0.5 를 사용 합니다 .

@pytest.mark.parametrize("estimator", ["Extra Trees Classifier", "Elastic Net Classifier"])
def test_elastic_net(estimator, fraud_100):
    pytest.importorskip('imblearn', reason='Skipping test because imblearn not installed')
    X, y = fraud_100
    pipeline = BinaryClassificationPipeline(component_graph=["Imputer", "One Hot Encoder", "DateTime Featurization Component", estimator])
    pipeline.fit(X=X, y=y)
    pipeline.predict(X)
    importance = explain_predictions(pipeline, X, y, indices_to_explain=[0], top_k_features=4)
    assert report['feature_names'].isnull().sum() == 0
    assert report['feature_values'].isnull().sum() == 0

테스트 실패:

alpha와 l1_ratio를 변경해도 여전히 실패합니다.

bchen1116 에 2021년 05월 18일

좋아 @bchen1116 , 가자.

chukarsten 에 2021년 05월 21일

😄1

이 PR로 마무리

bchen1116 에 2021년 06월 23일

이 페이지가 도움이 되었나요?

0 / 5 - 0 등급

Evalml: Elastic Net Classifier 추정기로 SHAP 테스트 실패

모든 4 댓글

관련 문제