I can't understand the Riemann approximation computation in this paper. #11

@1143long

The article says the Riemann approximation works best with m=20, but the scaled_input function in the code uses batch_size=16 and num_batch=4, and I don't quite understand how these values relate to m. Also, where is the code that multiplies the gradients by attention_weights to obtain the attribution scores? I'm a novice and still have some questions, so I hope you can help me. Thank you very much!
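To make the question concrete, here is a minimal sketch of how I understand a Riemann approximation of an integrated-gradient attribution. It is not the repository's code. It assumes a zero baseline, a right Riemann sum, and that the number of steps m equals batch_size * num_batch (which is only my guess about how the two parameters relate to m). The names scaled_inputs, riemann_attribution, toy_grad and WEIGHTS are hypothetical, and the toy function F(a) = sum(w_i * a_i^2) stands in for the real model so the gradient can be written by hand.

```python
def scaled_inputs(attn, num_points):
    """Points (k / num_points) * attn for k = 1..num_points (zero baseline)."""
    return [[a * k / num_points for a in attn] for k in range(1, num_points + 1)]


def riemann_attribution(attn, grad_fn, batch_size, num_batch):
    """Approximate attn_i * integral_0^1 dF/dA_i(alpha * attn) d(alpha)."""
    m = batch_size * num_batch  # ASSUMPTION: total number of Riemann steps
    points = scaled_inputs(attn, m)
    grad_sum = [0.0] * len(attn)
    for b in range(num_batch):
        # Process the interpolation points one chunk at a time.
        batch = points[b * batch_size:(b + 1) * batch_size]
        for point in batch:
            grad = grad_fn(point)
            grad_sum = [s + g for s, g in zip(grad_sum, grad)]
    # Multiply the averaged gradient by the attention weights themselves.
    return [a * s / m for a, s in zip(attn, grad_sum)]


# Hypothetical toy model: F(a) = sum(w_i * a_i**2), so dF/da_i = 2 * w_i * a_i.
WEIGHTS = [1.0, 2.0]


def toy_grad(point):
    return [2.0 * w * a for w, a in zip(WEIGHTS, point)]


attn = [0.5, 0.25]
approx_20 = riemann_attribution(attn, toy_grad, batch_size=5, num_batch=4)
approx_64 = riemann_attribution(attn, toy_grad, batch_size=16, num_batch=4)
exact = [w * a * a for w, a in zip(WEIGHTS, attn)]  # closed-form integral
print(approx_20, approx_64, exact)
```

In this sketch the final line of riemann_attribution is where the summed gradients are multiplied by the attention weights, which is the step I cannot locate in the actual code. For the toy function, the right Riemann sum overestimates the exact value by a factor of (m + 1) / m, so more steps give a closer estimate.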
