Can I use IntrospectiveRationaleExplainer to explain a pre-trained model?

Hello, I have a pre-trained model for text sentiment polarity classification, built roughly as RoBERTa + TextCNN. Can I use the IntrospectiveRationaleExplainer to interpret its output? My goal is to obtain the importance (contribution) of each word to the final predicted polarity.