MetaProbe: A Representation- and Task-Agnostic Probe
Yichu Zhou and Vivek Srikumar
Proceedings of the 6th BlackboxNLP Workshop: Analyzing and Interpreting Neural Networks for NLP, 2023.

Abstract

Probing contextualized representations typically involves comparing task-specific model predictions against ground-truth linguistic labels. Although this methodology shows *what* information can be recovered by a classifier, it does not reveal *how* the classifier uses the representation to make its decision. To address the latter problem, we ask: Do task classifiers rely on representation- and task-independent geometric patterns in the embedding space? We explore this question by developing MetaProbe, an approach that uses geometric properties of representations to predict the behavior of task-specific classifiers (i.e., their predictions as opposed to the ground truth). Our experiments reveal the existence of universal geometric patterns across representations that can predict classifier behavior. This allows us to posit a geometric explanation for the impressive performance of contextualized representations.

Bib Entry

@inproceedings{zhou2023metaprobe,
  author = {Zhou, Yichu and Srikumar, Vivek},
  title = {{METAPROBE}: A Representation- and Task-Agnostic Probe},
  booktitle = {Proceedings of the 6th BlackboxNLP Workshop: Analyzing and Interpreting Neural Networks for NLP},
  year = {2023}
}