Ditto is a solution to the Entity Resolution (ER) problem based on pre-trained language models. When analyzing this model, we observed that some of Ditto's predictions are incorrect and that it is unclear why the model made a particular choice. To address this issue, we considered techniques from Explainable AI.
CERTA provides explanations for the predictions of ER systems, both saliency-based and counterfactual. Building on it, we adopted a data augmentation strategy that uses counterfactual explanations (CEs) to improve the accuracy of Ditto's predictions. A counterfactual explanation consists of input samples, in our case pairs of tuples, whose changes flip a prediction into a desired outcome.
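The following is a minimal illustration of this idea on a made-up record pair (not taken from our datasets): the counterfactual is a perturbed copy of one tuple that would flip the model's decision to the desired label.

```python
# Hypothetical product records; attribute names and values are illustrative only.
left = {"title": "iPhone 12 64GB black", "brand": "Apple", "price": "729.00"}
right = {"title": "Apple iPhone 12 (64 GB)", "brand": "", "price": "735.00"}
# Suppose the ER model predicts "non-match" for (left, right).

# A counterfactual explanation supplies a modified pair for which the model
# returns the desired outcome, here by filling in the missing brand value.
counterfactual_right = {**right, "brand": "Apple"}
# The model's prediction on (left, counterfactual_right) would be "match".
```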
The general idea was to train Ditto on a given training set and measure its performance on the corresponding test set. We then identified the pairs of records in the training set that were misclassified and used CERTA to generate a set of counterfactual explanations for them. The samples obtained from these explanations were added to the original training set, and Ditto was retrained on the augmented set. Once retraining was complete, Ditto's performance was measured again on the same test set.
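A minimal sketch of this augmentation loop is shown below. The interactions with Ditto and CERTA are abstracted behind hypothetical callables (`train`, `predict`, `evaluate`, `explain`); these are placeholders for illustration, not the actual APIs of either system.

```python
from typing import Callable, Sequence, Tuple

# A training example is (left tuple, right tuple, label), with label 1 = match.
Example = Tuple[dict, dict, int]

def augment_with_counterfactuals(
    train_set: Sequence[Example],
    test_set: Sequence[Example],
    train: Callable,     # training set -> trained ER model (e.g. a Ditto wrapper)
    predict: Callable,   # model, (left, right) -> predicted label
    evaluate: Callable,  # model, test set -> F1 score
    explain: Callable,   # model, misclassified example -> counterfactual examples
):
    """Retrain an ER model on a training set augmented with counterfactual samples."""
    model = train(train_set)
    baseline_f1 = evaluate(model, test_set)

    # Pairs in the training set that the model classifies incorrectly.
    misclassified = [(l, r, y) for (l, r, y) in train_set
                     if predict(model, (l, r)) != y]

    # Counterfactual samples generated (e.g. by CERTA) for those pairs,
    # labelled with the outcome the explanation was asked to reach.
    extra = [sample for pair in misclassified for sample in explain(model, pair)]

    # Retrain on the augmented training set and re-evaluate on the same test set.
    model = train(list(train_set) + extra)
    augmented_f1 = evaluate(model, test_set)
    return baseline_f1, augmented_f1
```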