🚀 Feature
Add `InfoLM`.
Sources: "InfoLM: A New Metric to Evaluate Summarization & Data2Text Generation" (Colombo et al., AAAI 2022)
Motivation
The recent NLG metrics are more often based on BERT (or related) embeddings. As such, I believe we should also start adding such metrics to TorchMetrics, with an extra dependency on `transformers` if a user wants to use any of these metrics. The `InfoLM` metric is from a family of untrained metrics (i.e., the model is not fine-tuned on any specific task), so it should be easier for us to begin with it. (Any opinion on this? @Borda :] )
Abstract:
Assessing the quality of natural language generation systems through human annotation is very expensive. Additionally, human annotation campaigns are time-consuming and include non-reusable human labour. In practice, researchers rely on automatic metrics as a proxy of quality. In the last decade, many string-based metrics (e.g., BLEU) have been introduced. However, such metrics usually rely on exact matches and thus, do not robustly handle synonyms. In this paper, we introduce InfoLM, a family of untrained metrics that can be viewed as a string-based metric that addresses the aforementioned flaws thanks to a pre-trained masked language model. This family of metrics also makes use of information measures allowing the adaptation of InfoLM to various evaluation criteria. Using direct assessment, we demonstrate that InfoLM achieves statistically significant improvement and over 10 points of correlation gains in many configurations on both summarization and data2text generation.
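For context, a minimal, self-contained sketch of the computation the abstract describes: mask each position in turn, read the masked LM's distribution over the vocabulary, average the per-position distributions into one distribution per sentence, and compare candidate and reference with an information measure. The uniform averaging and the choice of KL divergence here are illustrative assumptions; the paper covers a whole family of measures.

```python
import torch
from transformers import AutoModelForMaskedLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = AutoModelForMaskedLM.from_pretrained("bert-base-uncased")
model.eval()


@torch.no_grad()
def sentence_distribution(text: str) -> torch.Tensor:
    """Average masked-LM vocabulary distributions over all token positions."""
    enc = tokenizer(text, return_tensors="pt")
    ids = enc["input_ids"]
    dists = []
    for i in range(1, ids.shape[1] - 1):  # skip [CLS] and [SEP]
        masked = ids.clone()
        masked[0, i] = tokenizer.mask_token_id
        logits = model(input_ids=masked, attention_mask=enc["attention_mask"]).logits
        dists.append(torch.softmax(logits[0, i], dim=-1))
    return torch.stack(dists).mean(dim=0)


def infolm_kl(candidate: str, reference: str) -> float:
    """KL(reference || candidate) between the aggregated distributions."""
    p = sentence_distribution(reference)
    q = sentence_distribution(candidate)
    return torch.sum(p * (torch.log(p) - torch.log(q))).item()


print(infolm_kl("the cat sat on the mat", "a cat was sitting on the mat"))
```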