Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fully interactive visualization supporting editing of the DataFrame #204

Closed
frreiss opened this issue Jun 8, 2021 · 1 comment
Closed

Comments

@frreiss
Copy link
Member

frreiss commented Jun 8, 2021

Extend the DataFrame visualization from #203 to support making edits to the DataFrame itself to support data cleaning and active learning applications. Major types of edits that would be useful:

  • Change the begin and end offsets of spans by dragging
  • Add new spans (adding new rows to the DataFrame)
  • Remove spans (remove rows from the DataFrame)
  • Add labels to span (add a new column to the DataFrame with metadata like "is correct", or "entity type", and populate that column from the UI)

With this editing support, it should be possible to do a data cleaning application end-to-end, entirely in the notebook. Early cells in the notebook load the data, maybe train a model on it, and generate a DataFrame with information about what was found in each document. Then the user edits the DataFrames in place. Then later cells in the notebook consume the results from the editing session and do things like retraining the model or writing out a new, corrected data set.

This kind of interactivity and interplay between the JavaScript and the backing Python objects will require a JupyterLab widget.

@frreiss
Copy link
Member Author

frreiss commented Oct 21, 2021

#238 implements most of this functionality, so closing this issue.

@frreiss frreiss closed this as completed Oct 21, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant