Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Consider to add named entity recognition #1

Open
ivbeg opened this issue Feb 11, 2022 · 1 comment
Open

Consider to add named entity recognition #1

ivbeg opened this issue Feb 11, 2022 · 1 comment
Assignees
Labels
enhancement New feature or request

Comments

@ivbeg
Copy link
Collaborator

ivbeg commented Feb 11, 2022

Named entity recognitions technology helps to identify named objects inside texts.

Strong

  • allows to identify objects inside text blobs
  • could allow to support more named entities (identifiers)

Weakness

  • could be very slow
  • need to prepare PII and identifier rules for recognition

Possible implementation - Slovnet https://github.com/natasha/slovnet

@ivbeg ivbeg added the enhancement New feature or request label Feb 11, 2022
@ivbeg ivbeg self-assigned this May 18, 2022
@ivbeg
Copy link
Collaborator Author

ivbeg commented Aug 5, 2022

Presidio looks like possible NER engine.
The ways to implement:

  • support analysis of list of fields
  • support analysis of any string fields with length greater than max_len parameter. Support this parameter.
  • link NER entities to semantic types registry

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Development

No branches or pull requests

1 participant