feat: nltk text splitter support #3403
Pull Request Validation Report
This comment is automatically generated by Conventional PR.
Whitelist Report
Result: Pull request does not satisfy any enabled whitelist criteria. Pull request will be validated.
Validation Report
Result: Pull request satisfies all enabled pull request rules.
Last Modified at 16 Aug 24 23:39 UTC
Thanks, @uladkaminski
LGTM
* feat: nltk text splitter support
* feat: add doc link to nltk text splitter
* [autofix.ci] apply automated fixes

Co-authored-by: autofix-ci[bot] <114827586+autofix-ci[bot]@users.noreply.github.com>
Adds NLTKTextSplitter support. This text splitter uses the Natural Language Toolkit (NLTK), a suite of libraries and programs for symbolic and statistical natural language processing (NLP).
Rather than splitting only on "\n\n", it uses NLTK tokenizers to decide where to split the text.
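
For readers unfamiliar with tokenizer-based splitting, here is a minimal sketch of the general idea. It is not the code added in this PR. The names `split_by_sentences` and `nltk_sentences` and the `chunk_size`, `separator`, and `tokenizer` parameters are hypothetical, chosen for illustration. It assumes sentences come from `nltk.tokenize.sent_tokenize` (which needs the NLTK Punkt data installed) and that whole sentences are packed greedily into chunks up to a character limit.

```python
from typing import Callable, List, Optional


def nltk_sentences(text: str) -> List[str]:
    """Split text into sentences with NLTK (requires the Punkt tokenizer data)."""
    # Imported lazily so the rest of the sketch loads even without NLTK installed.
    from nltk.tokenize import sent_tokenize

    return sent_tokenize(text)


def split_by_sentences(
    text: str,
    chunk_size: int = 1000,
    separator: str = "\n\n",
    tokenizer: Optional[Callable[[str], List[str]]] = None,
) -> List[str]:
    """Greedily pack whole sentences into chunks of at most chunk_size characters.

    Hypothetical helper for illustration. A single sentence longer than
    chunk_size is kept whole as its own chunk rather than cut mid-sentence.
    """
    sentences = (tokenizer or nltk_sentences)(text)
    chunks: List[str] = []
    current: List[str] = []
    current_len = 0
    for sentence in sentences:
        # Cost of adding this sentence, including the joining separator if needed.
        extra = len(sentence) + (len(separator) if current else 0)
        if current and current_len + extra > chunk_size:
            # The chunk would overflow: emit it and start a new one.
            chunks.append(separator.join(current))
            current, current_len = [], 0
            extra = len(sentence)
        current.append(sentence)
        current_len += extra
    if current:
        chunks.append(separator.join(current))
    return chunks
```

Compared with splitting on "\n\n" alone, chunk boundaries here always fall between sentences, so a chunk never ends mid-sentence. Passing a different callable as `tokenizer` swaps in another sentence splitter without touching the packing logic.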
nltk_demo.mov