-
Notifications
You must be signed in to change notification settings - Fork 1.1k
Add support for datasets in local json files #597
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
|
@microsoft-github-policy-service agree |
applications/DeepSpeed-Chat/training/utils/data/raw_datasets.py
Outdated
Show resolved
Hide resolved
|
Hi @conglongli thank you for your reviewing and you really offer me some good advices. |
|
@LeetJoe Most of my comments are resolved, but there are two new issues. Also you missed my ask of fixing the formatting issue. If formatting is not fixed, we can't merge it |
fix some about format
|
@conglongli I have fixed the problems you mentioned above. Thanks for your patience. |
|
@LeetJoe formatting is still failling, please do exactly the following: go to the repo |
reformat code using yapf
|
@conglongli I check the failure and find there is a tool called yapf and I have used it reformat my code. |
|
@LeetJoe yapf is just one of the checks we did. This time it was the only format issue so that I can merge. In future I highly recommend you install and use pre-commit. |
|
I have followed the code you provided above, pre-commit is installed now. |
* fix gitignore * add local dataset dir * add ignore * add local dataset support * add some about local dataset in README.md * fix some * add some in README * remove data dir * add line to gitignore * fix some following good advices * fix some about format * reformat code using yapf --------- Co-authored-by: 宋超 <sc@chainshome.org>

README is also updated.