Remove minhash conditional for 25.02 #558
Signed-off-by: Praateek <[email protected]>
Signed-off-by: Praateek <[email protected]>
  # TODO: remove when dask min version gets bumped
  DASK_SHUFFLE_METHOD_ARG = _dask_version > parse_version("2024.1.0")
  DASK_P2P_ERROR = _dask_version < parse_version("2023.10.0")
  DASK_SHUFFLE_CAST_DTYPE = _dask_version > parse_version("2023.12.0")
- DASK_CUDF_PARQUET_READ_INCONSISTENT_SCHEMA = _dask_version > parse_version("2024.12")
+ DASK_CUDF_PARQUET_READ_INCONSISTENT_SCHEMA = _dask_cudf_version > parse_version(
This was a bug we had. Added the condition for 25.02 in the hope that rapidsai/cudf#17554 gets merged for 25.04.
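To illustrate why it matters which package's version the flag is keyed to, here is a minimal sketch. It assumes `parse_version` behaves like `packaging.version.parse`, and the two installed version strings are hypothetical values chosen for illustration, not taken from this PR.

```python
from packaging.version import parse as parse_version

# Hypothetical installed versions, for illustration only.
dask_version = parse_version("2024.12.1")      # dask uses calendar-style versions
dask_cudf_version = parse_version("24.12.00")  # dask_cudf uses RAPIDS-style YY.MM.PP

# Old check: keyed to the dask version.
flag_from_dask = dask_version > parse_version("2024.12")

# New check: keyed to the dask_cudf version (threshold as suggested in review).
flag_from_dask_cudf = dask_cudf_version > parse_version("25.2.0")

print(flag_from_dask)       # dask alone can clear its threshold...
print(flag_from_dask_cudf)  # ...while this dask_cudf is older than 25.02
```

In this hypothetical environment the two checks disagree, which is the kind of mismatch the change from `_dask_version` to `_dask_cudf_version` is meant to avoid.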
Signed-off-by: Praateek <[email protected]>
nemo_curator/_compat.py
Outdated
  # TODO: remove when dask min version gets bumped
  DASK_SHUFFLE_METHOD_ARG = _dask_version > parse_version("2024.1.0")
  DASK_P2P_ERROR = _dask_version < parse_version("2023.10.0")
  DASK_SHUFFLE_CAST_DTYPE = _dask_version > parse_version("2023.12.0")
- DASK_CUDF_PARQUET_READ_INCONSISTENT_SCHEMA = _dask_version > parse_version("2024.12")
+ DASK_CUDF_PARQUET_READ_INCONSISTENT_SCHEMA = _dask_cudf_version > parse_version(
+ "2025.02"
Suggested change:
- "2025.02"
+ "25.2.0"
We should still hold off on merging this since one mode of installation is the NeMo-FW container that is currently tied to RAPIDS 24.10 (for 0.7.0). I believe the next release should include RAPIDS 25.02 or newer, but until then, this logic might fail in those containers.
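A quick way to see the effect of the suggested threshold string is to compare a few RAPIDS-style versions against both candidates. This is only a sketch: it assumes `parse_version` is `packaging.version.parse`, the helper `exceeds` is hypothetical, and the version strings are illustrative.

```python
from packaging.version import parse as parse_version


def exceeds(installed: str, threshold: str) -> bool:
    """Return True if `installed` is strictly newer than `threshold`."""
    return parse_version(installed) > parse_version(threshold)


# Illustrative RAPIDS-style version strings (YY.MM.PP).
rapids_versions = ["24.10.00", "25.02.00", "25.04.00"]

# Calendar-style threshold: the leading component 2025 dwarfs 24 or 25.
calendar_style = {v: exceeds(v, "2025.02") for v in rapids_versions}

# RAPIDS-style threshold: compared component by component as (25, 2, 0).
rapids_style = {v: exceeds(v, "25.2.0") for v in rapids_versions}

print(calendar_style)
print(rapids_style)
```

With the `"2025.02"` threshold none of these versions passes the strict comparison, whereas `"25.2.0"` passes only versions newer than 25.02. A 24.10 environment, such as the container mentioned above, gets `False` either way.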
Signed-off-by: Praateek <[email protected]>
Description
Since stable now points to 25.02, we can remove the old conditional logic we had to handle the various minhash APIs.
Also fixes an issue with dask_cudf.read_parquet and inconsistent schemas; we hope rapidsai/cudf#17554 gets merged in 25.04.
Closes #557.
Usage
# Add snippet demonstrating usage
Checklist