-
Notifications
You must be signed in to change notification settings - Fork 4.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Support Embeddings in mltransform #29564
Merged
AnandInguva
merged 58 commits into
apache:master
from
AnandInguva:embeddings_mltransform
Dec 11, 2023
Merged
Changes from 2 commits
Commits
Show all changes
58 commits
Select commit
Hold shift + click to select a range
80e5c4a
Make base.py framework agnostic and add helper transforms
AnandInguva 0d34847
Add tests for base.py
AnandInguva 58b24f6
Add sentence-transformers
AnandInguva 88f9ceb
Add tensorflow hub
AnandInguva 23f7027
Add vertex_ai
AnandInguva 04ebdb0
Make TFTProcessHandler a PTransform
AnandInguva f86c259
raise RuntimeError in ArtifactsFetcher when it is used for embeddings
AnandInguva fc4ec00
Add JsonPickle to requirements
AnandInguva 3da5ce8
Add tox tests
AnandInguva 4b4ee58
Mock frameworks in pydocs
AnandInguva 01ba217
Add Row type check
AnandInguva f080c25
Remove requires_chaining
AnandInguva 6111c31
change name of PTransformProvider to MLTransformProvider
AnandInguva ba24e81
remove batch_len in utility fun
AnandInguva d690aec
Change type annotation and redundant comments
AnandInguva af7496b
Remove get_transforms method
AnandInguva d713555
remove requires_chaining from tft
AnandInguva 50450f3
add tests to sentence-transformers
AnandInguva c2b691f
Merge remote-tracking branch 'origin/master' into embeddings_mltransform
AnandInguva 8823a75
Pass inference_args to RunInference
AnandInguva a7e2bd3
Add TODO GH issue
AnandInguva 519b3ed
Merge branch 'embeddings_mltransform' of https://github.com/AnandIngu…
AnandInguva f77ae60
refactor variables in vertex_ai embeddings
AnandInguva 95ed3c5
remove try/catch and throw error if options is empty for GCS artifact…
AnandInguva c235499
Refactor NotImplementedError message
AnandInguva 6eebfa4
remove tensorflow hub from this PR
AnandInguva c27aabb
Add _validate_transform method
AnandInguva 422a86a
add more tests
AnandInguva 08b3665
fix test
AnandInguva 91255ad
Fix test
AnandInguva c7237c3
Add more tests in sentence-transformer
AnandInguva a942885
use np.max instead of max
AnandInguva 89c19fb
round to 2 decimals
AnandInguva 2db4a20
Remove gradle command action
AnandInguva b7a48d5
Refactor throwing dataflow client exception
AnandInguva eb46e08
Merge branch 'embeddings_mltransform' of https://github.com/AnandIngu…
AnandInguva bad1b3b
skip the test if gcp is not installed
AnandInguva b850cee
remove toxTests for hub
AnandInguva ffff21a
remove toxTests for hub
AnandInguva 88412ea
Fix values in assert for sentence_transformer_test
AnandInguva 617f9d6
rename sentence_transformers to huggingface
AnandInguva 5cae04b
fix pydocs
AnandInguva 489200f
Change the model name for tests since it is getting different results…
AnandInguva 816174a
Fix pydoc in vertexai
AnandInguva cfb1883
add suffix to artifact_location
AnandInguva 2cb6f03
Revert "add suffix to artifact_location"
AnandInguva cd7050e
add no_xdist
AnandInguva 98cd949
Try fixing pydoc for vertexai
AnandInguva 8ea0906
change tox.ini to use pytest directly
AnandInguva 5187b0e
Merge remote-tracking branch 'origin/master' into embeddings_mltransform
AnandInguva 6f83d3c
raise FileExistError if Attribute file is already present
AnandInguva c9ddb25
Merge branch 'embeddings_mltransform' of https://github.com/AnandIngu…
AnandInguva 9dce3cf
modify build.gradle to match tox task names
AnandInguva 539c9ad
Add note to CHANGES.md
AnandInguva b967cd8
change gcs bucket to gs://temp-storage-for-perf-tests
AnandInguva f1bb42c
Add TODO GH links
AnandInguva 8d0b47d
Merge remote-tracking branch 'origin/master' into embeddings_mltransform
AnandInguva c173d6a
Update CHANGES.md
AnandInguva File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Good call - one possible future enhancement would be to support an overwrite argument that allows users to do this