6 changes: 5 additions & 1 deletion docs/_src/api/api/document_classifier.md
@@ -84,7 +84,7 @@ With this document_classifier, you can directly get predictions via predict()
#### TransformersDocumentClassifier.\_\_init\_\_

```python
def __init__(model_name_or_path: str = "bhadresh-savani/distilbert-base-uncased-emotion", model_version: Optional[str] = None, tokenizer: Optional[str] = None, use_gpu: bool = True, return_all_scores: bool = False, task: str = "text-classification", labels: Optional[List[str]] = None, batch_size: int = 16, classification_field: str = None, progress_bar: bool = True, use_auth_token: Optional[Union[str, bool]] = None)
def __init__(model_name_or_path: str = "bhadresh-savani/distilbert-base-uncased-emotion", model_version: Optional[str] = None, tokenizer: Optional[str] = None, use_gpu: bool = True, return_all_scores: bool = False, task: str = "text-classification", labels: Optional[List[str]] = None, batch_size: int = 16, classification_field: str = None, progress_bar: bool = True, use_auth_token: Optional[Union[str, bool]] = None, devices: Optional[List[Union[str, torch.device]]] = None)
```

Load a text classification model from Transformers.
@@ -122,6 +122,10 @@ If this parameter is set to `True`, then the token generated when running
`transformers-cli login` (stored in ~/.huggingface) will be used.
Additional information can be found here
https://huggingface.co/transformers/main_classes/model.html#transformers.PreTrainedModel.from_pretrained
- `devices`: List of torch devices (e.g. cuda, cpu, mps) to limit inference to specific devices.
A list containing torch device objects and/or strings is supported (for example
[torch.device('cuda:0'), "mps", "cuda:1"]). If `use_gpu=False` is specified, the devices
parameter is not used and a single CPU device is used for inference. See the usage example below.
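
As an illustration, here is a minimal sketch of passing `devices` at construction time. It assumes the usual `haystack.nodes` import path and a machine with two CUDA GPUs:

```python
import torch
from haystack.nodes import TransformersDocumentClassifier

# Limit classification to two specific devices; torch.device objects and strings can be mixed.
doc_classifier = TransformersDocumentClassifier(
    model_name_or_path="bhadresh-savani/distilbert-base-uncased-emotion",
    devices=[torch.device("cuda:0"), "cuda:1"],
)
```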

<a id="transformers.TransformersDocumentClassifier.predict"></a>

6 changes: 5 additions & 1 deletion docs/_src/api/api/document_store.md
@@ -1652,7 +1652,7 @@ In-memory document store
#### InMemoryDocumentStore.\_\_init\_\_

```python
def __init__(index: str = "document", label_index: str = "label", embedding_field: Optional[str] = "embedding", embedding_dim: int = 768, return_embedding: bool = False, similarity: str = "dot_product", progress_bar: bool = True, duplicate_documents: str = "overwrite", use_gpu: bool = True, scoring_batch_size: int = 500000)
def __init__(index: str = "document", label_index: str = "label", embedding_field: Optional[str] = "embedding", embedding_dim: int = 768, return_embedding: bool = False, similarity: str = "dot_product", progress_bar: bool = True, duplicate_documents: str = "overwrite", use_gpu: bool = True, scoring_batch_size: int = 500000, devices: Optional[List[Union[str, torch.device]]] = None)
```

**Arguments**:
@@ -1680,6 +1680,10 @@ Very large batch sizes can overrun GPU memory. In general you want to make sure
you have at least `embedding_dim`*`scoring_batch_size`*4 bytes available in GPU memory.
Since the data is originally stored in CPU memory there is little risk of overrunning memory
when running on CPU.
- `devices`: List of torch devices (e.g. cuda, cpu, mps) to limit inference to specific devices.
A list containing torch device objects and/or strings is supported (for example
[torch.device('cuda:0'), "mps", "cuda:1"]). If `use_gpu=False` is specified, the devices
parameter is not used and a single CPU device is used for inference. See the usage example below.
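
A minimal sketch of pinning similarity scoring to one GPU, assuming the usual `haystack.document_stores` import path. With the defaults (`embedding_dim=768`, `scoring_batch_size=500000`), scoring needs roughly 768 * 500000 * 4 bytes ≈ 1.5 GB of free GPU memory:

```python
from haystack.document_stores import InMemoryDocumentStore

# Keep embedding similarity scoring on the first GPU; with use_gpu=False this list is ignored.
document_store = InMemoryDocumentStore(
    embedding_dim=768,
    similarity="dot_product",
    devices=["cuda:0"],
)
```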

<a id="memory.InMemoryDocumentStore.write_documents"></a>

4 changes: 4 additions & 0 deletions docs/_src/api/api/extractor.md
@@ -29,6 +29,10 @@ If this parameter is set to `True`, then the token generated when running
`transformers-cli login` (stored in ~/.huggingface) will be used.
Additional information can be found here
https://huggingface.co/transformers/main_classes/model.html#transformers.PreTrainedModel.from_pretrained
- `devices`: List of torch devices (e.g. cuda, cpu, mps) to limit inference to specific devices.
A list containing torch device objects and/or strings is supported (for example
[torch.device('cuda:0'), "mps", "cuda:1"]). If `use_gpu=False` is specified, the devices
parameter is not used and a single CPU device is used for inference. See the usage example below.
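
A minimal sketch, assuming `EntityExtractor` is importable from `haystack.nodes` and accepts `devices` as documented above; the model name is only an example NER model from the Hugging Face Hub:

```python
from haystack.nodes import EntityExtractor

# Run named-entity extraction on the first GPU only.
entity_extractor = EntityExtractor(
    model_name_or_path="dslim/bert-base-NER",
    devices=["cuda:0"],
)
```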

<a id="entity.EntityExtractor.run"></a>

12 changes: 10 additions & 2 deletions docs/_src/api/api/generator.md
@@ -138,7 +138,7 @@ i.e. the model can easily adjust to domain documents even after training has fin
#### RAGenerator.\_\_init\_\_

```python
def __init__(model_name_or_path: str = "facebook/rag-token-nq", model_version: Optional[str] = None, retriever: Optional[DensePassageRetriever] = None, generator_type: str = "token", top_k: int = 2, max_length: int = 200, min_length: int = 2, num_beams: int = 2, embed_title: bool = True, prefix: Optional[str] = None, use_gpu: bool = True, progress_bar: bool = True, use_auth_token: Optional[Union[str, bool]] = None)
def __init__(model_name_or_path: str = "facebook/rag-token-nq", model_version: Optional[str] = None, retriever: Optional[DensePassageRetriever] = None, generator_type: str = "token", top_k: int = 2, max_length: int = 200, min_length: int = 2, num_beams: int = 2, embed_title: bool = True, prefix: Optional[str] = None, use_gpu: bool = True, progress_bar: bool = True, use_auth_token: Optional[Union[str, bool]] = None, devices: Optional[List[Union[str, torch.device]]] = None)
```

Load a RAG model from Transformers along with passage_embedding_model.
Expand Down Expand Up @@ -166,6 +166,10 @@ If this parameter is set to `True`, then the token generated when running
`transformers-cli login` (stored in ~/.huggingface) will be used.
Additional information can be found here
https://huggingface.co/transformers/main_classes/model.html#transformers.PreTrainedModel.from_pretrained
- `devices`: List of torch devices (e.g. cuda, cpu, mps) to limit inference to specific devices.
A list containing torch device objects and/or strings is supported (for example
[torch.device('cuda:0'), "mps", "cuda:1"]). If `use_gpu=False` is specified, the devices
parameter is not used and a single CPU device is used for inference. See the usage example below.
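
For illustration, a minimal sketch that keeps answer generation on a specific GPU, assuming the usual `haystack.nodes` import path:

```python
from haystack.nodes import RAGenerator

# Generate answers on cuda:0 instead of whichever device would be picked automatically.
generator = RAGenerator(
    model_name_or_path="facebook/rag-token-nq",
    top_k=2,
    devices=["cuda:0"],
)
```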

<a id="transformers.RAGenerator.predict"></a>

@@ -262,7 +266,7 @@ the [Hugging Face Model Hub](https://huggingface.co/models?pipeline_tag=text2tex
#### Seq2SeqGenerator.\_\_init\_\_

```python
def __init__(model_name_or_path: str, input_converter: Optional[Callable] = None, top_k: int = 1, max_length: int = 200, min_length: int = 2, num_beams: int = 8, use_gpu: bool = True, progress_bar: bool = True, use_auth_token: Optional[Union[str, bool]] = None)
def __init__(model_name_or_path: str, input_converter: Optional[Callable] = None, top_k: int = 1, max_length: int = 200, min_length: int = 2, num_beams: int = 8, use_gpu: bool = True, progress_bar: bool = True, use_auth_token: Optional[Union[str, bool]] = None, devices: Optional[List[Union[str, torch.device]]] = None)
```

**Arguments**:
@@ -284,6 +288,10 @@ If this parameter is set to `True`, then the token generated when running
`transformers-cli login` (stored in ~/.huggingface) will be used.
Additional information can be found here
https://huggingface.co/transformers/main_classes/model.html#transformers.PreTrainedModel.from_pretrained
- `devices`: List of torch devices (e.g. cuda, cpu, mps) to limit inference to specific devices.
A list containing torch device objects and/or strings is supported (for example
[torch.device('cuda:0'), "mps", "cuda:1"]). If `use_gpu=False` is specified, the devices
parameter is not used and a single CPU device is used for inference. See the usage example below.
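
A minimal sketch, assuming the usual `haystack.nodes` import path; the model name is only an example of a seq2seq generation model from the Hugging Face Hub:

```python
from haystack.nodes import Seq2SeqGenerator

# Pin the generator to the second GPU of a multi-GPU machine.
generator = Seq2SeqGenerator(
    model_name_or_path="yjernite/bart_eli5",
    devices=["cuda:1"],
)
```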

<a id="transformers.Seq2SeqGenerator.predict"></a>

6 changes: 5 additions & 1 deletion docs/_src/api/api/pseudo_label_generator.md
@@ -53,7 +53,7 @@ For example:
#### PseudoLabelGenerator.\_\_init\_\_

```python
def __init__(question_producer: Union[QuestionGenerator, List[Dict[str, str]]], retriever: BaseRetriever, cross_encoder_model_name_or_path: str = "cross-encoder/ms-marco-MiniLM-L-6-v2", max_questions_per_document: int = 3, top_k: int = 50, batch_size: int = 16, progress_bar: bool = True, use_auth_token: Optional[Union[str, bool]] = None)
def __init__(question_producer: Union[QuestionGenerator, List[Dict[str, str]]], retriever: BaseRetriever, cross_encoder_model_name_or_path: str = "cross-encoder/ms-marco-MiniLM-L-6-v2", max_questions_per_document: int = 3, top_k: int = 50, batch_size: int = 16, progress_bar: bool = True, use_auth_token: Optional[Union[str, bool]] = None, use_gpu: bool = True, devices: Optional[List[Union[str, torch.device]]] = None)
```

Loads the cross-encoder model and prepares PseudoLabelGenerator.
@@ -74,6 +74,10 @@ If this parameter is set to `True`, then the token generated when running
`transformers-cli login` (stored in ~/.huggingface) will be used.
Additional information can be found here
https://huggingface.co/transformers/main_classes/model.html#transformers.PreTrainedModel.from_pretrained
- `devices`: List of torch devices (e.g. cuda, cpu, mps) to limit CrossEncoder inference to specific devices.
A list containing torch device objects and/or strings is supported (for example
[torch.device('cuda:0'), "mps", "cuda:1"]). If `use_gpu=False` is specified, the devices
parameter is not used and a single CPU device is used for inference. See the usage example below.
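
A minimal sketch of pinning the cross-encoder to one GPU. It assumes the usual `haystack` import paths and uses an `EmbeddingRetriever` over an `InMemoryDocumentStore` purely for illustration; any `BaseRetriever` would do:

```python
from haystack.document_stores import InMemoryDocumentStore
from haystack.nodes import EmbeddingRetriever, PseudoLabelGenerator, QuestionGenerator

document_store = InMemoryDocumentStore(embedding_dim=768)
retriever = EmbeddingRetriever(
    document_store=document_store,
    embedding_model="sentence-transformers/multi-qa-mpnet-base-dot-v1",  # example model
)

# The devices list only affects the cross-encoder used for pseudo-label scoring.
pseudo_label_generator = PseudoLabelGenerator(
    question_producer=QuestionGenerator(),
    retriever=retriever,
    devices=["cuda:0"],
)
```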

<a id="pseudo_label_generator.PseudoLabelGenerator.generate_questions"></a>

6 changes: 5 additions & 1 deletion docs/_src/api/api/query_classifier.md
@@ -144,7 +144,7 @@ This node also supports zero-shot-classification.
#### TransformersQueryClassifier.\_\_init\_\_

```python
def __init__(model_name_or_path: Union[Path, str] = "shahrukhx01/bert-mini-finetune-question-detection", model_version: Optional[str] = None, tokenizer: Optional[str] = None, use_gpu: bool = True, task: str = "text-classification", labels: List[str] = DEFAULT_LABELS, batch_size: int = 16, progress_bar: bool = True, use_auth_token: Optional[Union[str, bool]] = None)
def __init__(model_name_or_path: Union[Path, str] = "shahrukhx01/bert-mini-finetune-question-detection", model_version: Optional[str] = None, tokenizer: Optional[str] = None, use_gpu: bool = True, task: str = "text-classification", labels: List[str] = DEFAULT_LABELS, batch_size: int = 16, progress_bar: bool = True, use_auth_token: Optional[Union[str, bool]] = None, devices: Optional[List[Union[str, torch.device]]] = None)
```

**Arguments**:
@@ -165,4 +165,8 @@ If this parameter is set to `True`, then the token generated when running
`transformers-cli login` (stored in ~/.huggingface) will be used.
Additional information can be found here
https://huggingface.co/transformers/main_classes/model.html#transformers.PreTrainedModel.from_pretrained
- `devices`: List of torch devices (e.g. cuda, cpu, mps) to limit inference to specific devices.
A list containing torch device objects and/or strings is supported (for example
[torch.device('cuda:0'), "mps", "cuda:1"]). If `use_gpu=False` is specified, the devices
parameter is not used and a single CPU device is used for inference. See the usage example below.
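
A minimal sketch, assuming the usual `haystack.nodes` import path, that keeps query classification on a specific GPU:

```python
from haystack.nodes import TransformersQueryClassifier

# Classify incoming queries (e.g. keyword vs. natural-language) on cuda:0.
query_classifier = TransformersQueryClassifier(
    model_name_or_path="shahrukhx01/bert-mini-finetune-question-detection",
    devices=["cuda:0"],
)
```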

6 changes: 5 additions & 1 deletion docs/_src/api/api/question_generator.md
@@ -23,7 +23,7 @@ come from earlier in the document.
#### QuestionGenerator.\_\_init\_\_

```python
def __init__(model_name_or_path="valhalla/t5-base-e2e-qg", model_version=None, num_beams=4, max_length=256, no_repeat_ngram_size=3, length_penalty=1.5, early_stopping=True, split_length=50, split_overlap=10, use_gpu=True, prompt="generate questions:", num_queries_per_doc=1, sep_token: str = "<sep>", batch_size: int = 16, progress_bar: bool = True, use_auth_token: Optional[Union[str, bool]] = None)
def __init__(model_name_or_path="valhalla/t5-base-e2e-qg", model_version=None, num_beams=4, max_length=256, no_repeat_ngram_size=3, length_penalty=1.5, early_stopping=True, split_length=50, split_overlap=10, use_gpu=True, prompt="generate questions:", num_queries_per_doc=1, sep_token: str = "<sep>", batch_size: int = 16, progress_bar: bool = True, use_auth_token: Optional[Union[str, bool]] = None, devices: Optional[List[Union[str, torch.device]]] = None)
```

Uses the valhalla/t5-base-e2e-qg model by default. This class supports any question generation model that is
@@ -45,6 +45,10 @@ If this parameter is set to `True`, then the token generated when running
`transformers-cli login` (stored in ~/.huggingface) will be used.
Additional information can be found here
https://huggingface.co/transformers/main_classes/model.html#transformers.PreTrainedModel.from_pretrained
- `devices`: List of torch devices (e.g. cuda, cpu, mps) to limit inference to specific devices.
A list containing torch device objects and/or strings is supported (for example
[torch.device('cuda:0'), "mps", "cuda:1"]). If `use_gpu=False` is specified, the devices
parameter is not used and a single CPU device is used for inference. See the usage example below.
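
For illustration, a minimal sketch assuming the usual `haystack.nodes` import path:

```python
from haystack.nodes import QuestionGenerator

# Generate questions on the first GPU only.
question_generator = QuestionGenerator(
    model_name_or_path="valhalla/t5-base-e2e-qg",
    devices=["cuda:0"],
)
```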

<a id="question_generator.QuestionGenerator.generate_batch"></a>

8 changes: 4 additions & 4 deletions docs/_src/api/api/ranker.md
@@ -105,10 +105,6 @@ See https://huggingface.co/cross-encoder for full list of available models
- `model_version`: The version of model to use from the HuggingFace model hub. Can be tag name, branch name, or commit hash.
- `top_k`: The maximum number of documents to return
- `use_gpu`: Whether to use all available GPUs or the CPU. Falls back on CPU if no GPU is available.
- `devices`: List of GPU (or CPU) devices, to limit inference to certain GPUs and not use all available ones
The strings will be converted into pytorch devices, so use the string notation described here:
https://pytorch.org/docs/stable/tensor_attributes.html?highlight=torch%20device#torch.torch.device
(e.g. ["cuda:0"]).
- `batch_size`: Number of documents to process at a time.
- `scale_score`: The raw predictions will be transformed using a Sigmoid activation function in case the model
only predicts a single label. For multi-label predictions, no scaling is applied. Set this
@@ -119,6 +115,10 @@ If this parameter is set to `True`, then the token generated when running
`transformers-cli login` (stored in ~/.huggingface) will be used.
Additional information can be found here
https://huggingface.co/transformers/main_classes/model.html#transformers.PreTrainedModel.from_pretrained
- `devices`: List of torch devices (e.g. cuda, cpu, mps) to limit inference to specific devices.
A list containing torch device objects and/or strings is supported (for example
[torch.device('cuda:0'), "mps", "cuda:1"]). If `use_gpu=False` is specified, the devices
parameter is not used and a single CPU device is used for inference. See the usage example below.
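
A minimal sketch, assuming the usual `haystack.nodes` import path, that keeps re-ranking on one GPU:

```python
from haystack.nodes import SentenceTransformersRanker

# Re-rank retrieved documents with a cross-encoder pinned to cuda:0.
ranker = SentenceTransformersRanker(
    model_name_or_path="cross-encoder/ms-marco-MiniLM-L-6-v2",
    devices=["cuda:0"],
)
```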

<a id="sentence_transformers.SentenceTransformersRanker.predict"></a>
