Dataset load for benchmarking #75

LuvvAggarwal · 2023-08-14T17:17:41Z

Added a utility for loading dataset from hugging face

CLAassistant · 2023-08-14T17:17:46Z

All committers have signed the CLA.

HashemAlsaket · 2023-08-15T15:58:37Z

Awesome work @LuvvAggarwal . Could you add an example with the changes added from PR #72 for review?

HashemAlsaket · 2023-08-22T22:52:03Z

prompttools/benchmarks/load_data.py

@@ -0,0 +1,44 @@
+from datasets import load_dataset_builder,load_dataset,get_dataset_config_names, Dataset


We'll want to guard prompttools from requiring datasets to be installed as part of the requirements.

Suggested change:

try: from datasets import load_dataset_builder, load_dataset, get_dataset_config_names, Dataset from datasets.dataset_dict import DatasetDict except ImportError: load_dataset = None

HashemAlsaket · 2023-08-22T22:53:18Z

prompttools/benchmarks/load_data.py

+        dataset_name: str,
+        split: Literal["train","validation","test"] | None
+    ):
+        self.dataset_name = dataset_name


Suggested change:

if load_dataset is None: raise ModuleNotFoundError( "Package `datasets` is required to be installed to use this experiment." "Please use `pip install datasets` to install the package" )

Dataset load for benchmarking

6bc2c86

Added a utility for loading dataset from hugging face

LuvvAggarwal mentioned this pull request Aug 14, 2023

Benchmarking #72

Merged

LuvvAggarwal marked this pull request as ready for review August 15, 2023 10:25

HashemAlsaket reviewed Aug 22, 2023

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Dataset load for benchmarking #75

Dataset load for benchmarking #75

LuvvAggarwal commented Aug 14, 2023

CLAassistant commented Aug 14, 2023 •

edited

Loading

HashemAlsaket commented Aug 15, 2023

HashemAlsaket Aug 22, 2023

HashemAlsaket Aug 22, 2023

		@@ -0,0 +1,44 @@
		from datasets import load_dataset_builder,load_dataset,get_dataset_config_names, Dataset

Dataset load for benchmarking #75

Are you sure you want to change the base?

Dataset load for benchmarking #75

Conversation

LuvvAggarwal commented Aug 14, 2023

CLAassistant commented Aug 14, 2023 • edited Loading

HashemAlsaket commented Aug 15, 2023

HashemAlsaket Aug 22, 2023

Choose a reason for hiding this comment

HashemAlsaket Aug 22, 2023

Choose a reason for hiding this comment

CLAassistant commented Aug 14, 2023 •

edited

Loading