Eventual-Inc · jaychia · Aug 28, 2024 · Aug 27, 2024
diff --git a/docs/source/user_guide/integrations/huggingface.rst b/docs/source/user_guide/integrations/huggingface.rst
@@ -1,7 +1,7 @@
 Huggingface Datasets
 ====================
 
-Daft is able to read datasets directly from Huggingface via the ``hf://`` protocol.
+Daft is able to read datasets directly from Huggingface via the ``hf://datasets/`` protocol.
 
 Since huggingface will `automatically convert <https://huggingface.co/docs/dataset-viewer/en/parquet>`_ all public datasets to parquet format,
 we can read these datasets using the ``read_parquet`` method.
@@ -19,7 +19,7 @@ Reading Public Datasets
 
     import daft
 
-    df = daft.read_parquet("hf://username/dataset_name")
+    df = daft.read_parquet("hf://datasets/username/dataset_name")
 
 This will read the entire dataset into a daft DataFrame.
 
@@ -29,12 +29,12 @@ Not only can you read entire datasets, but you can also read individual files fr
 
     import daft
 
-    df = daft.read_parquet("hf://username/dataset_name/file_name.parquet")
+    df = daft.read_parquet("hf://datasets/username/dataset_name/file_name.parquet")
     # or a csv file
-    df = daft.read_csv("hf://username/dataset_name/file_name.csv")
+    df = daft.read_csv("hf://datasets/username/dataset_name/file_name.csv")
 
     # or a glob pattern
-    df = daft.read_parquet("hf://username/dataset_name/**/*.parquet")
+    df = daft.read_parquet("hf://datasets/username/dataset_name/**/*.parquet")
 
 
 Authorization
@@ -47,18 +47,18 @@ For authenticated datasets:
     from daft.io import IOConfig, HTTPConfig
 
     io_config = IoConfig(http=HTTPConfig(bearer_token="your_token"))
-    df = daft.read_parquet("hf://username/dataset_name", io_config=io_config)
+    df = daft.read_parquet("hf://datasets/username/dataset_name", io_config=io_config)
 
 
 It's important to note that this will not work with standard tier private datasets.
 Huggingface does not auto convert private datasets to parquet format, so you will need to specify the path to the files you want to read.
 
 .. code:: python
 
-    df = daft.read_parquet("hf://username/my_private_dataset", io_config=io_config) # Errors
+    df = daft.read_parquet("hf://datasets/username/my_private_dataset", io_config=io_config) # Errors
 
 to get around this, you can read all files using a glob pattern *(assuming they are in parquet format)*
 
 .. code:: python
 
-    df = daft.read_parquet("hf://username/my_private_dataset/**/*.parquet", io_config=io_config) # Works
+    df = daft.read_parquet("hf://datasets/username/my_private_dataset/**/*.parquet", io_config=io_config) # Works