huggingface · patil-suraj · Mar 29, 2022 · Mar 29, 2022
diff --git a/docs/source/preprocessing.mdx b/docs/source/preprocessing.mdx
@@ -175,7 +175,7 @@ Set the `return_tensors` parameter to either `pt` for PyTorch, or `tf` for Tenso
 ...     "Don't think he knows about second breakfast, Pip.",
 ...     "What about elevensies?",
 ... ]
->>> encoded_input = tokenizer(batch, padding=True, truncation=True, return_tensors="tf")
+>>> encoded_input = tokenizer(batch_sentences, padding=True, truncation=True, return_tensors="tf")
 >>> print(encoded_input)
 {'input_ids': <tf.Tensor: shape=(2, 9), dtype=int32, numpy=
 array([[  101,   153,  7719, 21490,  1122,  1114,  9582,  1623,   102],
@@ -494,4 +494,4 @@ A processor combines a feature extractor and tokenizer. Load a processor with [`
 
 Notice the processor has added `input_values` and `labels`. The sampling rate has also been correctly downsampled to 16kHz.
 
-Awesome, you should now be able to preprocess data for any modality and even combine different modalities! In the next tutorial, learn how to fine-tune a model on your newly preprocessed data.
+Awesome, you should now be able to preprocess data for any modality and even combine different modalities! In the next tutorial, learn how to fine-tune a model on your newly preprocessed data.