Skip to content

Commit c422d58

Browse files
author
Nick Pentreath
committed
Clean up review comments
1 parent 4c2c78c commit c422d58

File tree

1 file changed

+4
-4
lines changed

1 file changed

+4
-4
lines changed

docs/ml-collaborative-filtering.md

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -66,7 +66,7 @@ test dataset that were not present during training the model. This typically occ
6666
scenarios:
6767

6868
1. In production, for new users or items that have no rating history and on which the model has not
69-
been trained (this is the "cold start problem")
69+
been trained (this is the "cold start problem").
7070
2. During cross-validation, the data is split between training and evaluation sets. When using
7171
simple random splits as in Spark's `CrossValidator` or `TrainValidationSplit`, it is actually
7272
very common to encounter users and/or items in the evaluation set that are not in the training set
@@ -80,12 +80,12 @@ in `NaN` results for the evaluation metric (for example when using `RegressionEv
8080
This makes model selection impossible.
8181

8282
Spark allows users to set the `coldStartStrategy` parameter
83-
to `drop` in order to drop any rows in the `DataFrame` of predictions that contain `NaN` values.
83+
to "drop" in order to drop any rows in the `DataFrame` of predictions that contain `NaN` values.
8484
The evaluation metric will then be computed over the non-`NaN` data and will be valid.
8585
Usage of this parameter is illustrated in the example below.
8686

87-
**Note:** currently the supported cold start strategies are `nan` (the default behavior mentioned
88-
above) and `drop`. Further strategies may be supported in future versions.
87+
**Note:** currently the supported cold start strategies are "nan" (the default behavior mentioned
88+
above) and "drop". Further strategies may be supported in future.
8989

9090
**Examples**
9191

0 commit comments

Comments
 (0)