File tree Expand file tree Collapse file tree 1 file changed +4
-4
lines changed
Expand file tree Collapse file tree 1 file changed +4
-4
lines changed Original file line number Diff line number Diff line change @@ -66,7 +66,7 @@ test dataset that were not present during training the model. This typically occ
6666scenarios:
6767
68681 . In production, for new users or items that have no rating history and on which the model has not
69- been trained (this is the "cold start problem")
69+ been trained (this is the "cold start problem").
70702 . During cross-validation, the data is split between training and evaluation sets. When using
7171simple random splits as in Spark's ` CrossValidator ` or ` TrainValidationSplit ` , it is actually
7272very common to encounter users and/or items in the evaluation set that are not in the training set
@@ -80,12 +80,12 @@ in `NaN` results for the evaluation metric (for example when using `RegressionEv
8080This makes model selection impossible.
8181
8282Spark allows users to set the ` coldStartStrategy ` parameter
83- to ` drop ` in order to drop any rows in the ` DataFrame ` of predictions that contain ` NaN ` values.
83+ to " drop" in order to drop any rows in the ` DataFrame ` of predictions that contain ` NaN ` values.
8484The evaluation metric will then be computed over the non-` NaN ` data and will be valid.
8585Usage of this parameter is illustrated in the example below.
8686
87- ** Note:** currently the supported cold start strategies are ` nan ` (the default behavior mentioned
88- above) and ` drop ` . Further strategies may be supported in future versions .
87+ ** Note:** currently the supported cold start strategies are " nan" (the default behavior mentioned
88+ above) and " drop" . Further strategies may be supported in future.
8989
9090** Examples**
9191
You can’t perform that action at this time.
0 commit comments