You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I am currently using the Vowpal Wabbit package in order to implement a Contextual Bandit use case.
My use case is to provide categories(L1/L2/L3/L4/L5) considered action here with personalized ranking to the user on the basis of context like:
recent_searched_categories
clicked_categories
type_of_user = daily/monthly/weekly
age={agebracket} 1,2,3
gender=male/female
tier=tier1/2/3/4
I have simulated a cost function and learned online on the basis of cost and action chosen using --cb_explore_adf -q UA param.
Is the data format mentioned above is correct? If not how should we create an input training dataset to learn the model.
Please suggest what type of algorithms we can use for the above use case for exploring as well as optimizing the probabilities for all the categories on the basis of context and how to validate the performance of the algorithm.
The text was updated successfully, but these errors were encountered:
Otherwise seems correct.
2. Seems like adding cA and rA (clicked_cats * Actions and recent_cats * Actions) interactions should be useful here ("-q UA cA rA")
Hi @ataymano, Thank you for the response.
Also, can you guide me here on which kind of algorithm works best for this above use case (softmax, RND, epsilon-greedy)?
I need ranking for the categories(can consider probabilities from the model) on the basis of recent categories & clicked categories affinity to the particular user. (My reward function will depend on these two).
I am currently using the Vowpal Wabbit package in order to implement a Contextual Bandit use case.
My use case is to provide categories(L1/L2/L3/L4/L5) considered action here with personalized ranking to the user on the basis of context like:
I have simulated a cost function and learned online on the basis of cost and action chosen using --cb_explore_adf -q UA param.
Sample Dataset:
My question here is:
The text was updated successfully, but these errors were encountered: