You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I am trying to pair your reproduced baselines (downloaded from Google Drive) with my own human proxy model trained in the new Overcooked-AI environment (with old dynamics).
To make them compatible I first use the old lossless_state_encoding function for the baselines (only for the baselines model because my HP is trained in the new env).
Also, the env requires tensorflow 2 but your models are trained using tensorflow 1, so I load your models through the following code:
defget_model_policy_from_saved_model(save_dir, sim_threads=30):
"""Get a policy function from a saved model"""predictor=tf.saved_model.load(save_dir)
step_fn=lambdaobs: predictor.signatures["serving_default"](tf.convert_to_tensor(obs, dtype=tf.float32))["action_probs"]
returnget_model_policy(step_fn, sim_threads)
I am trying to pair your reproduced baselines (downloaded from Google Drive) with my own human proxy model trained in the new Overcooked-AI environment (with old dynamics).
To make them compatible I first use the old
lossless_state_encoding
function for the baselines (only for the baselines model because my HP is trained in the new env).Also, the env requires
tensorflow 2
but your models are trained usingtensorflow 1
, so I load your models through the following code:However this would raise warnings like:
The code to pair the MEP baseline with my human proxy model is as follows:
But the evaluation results are quite unsatisfying. Am I missing some steps or the warnings do matter?
The text was updated successfully, but these errors were encountered: