Here is some code for context (you can also look here for more context: https://www.learnpytorch.io/01_pytorch_workflow/, scroll down a bit). I marked the line of code I was having trouble understanding (see the comment in the snippet below). The comment tries to explain why .type(torch.float) was used, and I realized it is necessary (otherwise the subsequent plot breaks), but it still doesn't make any sense to me why. I checked the datatype of both test_pred and y_test and they are both torch.float32 even without this operation. I also compared the test_loss_values and train_loss_values lists with and without .type(torch.float) and I don't see a difference, yet when you make the plot without this operation it breaks. Can someone please help me understand? Thank you.
import torch
import matplotlib.pyplot as plt

with torch.inference_mode():
    # 1. Forward pass on test data
    test_pred = model_0(X_test)

    # 2. Calculate loss on test data (the notebook's comment says predictions
    #    come in torch.float, so comparisons need tensors of the same type)
    test_loss = loss_fn(test_pred, y_test.type(torch.float))  # <- the line I was asking about

# Plot the loss curves
plt.plot(epoch_count, train_loss_values, label="Train loss")
plt.plot(epoch_count, test_loss_values, label="Test loss")
plt.title("Training and test loss curves")
plt.ylabel("Loss")
plt.xlabel("Epochs")
plt.legend();
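For anyone hitting the same question, a minimal check (a sketch reusing the model_0, loss_fn, X_test, and y_test objects from the notebook linked above) prints the dtypes and computes the loss both with and without the cast:

with torch.inference_mode():
    test_pred = model_0(X_test)

    # Both typically print torch.float32 in this notebook
    print(test_pred.dtype, y_test.dtype)

    # If the dtypes really do match, these two values should be identical
    print(loss_fn(test_pred, y_test))
    print(loss_fn(test_pred, y_test.type(torch.float)))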
Replies: 1 comment

Figured the issue out. It has to do with the fact that you're in inference_mode() while also calling .detach() on the tensors you get from test_loss. Even though gradients aren't being tracked in the first place, I think the combination causes some kind of odd interaction. Getting rid of .detach() and just using .numpy() fixes the issue for me.
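Concretely, that fix would look something like this in the loss-tracking step (a sketch using the notebook's variable names; .item() would work just as well as .numpy() for scalar losses):

with torch.inference_mode():
    test_pred = model_0(X_test)
    test_loss = loss_fn(test_pred, y_test.type(torch.float))

if epoch % 10 == 0:
    epoch_count.append(epoch)
    # The training loss still tracks gradients, so it keeps .detach()
    train_loss_values.append(loss.detach().numpy())
    # test_loss was computed under inference_mode(), so no .detach() is needed
    test_loss_values.append(test_loss.numpy())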