Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Unable to reproduce the example #5

Open
arn4 opened this issue Sep 12, 2024 · 2 comments
Open

Unable to reproduce the example #5

arn4 opened this issue Sep 12, 2024 · 2 comments

Comments

@arn4
Copy link

arn4 commented Sep 12, 2024

I was trying to reproduce the code in the example of the README file, but it crashes once the request is sent.

In particular, on the second machine I get this error message:

Received tensor request
Error processing tensor: [reshape] Cannot reshape array of size 163840 into shape (1,40,8192).

After a quick look at the code, it seems to me that the function Byte_to_tensor is outputting a mx.array twice as long as it should be, but I was unable to find the error.

Do you have any idea why this is happening?

I've also tried with different models, but the same error is happining (the numbers are different, but the tensor that needs to be reshaped is always long the double as it should be)

@mzbac
Copy link
Owner

mzbac commented Sep 12, 2024

It looks like there are some layer mismatch somehow. Would you be able to install the mlx-sharding and try the example at https://github.com/mzbac/mlx_sharding?tab=readme-ov-file#running-the-servers ?

@arn4
Copy link
Author

arn4 commented Sep 12, 2024

Ok, using the first 2 commands it works.

I've tested both the model of the example and a larger one.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants