Skip to content

Conversation

@Lunderberg
Copy link

The create_shard_func produces both the tir.PrimFunc that performs the sharding and a relax.Function wrapper that calls into it. This separates out the two functions, such that the tir.PrimFunc can be generated without also generating the relax.Function wrapper.

Lunderberg pushed a commit to Lunderberg/mlc-llm that referenced this pull request Apr 1, 2024
@Lunderberg Lunderberg force-pushed the lunderberg/expose_sharding_primfunc_generation branch from a42491e to 2d2c253 Compare April 9, 2024 15:39
@Lunderberg
Copy link
Author

Rebased onto mlc-serve-v0.2.0 to resolve conflict.

The `create_shard_func` produces both the `tir.PrimFunc` that performs
the sharding and a `relax.Function` wrapper that calls into it.  This
separates out the two functions, such that the `tir.PrimFunc` can be
generated without also generating the `relax.Function` wrapper.
@Lunderberg Lunderberg force-pushed the lunderberg/expose_sharding_primfunc_generation branch from 2d2c253 to f1bddee Compare April 12, 2024 13:04
@sunggg sunggg changed the title [MLCChat] Separate function for generating sharding PrimFunc [MLC-LLM] Separate function for generating sharding PrimFunc Apr 12, 2024
@sunggg sunggg merged commit 2995e1c into mlc-serve-v0.2.0 Apr 12, 2024
@Lunderberg Lunderberg deleted the lunderberg/expose_sharding_primfunc_generation branch April 12, 2024 15:53
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants