-
Notifications
You must be signed in to change notification settings - Fork 14
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Model Accuracy Overlay - TT-Explorer #1234
Comments
I think that this was planned for February. Changing the milestone. |
example case
sample dump
|
@tapspatel Step 1. is not something that will happed from tt-explorer? |
@vprajapati-tt After a brief chat with @tapspatel, ttrt golden data only works with ttrt run. In order to make it work from explorer we need to add this as an option. My suggestion would be to have a another option on the frontend similar to Optimization Policy. This way we can let the use decide which overlay they want to se after execution (perf or accuracy). |
Both of these overlays should be provided as NodeData after each execution, with an exception being made if GoldenData is not found. |
For step 1, that has to be done by some external tool/framework. For example, a potential user flow
goldenMap = This is embedded directly into the flatbuffer as a map where key=loc and value is array of bytes of the golden data
From what I understand, explorer uses a ttir graph POV. The problem I think is (and correct me if I'm wrong) but explorer needs a way to access the golden tensors. I discussed with Vraj last week and either explorer can load in a flatbuffer and get the golden tensors or it can point to some location in disk when running the model to do the golden comparison. ttrt can provide this support to pass a folder path to look at during golden comparison. Explorer can also create an internal map of Goldens provided the Goldens exist on disk (ie, during loading of ttir graph, load Goldens into My goal is to be able to provide a ttir.mlir file which already has loc data within it into explorer such that you do not have to figure out loc names for yourself. I will also provide golden data for each of the ttir ops saved in both the flatbuffer that is generated from that ttir.mlir file and saved to disk with their name being "loc.pt". Explorer is free to use whichever way to access to the golden data. Summarize
Let me know your thoughts. Fyi, we are also discussing maybe doing a smaller transformer layer only from the llama prefill model since it is ~3000 ops and the point of this demo is to show case how golden accuracy can be debugged. So you can expect us to use a smaller model (ie ~50 ish ops). I stand corrected, @vprajapati-tt is correct in that golden is supported in perf as well as run mode. I had a brain freeze this morning. I had meant to say we will need run mode because the llama model doesn't run in perf mode (due to buffers running out of space on device and we don't have mechanism to flush buffers in ttrt yet) and golden would also work in ttrt run. However, in light of potentially only using a transformer layer in the demo, we should be able to run golden in ttrt perf mode without issue. So if it works on your end in explorer via the ttir_builder.py models with golden verification in explorer, its fine by me. |
The text was updated successfully, but these errors were encountered: