Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

About the FPGA Performance Metrics #36

Open
SinnLiu opened this issue Jul 31, 2024 · 2 comments
Open

About the FPGA Performance Metrics #36

SinnLiu opened this issue Jul 31, 2024 · 2 comments

Comments

@SinnLiu
Copy link

SinnLiu commented Jul 31, 2024

Hi, I have a question about FPGA Performance Metrics. In the "The single core implementation" part of section 5.2, which show above

Table 3. Given the 370M parameter model where L = 24, d = 512, the total projected runtime is 16.08ms, and a throughput of approximately 62 tokens per second. The 1.3B parameter model, where L = 24 and d = 2048, has a projected runtime of 42ms, and a throughput of 23.8 tokens per second.

But as shown in Table 3, the total projected runtime is 43ms where L = 24, d = 512, and the same as the case of L = 24 and d = 2048. Is there something wrong with my understanding?It would be my honor if you could reply.

@SinnLiu
Copy link
Author

SinnLiu commented Aug 15, 2024

So, why won't anyone answer my question?

@ridgerchu
Copy link
Owner

Hello,
I believe our FPGA team will try to figure that out, but it seems they haven't replied yet... @sifferman could you please help solve this issue if possible?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants