-
Notifications
You must be signed in to change notification settings - Fork 0
UPSTREAM PR #16941: Model: add openPangu-Embedded #69
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Conversation
|
Access the complete analysis in the LOCI Dashboard Based on my analysis of the performance data and code changes, here's the comprehensive performance impact assessment: Performance Analysis SummaryCritical Function Performance ChangesPrimary Performance Degradation
Secondary Performance Impact
KPI Impact Analysis1. Tokens Per Second ImpactStatus: No Direct Impact Expected Analysis: The degraded functions are located in regex processing components, not in core inference functions:
Conclusion: Based on the reference that 2ms slower 2. Power Consumption ImpactStatus: Minimal Impact Affected Binary:
Root Cause: Increased CPU cycles from inefficient STL container operations in regex processing. 3. Quantization EfficiencyStatus: No Impact Analysis: No changes detected in quantization-related functions:
4. Memory UsageStatus: Potential Indirect Impact Affected Areas:
No Direct Impact on core memory management functions:
5. Batch ProcessingStatus: No Impact Analysis: Core batch processing functions show no performance degradation:
Root Cause AnalysisAssembly-Level Issues
Code Changes ContextThe performance degradation appears unrelated to PR #69 (PanguEmbedded model addition), suggesting:
Action ItemsImmediate Code-Level Actions
Build System Actions
Performance Monitoring Focus
ConclusionThe identified performance regression is isolated to regex processing components and does not directly impact core inference performance metrics. The 0.169% power consumption increase in |
94381d7 to
0eeb29b
Compare
6196a56 to
39290d7
Compare
Mirrored from ggml-org/llama.cpp#16941
Add a new model openPangu-Embedded-1/7B-V1.1.
Yu can get the the model from model path.