DeciLMCausalModel now reads rope_theta from config.json properly

984ffac
Merged

Fixed Llama-3_1-Nemotron-51B doesn't work when 4K or more tokens #11008
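The diff itself is not shown on this page, so the following is only a minimal sketch of the idea named in the commit title, not the actual change. It assumes `rope_theta` is a top-level key in the model's `config.json` and that a missing key falls back to 10000.0 (a common RoPE base, assumed here). The names `read_rope_theta`, `rope_inv_freq`, and `DEFAULT_ROPE_THETA` are hypothetical.

```python
import json
import tempfile
from pathlib import Path

# Assumed fallback base; not taken from the PR.
DEFAULT_ROPE_THETA = 10000.0


def read_rope_theta(model_dir, default=DEFAULT_ROPE_THETA):
    """Return rope_theta from <model_dir>/config.json, or `default` if the key is absent."""
    config_path = Path(model_dir) / "config.json"
    config = json.loads(config_path.read_text(encoding="utf-8"))
    return float(config.get("rope_theta", default))


def rope_inv_freq(theta, head_dim):
    """Standard RoPE inverse frequencies: theta ** (-2i / head_dim) for i in [0, head_dim / 2)."""
    return [theta ** (-2.0 * i / head_dim) for i in range(head_dim // 2)]


if __name__ == "__main__":
    # Usage: write a throwaway config.json and read the value back.
    with tempfile.TemporaryDirectory() as tmp:
        (Path(tmp) / "config.json").write_text(json.dumps({"rope_theta": 500000.0}))
        theta = read_rope_theta(tmp)
        print(theta, rope_inv_freq(theta, 8))
```

A base that silently falls back to the default when the model was trained with a larger one yields different rotation frequencies, and the mismatch grows with token position. That is one plausible reason a model could behave well on short prompts and degrade at longer contexts, as the PR title describes.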
