[GPTNeoX] Faster rotary embedding for GPTNeoX (based on llama changes)
#25830
+79
−54