Skip to content

feature dump into main#9

Merged
winglian merged 6 commits into
mainfrom
dev
Apr 25, 2023
Merged

feature dump into main#9
winglian merged 6 commits into
mainfrom
dev

Conversation

@winglian

Copy link
Copy Markdown
Collaborator
  • support for Galactica @ehartford
  • support specific data files from hf datasets
  • drops data points from dataset if input, attention mask and label tensors don't match in length
  • ignore system prompt from sharegpt dataset
  • cache prepared datasets right after tokenization to allow for more reconfiguration without rebuilding cache
  • support dataset shards for training
  • require explicit 8bit Adam in config for optimizer

@winglian winglian merged commit 4e705ed into main Apr 25, 2023
@ehartford

Copy link
Copy Markdown
Contributor

Thank you!

@ehartford

Copy link
Copy Markdown
Contributor

djsaunde pushed a commit that referenced this pull request Dec 17, 2024
feature dump into main
OnePunchMonk added a commit to OnePunchMonk/axolotl that referenced this pull request Mar 22, 2026
- #1: Rename original_ffn -> base_ffn (fixes AttributeError in tests)
- axolotl-ai-cloud#2: Fix device mismatch in collect_mixlora_aux_loss zero tensor
- axolotl-ai-cloud#3: Remove > 0 guard on aux loss (always add and log)
- axolotl-ai-cloud#4: Fix or-fallback bug for falsy numeric config values
- axolotl-ai-cloud#5: Remove unnecessary .clone() in MixLoraFFN.forward
- axolotl-ai-cloud#6: Fix MixLoraExpert.forward return type annotation
- axolotl-ai-cloud#7: Update comments for down_proj delta computation
- axolotl-ai-cloud#8: Add inference guard (eval mode) for MixLoRA in adapter loader
- axolotl-ai-cloud#9: Replace @pytest.mark.skip with @pytest.mark.slow
- axolotl-ai-cloud#10: Add ge/gt field constraints and top_k <= num_experts validator
- axolotl-ai-cloud#11: Block flash_attn_fuse_qkv with mixlora
- axolotl-ai-cloud#12: Add mixlora_state_dict/load_mixlora_state_dict for checkpointing
- axolotl-ai-cloud#13: Fix typo intermmediate -> intermediate
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants