Skip to content

Add eval_batch_size for evaluation#12

Merged
winglian merged 1 commit into
axolotl-ai-cloud:mainfrom
NanoCode012:feat/eval_config
May 7, 2023
Merged

Add eval_batch_size for evaluation#12
winglian merged 1 commit into
axolotl-ai-cloud:mainfrom
NanoCode012:feat/eval_config

Conversation

@NanoCode012

@NanoCode012 NanoCode012 commented May 6, 2023

Copy link
Copy Markdown
Collaborator

Problem:

I would OOM during the evaluation stage despite setting micro_batch_size to be low. Turns out that batch size for eval using a different parameter.

Proposal:

New config for eval stage for lower VRAM GPUs.

Edit:

Since it's now a somewhat required argument, I'm not sure whether we should

  • assert value exist
  • set some default if not in config

@winglian winglian left a comment

Copy link
Copy Markdown
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

thanks!

@winglian winglian merged commit a15d823 into axolotl-ai-cloud:main May 7, 2023
@NanoCode012 NanoCode012 deleted the feat/eval_config branch May 7, 2023 05:48
djsaunde pushed a commit that referenced this pull request Dec 17, 2024
Add eval_batch_size for evaluation
OnePunchMonk added a commit to OnePunchMonk/axolotl that referenced this pull request Mar 22, 2026
- #1: Rename original_ffn -> base_ffn (fixes AttributeError in tests)
- axolotl-ai-cloud#2: Fix device mismatch in collect_mixlora_aux_loss zero tensor
- axolotl-ai-cloud#3: Remove > 0 guard on aux loss (always add and log)
- axolotl-ai-cloud#4: Fix or-fallback bug for falsy numeric config values
- axolotl-ai-cloud#5: Remove unnecessary .clone() in MixLoraFFN.forward
- axolotl-ai-cloud#6: Fix MixLoraExpert.forward return type annotation
- axolotl-ai-cloud#7: Update comments for down_proj delta computation
- axolotl-ai-cloud#8: Add inference guard (eval mode) for MixLoRA in adapter loader
- axolotl-ai-cloud#9: Replace @pytest.mark.skip with @pytest.mark.slow
- axolotl-ai-cloud#10: Add ge/gt field constraints and top_k <= num_experts validator
- axolotl-ai-cloud#11: Block flash_attn_fuse_qkv with mixlora
- axolotl-ai-cloud#12: Add mixlora_state_dict/load_mixlora_state_dict for checkpointing
- axolotl-ai-cloud#13: Fix typo intermmediate -> intermediate
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants