[Model] Avoid hardcoding pooling type by DarkLight1337 · Pull Request #32119 · vllm-project/vllm

DarkLight1337 · 2026-01-11T14:14:17Z

Purpose

Gracefully handle user-specified pooling types where possible, instead of silently ignoring them.

In the next PR, we will pass EmbeddingPoolerHead as the head argument so that pooling params are applied correctly.
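
As a rough illustration of the intent (honor a user-specified pooling type, fall back to the model default only when none is given, and fail loudly rather than silently ignore an unusable value), here is a minimal standalone sketch. `SketchPoolerConfig`, `SUPPORTED_TYPES`, and `resolve_seq_pooling_type` are hypothetical names for illustration, not vLLM's actual classes or functions; only the `seq_pooling_type` field name comes from this PR, and the set of supported values is an assumption.

```python
from dataclasses import dataclass
from typing import Optional

# Assumed set of pooling types, for illustration only.
SUPPORTED_TYPES = {"CLS", "MEAN", "LAST"}


@dataclass
class SketchPoolerConfig:
    """Hypothetical stand-in for a pooler config; not vLLM's PoolerConfig."""
    seq_pooling_type: Optional[str] = None


def resolve_seq_pooling_type(config: SketchPoolerConfig, model_default: str) -> str:
    """Prefer the user's choice; use the model default only if none was set."""
    requested = config.seq_pooling_type
    if requested is None:
        return model_default
    requested = requested.upper()
    if requested not in SUPPORTED_TYPES:
        # Fail loudly instead of silently falling back to the default.
        raise ValueError(f"Unsupported seq_pooling_type: {requested!r}")
    return requested
```

With this sketch, a model whose default is `"CLS"` still pools with `"MEAN"` when the user asks for it, which is the behavior a hardcoded pooling type would prevent.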

Note

Enables user-configurable sequence pooling types instead of hardcoded defaults and aligns pooler construction across models.

BERT/BertWithRope: BertPooler now takes seq_pooling_type (via get_seq_pooling_method); pooling initialized from pooler_config.
ModernBert: ModernBertPooler uses configured seq_pooling_type and validates against HF classifier_pooling; sequence classification uses this pooler.
RoBERTa and Transformers mixins: remove explicit CLSPool usage; DispatchPooler.for_seq_cls/for_embedding handle token extraction; minor comment clarifications.
GritLM: use custom GritLMPooler only for MEAN; otherwise defer to generic pooler_for_embed.
SPLADE embedding: note that built-in sequence pooling is overridden by SPLADESparsePooler.

Written by Cursor Bugbot for commit ecb0656.

Note

Enables configurable sequence pooling and aligns pooler construction across models.

BERT/BertWithRope: BertPooler now accepts seq_pooling_type and uses get_seq_pooling_method; BertPoolingModel/BertWithRope initialize pooler from pooler_config.
ModernBert: ModernBertPooler uses provided seq_pooling_type; ModernBertForSequenceClassification validates HF classifier_pooling vs. configured type and wires pooling into DispatchPooler.for_seq_cls.
RoBERTa + transformers mixins: Remove explicit CLSPool; rely on DispatchPooler.for_seq_cls/for_embedding for token extraction; clarify comments.
GritLM: Use custom GritLMMeanPool only when seq_pooling_type == "MEAN"; otherwise defer to get_seq_pooling_method.
SPLADE embedding: Note built-in sequence pooling is overridden by SPLADESparsePooler.

Written by Cursor Bugbot for commit 7c25d15.
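
The ModernBert check described in this note (comparing HF `classifier_pooling` against the configured type) might look roughly like the sketch below. `validate_classifier_pooling` is a hypothetical helper, not a function from this PR. The summary does not say whether a mismatch raises, warns, or is overridden, so raising here is an assumption.

```python
def validate_classifier_pooling(hf_classifier_pooling: str,
                                seq_pooling_type: str) -> str:
    """Return the configured pooling type if it matches the HF config.

    Hypothetical helper: the comparison is case-insensitive because HF
    configs typically use lowercase values such as "cls" or "mean".
    """
    hf_type = hf_classifier_pooling.upper()
    configured = seq_pooling_type.upper()
    if configured != hf_type:
        # Assumption: a mismatch is treated as an error.
        raise ValueError(
            f"seq_pooling_type={configured!r} does not match "
            f"classifier_pooling={hf_type!r} from the HF config"
        )
    return configured
```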

Note

Makes sequence pooling user-configurable and standardizes pooler construction.

BERT/BertWithRope: BertPooler now takes PoolerConfig and uses get_seq_pooling_method; pooling instantiated from pooler_config in BertPoolingModel and BertWithRope (when present)
ModernBert: new ModernBertPooler(config, pooler_config) (uses HF classifier_pooling); ModernBertForSequenceClassification wires this through DispatchPooler.for_seq_cls
RoBERTa + transformers mixins: remove explicit CLSPool; rely on DispatchPooler.for_seq_cls/for_embedding for token extraction; update default attribute to default_seq_pooling_type
GritLM: GritLMPooler uses custom GritLMMeanPool only for MEAN, otherwise defers to get_seq_pooling_method; updates DispatchPooler wiring
SPLADE embedding: note that built-in sequence pooling is overridden by SPLADESparsePooler

Written by Cursor Bugbot for commit 69c8fde.
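
The GritLM dispatch summarized above (custom mean pooling only for MEAN, otherwise the generic method) can be sketched in plain Python as follows. All names here are hypothetical stand-ins: `generic_pool` plays the role of a lookup like get_seq_pooling_method, and `make_custom_mean_pool` stands in for GritLMMeanPool. That the custom mean skips a leading instruction prefix is an assumption made for illustration, as is the `"LAST"` type.

```python
from typing import Callable, List, Sequence

Vector = List[float]
PoolFn = Callable[[Sequence[Vector]], Vector]


def generic_pool(seq_pooling_type: str) -> PoolFn:
    """Hypothetical generic lookup covering the non-MEAN cases."""
    if seq_pooling_type == "CLS":
        return lambda tokens: list(tokens[0])   # first token
    if seq_pooling_type == "LAST":
        return lambda tokens: list(tokens[-1])  # last token
    raise ValueError(f"Unsupported seq_pooling_type: {seq_pooling_type!r}")


def make_custom_mean_pool(skip_prefix: int) -> PoolFn:
    """Hypothetical custom mean: average tokens after a skipped prefix."""
    def pool(tokens: Sequence[Vector]) -> Vector:
        kept = tokens[skip_prefix:]
        dim = len(kept[0])
        return [sum(tok[i] for tok in kept) / len(kept) for i in range(dim)]
    return pool


def select_pool(seq_pooling_type: str, skip_prefix: int = 0) -> PoolFn:
    """Use the custom pooler only for MEAN; defer to the generic one otherwise."""
    if seq_pooling_type == "MEAN":
        return make_custom_mean_pool(skip_prefix)
    return generic_pool(seq_pooling_type)
```

For example, with three token vectors `[[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]]`, `select_pool("MEAN", skip_prefix=1)` averages only the last two tokens, while `select_pool("CLS")` returns the first token unchanged.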