[Performance]: Opportunities to speed up BlockPool processing

### Proposal to improve performance

# Observations
The profiler trace under `block_pool.get_new_blocks` looks quite fragmented, and we see several optimization opportunities there:
- https://github.com/vllm-project/vllm/pull/21005 
  - [x] Avoid `__eq__` invocation against the `KVCacheBlock` dataclass
- [WIP] https://github.com/vllm-project/vllm/pull/21222
  - [ ] Introduce bulk append and bulk popleft to avoid unnecessary linked list operations
  - [ ] Avoid `incr_ref` function invocations
  - [ ] Avoid the `self.enable_caching` check in the inner for loop
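
To make the pending items concrete, here is a minimal sketch of what bulk queue operations, an inlined reference-count update, and a hoisted caching check could look like. It is not the code from the linked PR: `Block`, `FreeBlockQueue`, `popleft_n`, `append_n`, and the `evict` callback are hypothetical names chosen for illustration, and the real free-list layout may differ.

```python
from __future__ import annotations

from typing import Callable, Optional


class Block:
    """Hypothetical stand-in for a KV cache block kept in a free list."""

    def __init__(self, block_id: int) -> None:
        self.block_id = block_id
        self.ref_cnt = 0
        self.prev: Optional[Block] = None
        self.next: Optional[Block] = None


class FreeBlockQueue:
    """Doubly linked free list with sentinel head and tail nodes."""

    def __init__(self, blocks: list[Block]) -> None:
        self.head = Block(-1)  # sentinel, never handed out
        self.tail = Block(-1)  # sentinel, never handed out
        self.head.next = self.tail
        self.tail.prev = self.head
        self.num_free = 0
        self.append_n(blocks)

    def popleft_n(self, n: int) -> list[Block]:
        """Detach the first n blocks; the head link is rewired only once."""
        if n > self.num_free:
            raise ValueError("not enough free blocks")
        out: list[Block] = []
        cur = self.head.next
        for _ in range(n):
            nxt = cur.next
            cur.prev = cur.next = None
            out.append(cur)
            cur = nxt
        self.head.next = cur
        cur.prev = self.head
        self.num_free -= n
        return out

    def append_n(self, blocks: list[Block]) -> None:
        """Link all blocks at the tail; the tail link is rewired only once."""
        last = self.tail.prev
        for block in blocks:
            block.prev = last
            last.next = block
            last = block
        last.next = self.tail
        self.tail.prev = last
        self.num_free += len(blocks)


def get_new_blocks(
    queue: FreeBlockQueue,
    n: int,
    enable_caching: bool = False,
    evict: Callable[[Block], None] = lambda block: None,  # hypothetical hook
) -> list[Block]:
    blocks = queue.popleft_n(n)
    if enable_caching:  # checked once, outside the per-block loop
        for block in blocks:
            evict(block)
            block.ref_cnt += 1  # inlined instead of a method call per block
    else:
        for block in blocks:
            block.ref_cnt += 1
    return blocks
```

The point of the bulk variants is that the shared head or tail pointers are updated once per batch rather than once per block, and the per-block Python function calls disappear from the hot loop.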

<img width="1285" height="210" alt="Image" src="https://github.com/user-attachments/assets/c606bdcc-70e8-459e-8333-4cccf8bb392f" />
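
For the completed `__eq__` item, the sketch below shows why comparisons on a dataclass can be costly: by default `@dataclass` generates an `__eq__` that compares all fields, so every `==` is a Python-level method call. The classes here are hypothetical and do not mirror the real `KVCacheBlock` fields; which of the two workarounds the merged PR uses is not stated in this issue.

```python
from dataclasses import dataclass


@dataclass
class Block:
    """Hypothetical block; the generated __eq__ compares every field."""

    block_id: int
    ref_cnt: int = 0


@dataclass(eq=False)
class IdentityBlock:
    """Same fields, but no generated __eq__, so == is identity-based."""

    block_id: int
    ref_cnt: int = 0


a, b = Block(0), Block(0)
same_by_value = a == b      # calls the generated Block.__eq__
same_by_identity = a is b   # plain identity check, no method call

c, d = IdentityBlock(0), IdentityBlock(0)
identity_eq = c == d        # falls back to object identity
```

When the code only needs to know whether two references point at the same block, `is` (or `eq=False`) gives that answer without the field-by-field comparison.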

# Reproduce
```bash
# Shell 1: start the server with the torch profiler enabled
export VLLM_USE_MODELSCOPE=False
export VLLM_TORCH_PROFILER_DIR=~/vllm_profile  # for profiling
vllm serve facebook/opt-125m \
    --swap-space 16 \
    --disable-log-requests \
    --no-enable-prefix-caching \
    --host :: \
    --dtype float16

# Shell 2: once the server is up, run the benchmark with profiling
export VLLM_TORCH_PROFILER_DIR=~/vllm_profile  # for profiling
vllm bench serve \
    --dataset-name random \
    --model facebook/opt-125m \
    --served-model-name facebook/opt-125m \
    --random-input-len 700 \
    --random-output-len 1 \
    --endpoint /v1/completions \
    --ignore-eos \
    --host localhost \
    --port 8000 \
    --request-rate 200 \
    --num-prompts 100 \
    --profile
```

### Report of performance regression

N/A

### Misc discussion on performance

N/A

### Your current environment (if you think it is necessary)

N/A

### Before submitting a new issue...

- [x] Make sure you already searched for relevant issues, and asked the chatbot living at the bottom right corner of the [documentation page](https://docs.vllm.ai/en/latest/), which can answer lots of frequently asked questions.

[Performance]: Opportunities to speed up BlockPool processing #21141
