
Adds more stats. #358

Merged
ahmarsuhail merged 8 commits into awslabs:main from ahmarsuhail:iostats
Oct 17, 2025

Collaborator

@ahmarsuhail ahmarsuhail commented Oct 8, 2025

Description of change

Adds support for the following additional statistics:

  • Footer parsing failed
  • Bytes prefetched

I want to add in some more stats around readVectored(), but will do that once the request coalescing PR is in.

Relevant issues

Does this contribution introduce any breaking changes to the existing APIs or behaviors?

Does this contribution introduce any new public APIs or behaviors?

How was the contribution tested?

Does this contribution need a changelog entry?

  • I have updated the CHANGELOG or README if appropriate

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license and I agree to the terms of the Developer Certificate of Origin (DCO).

Contributor

@stubz151 stubz151 left a comment


LGTM, just 2 minor comments.

}

@Test
void testContains() {
Contributor

I don't really get the point of this test

Collaborator Author

Had to add it so that the test coverage check passes.

Math.max(maxReadLength, sequentialReadProgression.getSizeForGeneration(generation));

// Record any range extension due to sequential prefetching
requestCallback.onBlockPrefetch(endPos + 1, truncatePos(pos + maxReadLength - 1));
Contributor

Can you explain this a bit more for me?

Collaborator Author

This is the number of extra bytes AAL requested that the user did not ask for.

The user's request ran up to endPos. AAL then extended the request because of sequential prefetching, so the bytes from endPos + 1 up to the truncated end of the extended range are reported as prefetched bytes.
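
For readers following the thread, the accounting described in this reply can be sketched roughly as below. This is only an illustration, not the code from the PR: the class name, the `prefetchedBytes` helper, and the `contentLength` parameter are hypothetical, and `truncatePos` is assumed here to clamp a position to the last valid byte of the object (the real method takes a single argument and its exact behaviour is not shown in this conversation).

```java
/**
 * Hypothetical sketch of the "bytes prefetched" accounting discussed above.
 * None of these names are taken from the AAL code base.
 */
final class PrefetchAccounting {

    private PrefetchAccounting() {}

    /** Assumed behaviour: clamp a position to the last valid byte offset of the object. */
    static long truncatePos(long pos, long contentLength) {
        return Math.max(0L, Math.min(pos, contentLength - 1));
    }

    /**
     * Number of bytes fetched beyond what the caller asked for.
     *
     * @param pos           first byte of the user's read
     * @param endPos        last byte the user asked for (inclusive)
     * @param maxReadLength read length after the sequential-prefetch extension
     * @param contentLength total object size in bytes
     */
    static long prefetchedBytes(long pos, long endPos, long maxReadLength, long contentLength) {
        // Last byte of the extended request, never past the end of the object.
        long extendedEnd = truncatePos(pos + maxReadLength - 1, contentLength);
        // First byte the user did NOT ask for.
        long prefetchStart = endPos + 1;
        // No extension happened if the extended range ends before that byte.
        if (extendedEnd < prefetchStart) {
            return 0L;
        }
        // Both ends are inclusive, hence the + 1.
        return extendedEnd - prefetchStart + 1;
    }
}
```

Under these assumptions, a user read of bytes 0 to 99 that is extended to a 1024-byte read on a 10,000-byte object counts 924 prefetched bytes; on a 500-byte object the extension is cut off at byte 499, so only 400 bytes are counted.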

@ahmarsuhail ahmarsuhail merged commit 46e7f9e into awslabs:main Oct 17, 2025
4 checks passed
dongjoon-hyun added a commit to apache/spark that referenced this pull request Jan 28, 2026
### What changes were proposed in this pull request?

This PR aims to upgrade `analyticsaccelerator-s3` to 1.3.1 for Apache Spark 4.2.0 in line with Apache Hadoop 3.4.3 (HADOOP-19742).
- apache/hadoop#8093

### Why are the changes needed?

To bring the latest fixes.
- https://github.com/awslabs/analytics-accelerator-s3/releases/tag/v1.3.1
  - awslabs/analytics-accelerator-s3#360
  - awslabs/analytics-accelerator-s3#361
  - awslabs/analytics-accelerator-s3#363
  - awslabs/analytics-accelerator-s3#356
  - awslabs/analytics-accelerator-s3#358

### Does this PR introduce _any_ user-facing change?

No behavior change.

### How was this patch tested?

Pass the CIs.

### Was this patch authored or co-authored using generative AI tooling?

No.

Closes #54031 from dongjoon-hyun/SPARK-55254.

Authored-by: Dongjoon Hyun <dongjoon@apache.org>
Signed-off-by: Dongjoon Hyun <dongjoon@apache.org>