Skip to content

Remote replication and recovery changes#20424

Merged
bharath-techie merged 26 commits intoopensearch-project:feature/datafusionfrom
raghuvanshraj:feature/remote-replication
Jan 20, 2026
Merged

Remote replication and recovery changes#20424
bharath-techie merged 26 commits intoopensearch-project:feature/datafusionfrom
raghuvanshraj:feature/remote-replication

Conversation

@raghuvanshraj
Copy link
Contributor

@raghuvanshraj raghuvanshraj commented Jan 15, 2026

Description

Known bug in this PR: Checkpoint tracking during remote recovery is buggy, leading to segments getting recreated from translog.

Related Issues

Resolves #[Issue number to be closed when this PR is merged]

Check List

  • Functionality includes testing.
  • API changes companion pull request created, if applicable.
  • Public documentation issue/PR created, if applicable.

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.
For more information on following Developer Certificate of Origin and signing off your commits, please check here.

@coderabbitai
Copy link
Contributor

coderabbitai bot commented Jan 15, 2026

Important

Review skipped

Auto reviews are disabled on base/target branches other than the default branch.

Please check the settings in the CodeRabbit UI or the .coderabbit.yaml file in this repository. To trigger a single review, invoke the @coderabbitai review command.

You can disable this status message by setting the reviews.review_status to false in the CodeRabbit configuration file.


Comment @coderabbitai help to get the list of available commands and usage tips.

@github-actions
Copy link
Contributor

❌ Gradle check result for 1f3aa80: FAILURE

Please examine the workflow log, locate, and copy-paste the failure(s) below, then iterate to green. Is the failure a flaky test unrelated to your change?

@github-actions
Copy link
Contributor

❌ Gradle check result for 31041da: FAILURE

Please examine the workflow log, locate, and copy-paste the failure(s) below, then iterate to green. Is the failure a flaky test unrelated to your change?

@github-actions
Copy link
Contributor

❌ Gradle check result for 839a560: FAILURE

Please examine the workflow log, locate, and copy-paste the failure(s) below, then iterate to green. Is the failure a flaky test unrelated to your change?

@github-actions
Copy link
Contributor

❌ Gradle check result for b576be5: FAILURE

Please examine the workflow log, locate, and copy-paste the failure(s) below, then iterate to green. Is the failure a flaky test unrelated to your change?

@raghuvanshraj raghuvanshraj force-pushed the feature/remote-replication branch from b576be5 to d00f2b9 Compare January 16, 2026 13:01
@github-actions
Copy link
Contributor

❌ Gradle check result for d00f2b9: FAILURE

Please examine the workflow log, locate, and copy-paste the failure(s) below, then iterate to green. Is the failure a flaky test unrelated to your change?

@github-actions
Copy link
Contributor

❌ Gradle check result for f62dce8: FAILURE

Please examine the workflow log, locate, and copy-paste the failure(s) below, then iterate to green. Is the failure a flaky test unrelated to your change?

Signed-off-by: Raghuvansh Raj <raghraaj@amazon.com>
Signed-off-by: Raghuvansh Raj <raghraaj@amazon.com>
Kamal Nayan and others added 16 commits January 19, 2026 16:52
Signed-off-by: Raghuvansh Raj <raghraaj@amazon.com>
Signed-off-by: Raghuvansh Raj <raghraaj@amazon.com>
…enabled to avoid client side encryption (opensearch-project#19630)

Signed-off-by: Pranit Kumar <pranikum@amazon.com>
Signed-off-by: Raghuvansh Raj <raghraaj@amazon.com>
Signed-off-by: Raghuvansh Raj <raghraaj@amazon.com>
…ecovery

Signed-off-by: Raghuvansh Raj <raghraaj@amazon.com>
Signed-off-by: Raghuvansh Raj <raghraaj@amazon.com>
@raghuvanshraj raghuvanshraj force-pushed the feature/remote-replication branch from dda84c3 to 760f2c3 Compare January 19, 2026 11:22
@github-actions
Copy link
Contributor

❌ Gradle check result for 760f2c3: FAILURE

Please examine the workflow log, locate, and copy-paste the failure(s) below, then iterate to green. Is the failure a flaky test unrelated to your change?

Signed-off-by: Raghuvansh Raj <raghraaj@amazon.com>
@github-actions
Copy link
Contributor

❌ Gradle check result for 9926737: FAILURE

Please examine the workflow log, locate, and copy-paste the failure(s) below, then iterate to green. Is the failure a flaky test unrelated to your change?

@github-actions
Copy link
Contributor

❌ Gradle check result for 817c024: FAILURE

Please examine the workflow log, locate, and copy-paste the failure(s) below, then iterate to green. Is the failure a flaky test unrelated to your change?

@github-actions
Copy link
Contributor

❌ Gradle check result for 76fc740: FAILURE

Please examine the workflow log, locate, and copy-paste the failure(s) below, then iterate to green. Is the failure a flaky test unrelated to your change?

@github-actions
Copy link
Contributor

❌ Gradle check result for dd5463e: FAILURE

Please examine the workflow log, locate, and copy-paste the failure(s) below, then iterate to green. Is the failure a flaky test unrelated to your change?

…port on parquet files

Signed-off-by: Raghuvansh Raj <raghraaj@amazon.com>
@bharath-techie bharath-techie merged commit b4ff131 into opensearch-project:feature/datafusion Jan 20, 2026
8 of 30 checks passed
@github-actions
Copy link
Contributor

❌ Gradle check result for 7341e0f: null

Please examine the workflow log, locate, and copy-paste the failure(s) below, then iterate to green. Is the failure a flaky test unrelated to your change?

abhita pushed a commit to abhita/OpenSearch that referenced this pull request Jan 26, 2026
* Changes to get remote upload and replication to work for lucene indices

Signed-off-by: Raghuvansh Raj <raghraaj@amazon.com>

* Replication changes

Signed-off-by: Raghuvansh Raj <raghraaj@amazon.com>

* Fixes for remote store integration (opensearch-project#20325)

Signed-off-by: Mohit Godwani <mgodwan@amazon.com>

* Updated and Refactored the code to have only required changes for recovery

* Removed extra logging lines

* Resolving PR comments

* Updated the VSRManagerTests to use ParquetFileMetadata as flushResult

* Fixing RemoteSegmentStoreDirectoryTests and RemoteSegmentStoreDirectoryWithPinnedTimestampTests

* Fix for merged files not getting uploaded/deleted

Signed-off-by: Raghuvansh Raj <raghraaj@amazon.com>

* Added support for SSE KMS testing via run.gradle

Signed-off-by: Raghuvansh Raj <raghraaj@amazon.com>

* [RemoteStore] Add support for repository with server side encryption enabled to avoid client side encryption  (opensearch-project#19630)

Signed-off-by: Pranit Kumar <pranikum@amazon.com>

* Adding support for SSE KMS in RemoteSegmentStoreDirectoryFactory

Signed-off-by: Raghuvansh Raj <raghraaj@amazon.com>

* Fixing lucene remote recovery and files should not get written in replicas

* Removing extra logging statements

* Revert "Fixing lucene remote recovery and files should not get written in replicas"

This reverts commit 1f3aa80.

* Using NIOFSDirectory in GenericStoreDirectory

Signed-off-by: Raghuvansh Raj <raghraaj@amazon.com>

* Adding NRTReplicationCompositeEngine for checkpoint tracking during recovery

Signed-off-by: Raghuvansh Raj <raghraaj@amazon.com>

* Using index input for checksum calculation instead of input stream

Signed-off-by: Raghuvansh Raj <raghraaj@amazon.com>

* Fixing lucene remote recovery and files should not get written in replicas

* Fixed RemoteIndexRecoveryIT::testRerouteRecovery

* Removed the isReadonly check from compositeEngine as we have the NRTEngine now

* Removing override for getHistoryUUID in NRTReplicationCompositeEngine

Signed-off-by: Raghuvansh Raj <raghraaj@amazon.com>

* Updated the IndexFileDeleterTests to use absolute file path

* Refactoring syncSegmentsFromGivenRemoteSegmentStore and syncSegmentsFromRemoteSegmentStore APIs

* Made deleteUnrefrencedFiles api more generic

* Modifying UUID to be added before file name extension for catalog support on parquet files

Signed-off-by: Raghuvansh Raj <raghraaj@amazon.com>

---------

Signed-off-by: Raghuvansh Raj <raghraaj@amazon.com>
Signed-off-by: Mohit Godwani <mgodwan@amazon.com>
Signed-off-by: Pranit Kumar <pranikum@amazon.com>
Co-authored-by: Mohit Godwani <81609427+mgodwan@users.noreply.github.com>
Co-authored-by: Kamal Nayan <askkamal@amazon.com>
Co-authored-by: pranikum <kpranit81@gmail.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants

Comments