Skip to content

Add E2E test for MongoDBSearch autoEmbedding support#704

Merged
viveksinghggits merged 7 commits intomasterfrom
e2e-vector-auto-embedding
Jan 19, 2026
Merged

Add E2E test for MongoDBSearch autoEmbedding support#704
viveksinghggits merged 7 commits intomasterfrom
e2e-vector-auto-embedding

Conversation

@viveksinghggits
Copy link
Collaborator

@viveksinghggits viveksinghggits commented Jan 14, 2026

Summary

This PR adds E2E test for auto embedding support for vector search that is added as part of the PR #680.

Proof of Work

https://spruce.mongodb.com/version/69695255ad5383000708cb28/tasks?sorts=STATUS%3AASC%3BBASE_STATUS%3ADESC

Checklist

  • Have you linked a jira ticket and/or is the ticket in the title?
  • Have you checked whether your jira ticket required DOCSP changes?
  • Have you added changelog file?

@github-actions
Copy link

github-actions bot commented Jan 14, 2026

⚠️ (this preview might not be accurate if the PR is not rebased on current master branch)

MCK 1.7.0 Release Notes

New Features

  • Added support for auto embeddings in MongoDB Community to automatically generate vector embeddings for the vector search data. This document can be followed for detailed documentation
  • MongoDBSearch: Updated the default mongodb/mongodb-search image version to 0.60.1. This is the version MCK uses if .spec.version is not specified.

Bug Fixes

  • Fix an issue to ensure that hosts are consistently removed from Ops Manager monitoring during AppDB scale-down events.
  • Fixed an issue where monitoring agents would fail after disabling TLS on a MongoDB deployment.
  • Persistent Volume Claim resize fix: Fixed an issue where the Operator ignored namespaces when listing PVCs, causing conflicts with resizing PVCs of the same name. Now, PVCs are filtered by both name and namespace for accurate resizing.
  • Fixed a panic that occurred when the domain names for a horizon was empty. Now, if the domain names are not valid (RFC 1123), the validation will fail before reconciling.
  • MongoDBMultiCluster, MongoDB: Fix an issue where the operator skipped host removal when an external domain was used, leaving monitoring hosts in Ops Manager even after workloads were correctly removed from the cluster.
  • Fixed an issue where the Operator could crash when TLS certificates are configured using the certificatesSecretsPrefix field without additional TLS settings.

@viveksinghggits viveksinghggits force-pushed the e2e-vector-auto-embedding branch from 641b283 to 62f1de8 Compare January 15, 2026 17:15
@viveksinghggits viveksinghggits added the skip-changelog Use this label in Pull Request to not require new changelog entry file label Jan 15, 2026
@viveksinghggits viveksinghggits marked this pull request as ready for review January 15, 2026 21:21
@viveksinghggits viveksinghggits requested a review from a team as a code owner January 15, 2026 21:21
except pymongo.errors.PyMongoError as e:
logger.debug(f"error: {e}")

return count > 0, status_msg
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

q: is it reasonable to make this assertion a bit stronger? I.e. is the count deterministic and the output of status_msg?

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yes, this is reasonable. The response of the returned data might change because of the algorithm that vector search uses, but we can make the document count in response deterministic by passing the number of documents that we want in response. I have done that.
Since the documents that returned in the response might change, I think it won't be a good idea to actually assert the returned data.

]
)

def assert_auto_emb_vector_search_query(self, retry_timeout: int = 1):
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

q: shouldn't we make the default timeout a bit higher? I see it is invoked with 60 seconds.

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

In most of the places when we wait for index to be ready, we just sleep for 60 seconds, but I have increased this to 90 seconds.

1. Verify the number of returns documents for auto embed vector search query
2. Increase the timeout to get search data to 90 seconds
Copy link
Contributor

@lsierant lsierant left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM!

@viveksinghggits viveksinghggits merged commit 306d600 into master Jan 19, 2026
31 of 34 checks passed
@viveksinghggits viveksinghggits deleted the e2e-vector-auto-embedding branch January 19, 2026 09:55
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

skip-changelog Use this label in Pull Request to not require new changelog entry file

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants

Comments