
Conversation


nikhil-zlai (Contributor) commented Jul 16, 2025

Summary

Most of our test flakiness is due to partitions containing no data.
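
To make that failure mode concrete, here is a small, hypothetical Spark snippet (not taken from this PR; names and numbers are illustrative) showing how sparsely generated test data leaves some date partitions with no rows at all:

```scala
import org.apache.spark.sql.SparkSession

// Illustrative only: 20 rows scattered over a 30-day range leave several days
// with no data, so any test reading a specific day's partition can hit emptiness.
object EmptyPartitionSketch {
  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder().master("local[*]").appName("empty-partition-sketch").getOrCreate()
    import spark.implicits._

    val df = (1 to 20).map(i => (i, f"2025-07-${(i * 7) % 30 + 1}%02d")).toDF("id", "ds")
    val daysWithData = df.select("ds").distinct().count()
    println(s"days with data: $daysWithData of 30") // fewer than 30 => some partitions are empty
    spark.stop()
  }
}
```

Bumping row counts, as this PR does in several tests, makes such gaps far less likely.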

Checklist

  • Added Unit Tests
  • Covered by existing CI
  • Integration tested
  • Documentation update

Summary by CodeRabbit

  • Refactor

    • Improved validation for join operations to ensure data is present before proceeding.
    • Simplified log messages for empty join results.
    • Cleaned up import statements in tests.
    • Optimized aggregation logic for handling empty inputs by reducing redundant computations.
    • Removed deprecated label join functionality and associated tests to streamline the codebase.
  • Tests

    • Increased data volumes and adjusted partition counts in several test cases to enhance test coverage and robustness.
    • Enhanced test data generation by increasing row counts significantly for more comprehensive testing.
    • Strengthened result verification in performance tests with stricter equality checks.
    • Added new test suites verifying derivation logging, contextual and group-by derivations.
    • Introduced multiple fetcher test suites for deterministic, generated, tiled, and UniqueTopK temporal fetch join scenarios.
    • Removed an older fetcher test suite to consolidate testing approach.
    • Removed legacy label join and feature-label join test suites.
    • Centralized test data setup in migration tests to improve efficiency.
    • Simplified certain test cases by removing label join computations and adjusting data volumes.
    • Added debug output in failure test to aid troubleshooting.
  • Bug Fixes

    • Added assertion to ensure consistency between timestamp arrays and intermediate results during join operations.
  • Chores

    • Removed deprecated aggregation user-defined function to streamline aggregation logic.
    • Improved build tooling to dynamically locate jar executable.


coderabbitai bot commented Jul 16, 2025

Walkthrough

The changes add explicit validation that the left DataFrame is present in the join logic, simplify a log message, significantly increase test data volumes, clean up imports, optimize aggregation logic, remove a UDF wrapper, add an assertion on window lengths, and tighten test result verification. Several new fetcher test suites replace a deleted comprehensive fetcher test, and a new derivation logging test suite is added. The build rule for jar creation is also improved.

Changes

| File(s) | Change Summary |
|---|---|
| spark/src/main/scala/ai/chronon/spark/JoinBase.scala | Adds explicit check for non-empty left DataFrame before join; throws error if missing. |
| spark/src/main/scala/ai/chronon/spark/JoinUtils.scala | Simplifies a log message when left side query yields zero rows. |
| spark/src/test/scala/ai/chronon/spark/test/batch/ModularJoinTest.scala | Increases test data volumes and adjusts partition counts for multiple tables. |
| spark/src/test/scala/ai/chronon/spark/test/fetcher/FetcherTestUtil.scala | Adds compareTemporalFetch method; multiplies test data row count by 50; reduces default keys. |
| spark/src/test/scala/ai/chronon/spark/test/join/HeterogeneousPartitionColumnsTest.scala | Sets rowCount to 10,000 for test data; removes redundant imports. |
| spark/src/test/scala/ai/chronon/spark/test/join/NoHistoricalBackfillTest.scala | Increases entity row count to 10,000 for weight and country datasets. |
| aggregator/src/main/scala/ai/chronon/aggregator/windowing/SawtoothAggregator.scala | Optimizes cumulateAndFinalizeSorted for empty inputs; simplifies IR cloning and finalizing. |
| spark/src/main/scala/ai/chronon/spark/join/AggregationInfo.scala | Removes aggregatingUdf lazy val from AggregationInfo case class. |
| spark/src/main/scala/ai/chronon/spark/join/SawtoothUdf.scala | Adds assertion checking headStartTimes and nonRealtimeIrs length equality. |
| spark/src/test/scala/ai/chronon/spark/test/join/SawtoothUdfPerformanceTest.scala | Moves orderBy after dropDuplicates; replaces fuzzy result comparison with strict equality. |
| tools/build_rules/thrift/thrift.bzl | Dynamically sets jar binary path from env var; changes jar command construction to list format. |
| spark/src/test/scala/ai/chronon/spark/test/analyzer/DerivationBootstrapTest.scala | Renames class; removes extensive derivation and logging tests and helpers. |
| spark/src/test/scala/ai/chronon/spark/test/analyzer/DerivationLoggingTest.scala | Adds new test suite for derivation logging, contextual and group-by derivations. |
| spark/src/test/scala/ai/chronon/spark/test/fetcher/FetcherTest.scala | Deletes comprehensive fetcher test suite with multiple test cases and utilities. |
| spark/src/test/scala/ai/chronon/spark/test/fetcher/FetcherDeterministicTest.scala | Adds test for deterministic temporal fetch join using mutation data. |
| spark/src/test/scala/ai/chronon/spark/test/fetcher/FetcherGeneratedTest.scala | Adds test for temporal fetch join using generated random data. |
| spark/src/test/scala/ai/chronon/spark/test/fetcher/FetcherTiledTest.scala | Adds test for temporal tiled fetch join deterministic behavior. |
| spark/src/test/scala/ai/chronon/spark/test/fetcher/FetcherUniqueTopKTest.scala | Adds test for deterministic temporal fetch join with UniqueTopK struct. |
| spark/src/test/scala/ai/chronon/spark/test/join/EventsEntitiesSnapshotTest.scala | Increases test data volumes by 10x for entities and events; adds blank line in SQL string. |
| spark/src/main/scala/ai/chronon/spark/Driver.scala | Removes LabelJoin object and related subcommand handling. |
| spark/src/main/scala/ai/chronon/spark/LabelJoin.scala | Deletes LabelJoin class and all related label join computation logic. |
| spark/src/main/scala/ai/chronon/spark/batch/LabelJoinV2.scala | Deletes LabelJoinV2 class and related windowed label join logic. |
| spark/src/test/scala/ai/chronon/spark/test/batch/LabelJoinV2Test.scala | Deletes LabelJoinV2Test suite and all test cases. |
| spark/src/test/scala/ai/chronon/spark/test/batch/ShortNamesTest.scala | Removes label join and backfill logic from one test; adjusts data volumes in another. |
| spark/src/test/scala/ai/chronon/spark/test/join/FeatureWithLabelJoinTest.scala | Deletes FeatureWithLabelJoinTest suite and all test methods. |
| spark/src/test/scala/ai/chronon/spark/test/join/LabelJoinTest.scala | Deletes LabelJoinTest suite and all test methods. |
| spark/src/test/scala/ai/chronon/spark/test/batch/EvalTest.scala | Reduces test data volumes and partitions for entities and events. |
| spark/src/test/scala/ai/chronon/spark/test/MigrationCompareTest.scala | Moves test data setup to class initialization; removes redundant setup calls per test. |
| spark/src/test/scala/ai/chronon/spark/test/fetcher/FetcherFailureTest.scala | Adds print statement for responseMap in partial failure test case. |

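As a rough illustration of the new invariant listed for SawtoothUdf.scala above (the real method signature and types in the repo likely differ; this is only the shape of the check):

```scala
object SawtoothInvariantSketch {
  // Hypothetical shape of the check: head-start timestamps and batch (non-realtime) IRs
  // must line up one-to-one before they are zipped together for aggregation.
  def mergeHeadStarts[IR](headStartTimes: Seq[Long], nonRealtimeIrs: Seq[IR]): Seq[(Long, IR)] = {
    assert(
      headStartTimes.length == nonRealtimeIrs.length,
      s"headStartTimes (${headStartTimes.length}) != nonRealtimeIrs (${nonRealtimeIrs.length})"
    )
    headStartTimes.zip(nonRealtimeIrs)
  }
}
```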
Sequence Diagram(s)

```mermaid
sequenceDiagram
    participant Caller
    participant JoinBase
    participant JoinUtils

    Caller->>JoinBase: computeJoinOpt()
    JoinBase->>JoinBase: leftDataOpt = leftDf(...)
    JoinBase->>JoinBase: require(leftDataOpt.isDefined)
    alt leftDataOpt is empty
        JoinBase-->>Caller: throw error "No data for left side"
    else leftDataOpt is present
        JoinBase->>JoinUtils: runSmallMode(leftData)
        JoinUtils-->>JoinBase: result
        JoinBase-->>Caller: result
    end
```

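A minimal sketch of the fail-fast check sequenced above, assuming `leftDf(...)` resolves to an `Option[DataFrame]`; the helper names mirror the diagram and are not the exact JoinBase code:

```scala
import org.apache.spark.sql.DataFrame

// Illustrative stand-in for the flow in the diagram: refuse to proceed when the left
// side produced no data, instead of silently joining against an empty frame.
object JoinGuardSketch {
  def computeJoin(leftDataOpt: Option[DataFrame])(runSmallMode: DataFrame => DataFrame): DataFrame = {
    require(leftDataOpt.isDefined, "No data for left side of the join; check the left source and partition range")
    runSmallMode(leftDataOpt.get)
  }
}
```
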

Suggested reviewers

  • piyush-zlai
  • varant-zlai

Poem

Data swells, joins align,
Checks ensure the flow's divine.
Logs trimmed neat, tests expand,
New fetchers take their stand.
Assertions guard the join's grace,
Exact matches set the pace.
Code refined, we celebrate! 🎉🐇✨


📜 Recent review details

Configuration used: CodeRabbit UI
Review profile: CHILL
Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 91e9029 and 4d5decc.

📒 Files selected for processing (2)
  • spark/src/test/scala/ai/chronon/spark/test/MigrationCompareTest.scala (2 hunks)
  • spark/src/test/scala/ai/chronon/spark/test/fetcher/FetcherFailureTest.scala (1 hunks)
✅ Files skipped from review due to trivial changes (1)
  • spark/src/test/scala/ai/chronon/spark/test/fetcher/FetcherFailureTest.scala
🧰 Additional context used
🧠 Learnings (1)
spark/src/test/scala/ai/chronon/spark/test/MigrationCompareTest.scala (7)
Learnt from: piyush-zlai
PR: zipline-ai/chronon#33
File: cloud_aws/src/test/scala/ai/chronon/integrations/aws/DynamoDBKVStoreTest.scala:175-175
Timestamp: 2024-10-07T15:09:51.567Z
Learning: Hardcoding future timestamps in tests within `DynamoDBKVStoreTest.scala` is acceptable when data is generated and queried within the same time range, ensuring the tests remain valid over time.
Learnt from: piyush-zlai
PR: zipline-ai/chronon#44
File: hub/app/controllers/ModelController.scala:15-18
Timestamp: 2024-10-17T19:46:42.629Z
Learning: References to `MockDataService` in `hub/test/controllers/SearchControllerSpec.scala` and `hub/test/controllers/ModelControllerSpec.scala` are needed for tests and should not be removed.
Learnt from: nikhil-zlai
PR: zipline-ai/chronon#70
File: service/src/main/java/ai/chronon/service/ApiProvider.java:6-6
Timestamp: 2024-12-03T04:04:33.809Z
Learning: The import `scala.util.ScalaVersionSpecificCollectionsConverter` in `service/src/main/java/ai/chronon/service/ApiProvider.java` is correct and should not be flagged in future reviews.
Learnt from: chewy-zlai
PR: zipline-ai/chronon#50
File: spark/src/test/scala/ai/chronon/spark/test/MockKVStore.scala:19-28
Timestamp: 2024-10-31T18:29:45.027Z
Learning: In `MockKVStore` located at `spark/src/test/scala/ai/chronon/spark/test/MockKVStore.scala`, the `multiPut` method is intended to be a simple implementation without dataset existence validation, duplicate validation logic elimination, or actual storage of key-value pairs for verification.
Learnt from: piyush-zlai
PR: zipline-ai/chronon#43
File: hub/app/controllers/TimeSeriesController.scala:320-320
Timestamp: 2024-10-14T18:44:24.599Z
Learning: In `hub/app/controllers/TimeSeriesController.scala`, the `generateMockTimeSeriesPercentilePoints` method contains placeholder code that will be replaced with the actual implementation soon.
Learnt from: nikhil-zlai
PR: zipline-ai/chronon#793
File: spark/src/main/scala/ai/chronon/spark/join/UnionJoin.scala:95-106
Timestamp: 2025-05-25T15:57:30.687Z
Learning: Spark SQL's array_sort function requires INT casting in comparator expressions, even for timestamp differences. LONG casting is not supported in this context despite potential overflow concerns.
Learnt from: chewy-zlai
PR: zipline-ai/chronon#62
File: spark/src/main/scala/ai/chronon/spark/stats/drift/SummaryUploader.scala:9-10
Timestamp: 2024-11-06T21:54:56.160Z
Learning: In Spark applications, when defining serializable classes, passing an implicit `ExecutionContext` parameter can cause serialization issues. In such cases, it's acceptable to use `scala.concurrent.ExecutionContext.Implicits.global`.
⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (17)
  • GitHub Check: cloud_aws_tests
  • GitHub Check: cloud_gcp_tests
  • GitHub Check: online_tests
  • GitHub Check: service_commons_tests
  • GitHub Check: service_tests
  • GitHub Check: aggregator_tests
  • GitHub Check: flink_tests
  • GitHub Check: api_tests
  • GitHub Check: scala_compile_fmt_fix
  • GitHub Check: analyzer_tests
  • GitHub Check: groupby_tests
  • GitHub Check: fetcher_tests
  • GitHub Check: join_tests
  • GitHub Check: streaming_tests
  • GitHub Check: batch_tests
  • GitHub Check: spark_tests
  • GitHub Check: enforce_triggered_workflows
🔇 Additional comments (3)
spark/src/test/scala/ai/chronon/spark/test/MigrationCompareTest.scala (3)

31-31: LGTM: Proper import for test lifecycle management.


34-34: LGTM: Correct use of BeforeAndAfterAll for one-time setup.


43-43: LGTM: Efficient test data initialization pattern.

Moving expensive test data setup to class initialization eliminates redundant setup calls and should improve test reliability.
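
For readers unfamiliar with the pattern being praised here, a rough ScalaTest sketch of one-time suite setup (class and field names are made up; the actual MigrationCompareTest builds Spark tables rather than this placeholder):

```scala
import org.scalatest.BeforeAndAfterAll
import org.scalatest.flatspec.AnyFlatSpec

class OneTimeSetupSketch extends AnyFlatSpec with BeforeAndAfterAll {
  // Populated once for the whole suite instead of once per test case.
  private var sharedRows: Seq[Int] = Seq.empty

  override def beforeAll(): Unit = {
    sharedRows = 1 to 1000 // stand-in for expensive test data creation
  }

  "a migration-style suite" should "see the shared data in every test" in {
    assert(sharedRows.nonEmpty)
  }
}
```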


🪧 Tips

Chat

There are 3 ways to chat with CodeRabbit:

  • Review comments: Directly reply to a review comment made by CodeRabbit. Example:
    • I pushed a fix in commit <commit_id>, please review it.
    • Explain this complex logic.
    • Open a follow-up GitHub issue for this discussion.
  • Files and specific lines of code (under the "Files changed" tab): Tag @coderabbitai in a new review comment at the desired location with your query. Examples:
    • @coderabbitai explain this code block.
    • @coderabbitai modularize this function.
  • PR comments: Tag @coderabbitai in a new PR comment to ask questions about the PR branch. For the best results, please provide a very specific query, as very limited context is provided in this mode. Examples:
    • @coderabbitai gather interesting stats about this repository and render them as a table. Additionally, render a pie chart showing the language distribution in the codebase.
    • @coderabbitai read src/utils.ts and explain its main purpose.
    • @coderabbitai read the files in the src/scheduler package and generate a class diagram using mermaid and a README in the markdown format.
    • @coderabbitai help me debug CodeRabbit configuration file.

Support

Need help? Create a ticket on our support page for assistance with any issues or questions.

Note: Be mindful of the bot's finite context window. It's strongly recommended to break down tasks such as reading entire modules into smaller chunks. For a focused discussion, use review comments to chat about specific files and their changes, instead of using the PR comments.

CodeRabbit Commands (Invoked using PR comments)

  • @coderabbitai pause to pause the reviews on a PR.
  • @coderabbitai resume to resume the paused reviews.
  • @coderabbitai review to trigger an incremental review. This is useful when automatic reviews are disabled for the repository.
  • @coderabbitai full review to do a full review from scratch and review all the files again.
  • @coderabbitai summary to regenerate the summary of the PR.
  • @coderabbitai generate docstrings to generate docstrings for this PR.
  • @coderabbitai generate sequence diagram to generate a sequence diagram of the changes in this PR.
  • @coderabbitai resolve resolve all the CodeRabbit review comments.
  • @coderabbitai configuration to show the current CodeRabbit configuration for the repository.
  • @coderabbitai help to get help.

Other keywords and placeholders

  • Add @coderabbitai ignore anywhere in the PR description to prevent this PR from being reviewed.
  • Add @coderabbitai summary to generate the high-level summary at a specific location in the PR description.
  • Add @coderabbitai anywhere in the PR title to generate the title automatically.

CodeRabbit Configuration File (.coderabbit.yaml)

  • You can programmatically configure CodeRabbit by adding a .coderabbit.yaml file to the root of your repository.
  • Please see the configuration documentation for more information.
  • If your editor has YAML language server enabled, you can add the path at the top of this file to enable auto-completion and validation: # yaml-language-server: $schema=https://coderabbit.ai/integrations/schema.v2.json

Documentation and Community

  • Visit our Documentation for detailed information on how to use CodeRabbit.
  • Join our Discord Community to get help, request features, and share feedback.
  • Follow us on X/Twitter for updates and announcements.


coderabbitai bot left a comment


Actionable comments posted: 0

🧹 Nitpick comments (1)
spark/src/test/scala/ai/chronon/spark/test/fetcher/FetcherTestUtil.scala (1)

156-156: Consider configurable thread pool size

Single thread might bottleneck concurrent operations.

```diff
-implicit val executionContext: ExecutionContext = ExecutionContext.fromExecutor(Executors.newFixedThreadPool(1))
+implicit val executionContext: ExecutionContext = ExecutionContext.fromExecutor(Executors.newFixedThreadPool(Runtime.getRuntime.availableProcessors()))
```
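
A hypothetical variant of the suggestion above that parameterizes the pool size instead of hardcoding either value (helper name is made up):

```scala
import java.util.concurrent.Executors
import scala.concurrent.ExecutionContext

object FetchPoolSketch {
  // Callers can dial parallelism per test; defaults to the machine's core count.
  def executionContext(parallelism: Int = Runtime.getRuntime.availableProcessors()): ExecutionContext =
    ExecutionContext.fromExecutor(Executors.newFixedThreadPool(parallelism))
}
```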
📜 Review details

Configuration used: CodeRabbit UI
Review profile: CHILL
Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 873db7f and 04b42d6.

📒 Files selected for processing (8)
  • spark/src/test/scala/ai/chronon/spark/test/analyzer/DerivationBootstrapTest.scala (1 hunks)
  • spark/src/test/scala/ai/chronon/spark/test/analyzer/DerivationLoggingTest.scala (1 hunks)
  • spark/src/test/scala/ai/chronon/spark/test/fetcher/FetcherDeterministicTest.scala (1 hunks)
  • spark/src/test/scala/ai/chronon/spark/test/fetcher/FetcherGeneratedTest.scala (1 hunks)
  • spark/src/test/scala/ai/chronon/spark/test/fetcher/FetcherTest.scala (0 hunks)
  • spark/src/test/scala/ai/chronon/spark/test/fetcher/FetcherTestUtil.scala (4 hunks)
  • spark/src/test/scala/ai/chronon/spark/test/fetcher/FetcherTiledTest.scala (1 hunks)
  • spark/src/test/scala/ai/chronon/spark/test/fetcher/FetcherUniqueTopKTest.scala (1 hunks)
💤 Files with no reviewable changes (1)
  • spark/src/test/scala/ai/chronon/spark/test/fetcher/FetcherTest.scala
🧰 Additional context used
🧠 Learnings (7)
spark/src/test/scala/ai/chronon/spark/test/fetcher/FetcherDeterministicTest.scala (6)
Learnt from: piyush-zlai
PR: zipline-ai/chronon#33
File: cloud_aws/src/test/scala/ai/chronon/integrations/aws/DynamoDBKVStoreTest.scala:175-175
Timestamp: 2024-10-07T15:09:51.567Z
Learning: Hardcoding future timestamps in tests within `DynamoDBKVStoreTest.scala` is acceptable when data is generated and queried within the same time range, ensuring the tests remain valid over time.
Learnt from: piyush-zlai
PR: zipline-ai/chronon#44
File: hub/app/controllers/ModelController.scala:15-18
Timestamp: 2024-10-17T19:46:42.629Z
Learning: References to `MockDataService` in `hub/test/controllers/SearchControllerSpec.scala` and `hub/test/controllers/ModelControllerSpec.scala` are needed for tests and should not be removed.
Learnt from: piyush-zlai
PR: zipline-ai/chronon#53
File: hub/app/controllers/TimeSeriesController.scala:224-224
Timestamp: 2024-10-29T15:21:58.102Z
Learning: In the mocked data implementation in `hub/app/controllers/TimeSeriesController.scala`, potential `NumberFormatException` exceptions due to parsing errors (e.g., when using `val featureId = name.split("_").last.toInt`) are acceptable and will be addressed when adding the concrete backend.
Learnt from: piyush-zlai
PR: zipline-ai/chronon#43
File: hub/app/controllers/TimeSeriesController.scala:320-320
Timestamp: 2024-10-14T18:44:24.599Z
Learning: In `hub/app/controllers/TimeSeriesController.scala`, the `generateMockTimeSeriesPercentilePoints` method contains placeholder code that will be replaced with the actual implementation soon.
Learnt from: piyush-zlai
PR: zipline-ai/chronon#44
File: hub/test/store/DynamoDBMonitoringStoreTest.scala:69-86
Timestamp: 2024-10-15T15:33:22.265Z
Learning: In `hub/test/store/DynamoDBMonitoringStoreTest.scala`, the current implementation of the `generateListResponse` method is acceptable as-is, and changes for resource handling and error management are not necessary at this time.
Learnt from: nikhil-zlai
PR: zipline-ai/chronon#70
File: service/src/main/java/ai/chronon/service/ApiProvider.java:6-6
Timestamp: 2024-12-03T04:04:33.809Z
Learning: The import `scala.util.ScalaVersionSpecificCollectionsConverter` in `service/src/main/java/ai/chronon/service/ApiProvider.java` is correct and should not be flagged in future reviews.
spark/src/test/scala/ai/chronon/spark/test/fetcher/FetcherUniqueTopKTest.scala (7)
Learnt from: piyush-zlai
PR: zipline-ai/chronon#33
File: cloud_aws/src/test/scala/ai/chronon/integrations/aws/DynamoDBKVStoreTest.scala:175-175
Timestamp: 2024-10-07T15:09:51.567Z
Learning: Hardcoding future timestamps in tests within `DynamoDBKVStoreTest.scala` is acceptable when data is generated and queried within the same time range, ensuring the tests remain valid over time.
Learnt from: chewy-zlai
PR: zipline-ai/chronon#50
File: spark/src/test/scala/ai/chronon/spark/test/MockKVStore.scala:19-28
Timestamp: 2024-10-31T18:29:45.027Z
Learning: In `MockKVStore` located at `spark/src/test/scala/ai/chronon/spark/test/MockKVStore.scala`, the `multiPut` method is intended to be a simple implementation without dataset existence validation, duplicate validation logic elimination, or actual storage of key-value pairs for verification.
Learnt from: piyush-zlai
PR: zipline-ai/chronon#44
File: hub/test/store/DynamoDBMonitoringStoreTest.scala:69-86
Timestamp: 2024-10-15T15:33:22.265Z
Learning: In `hub/test/store/DynamoDBMonitoringStoreTest.scala`, the current implementation of the `generateListResponse` method is acceptable as-is, and changes for resource handling and error management are not necessary at this time.
Learnt from: piyush-zlai
PR: zipline-ai/chronon#53
File: hub/app/controllers/TimeSeriesController.scala:224-224
Timestamp: 2024-10-29T15:21:58.102Z
Learning: In the mocked data implementation in `hub/app/controllers/TimeSeriesController.scala`, potential `NumberFormatException` exceptions due to parsing errors (e.g., when using `val featureId = name.split("_").last.toInt`) are acceptable and will be addressed when adding the concrete backend.
Learnt from: piyush-zlai
PR: zipline-ai/chronon#43
File: hub/app/controllers/TimeSeriesController.scala:320-320
Timestamp: 2024-10-14T18:44:24.599Z
Learning: In `hub/app/controllers/TimeSeriesController.scala`, the `generateMockTimeSeriesPercentilePoints` method contains placeholder code that will be replaced with the actual implementation soon.
Learnt from: piyush-zlai
PR: zipline-ai/chronon#44
File: hub/app/controllers/ModelController.scala:15-18
Timestamp: 2024-10-17T19:46:42.629Z
Learning: References to `MockDataService` in `hub/test/controllers/SearchControllerSpec.scala` and `hub/test/controllers/ModelControllerSpec.scala` are needed for tests and should not be removed.
Learnt from: nikhil-zlai
PR: zipline-ai/chronon#70
File: service/src/main/java/ai/chronon/service/ApiProvider.java:6-6
Timestamp: 2024-12-03T04:04:33.809Z
Learning: The import `scala.util.ScalaVersionSpecificCollectionsConverter` in `service/src/main/java/ai/chronon/service/ApiProvider.java` is correct and should not be flagged in future reviews.
spark/src/test/scala/ai/chronon/spark/test/fetcher/FetcherTiledTest.scala (3)
Learnt from: piyush-zlai
PR: zipline-ai/chronon#33
File: cloud_aws/src/test/scala/ai/chronon/integrations/aws/DynamoDBKVStoreTest.scala:175-175
Timestamp: 2024-10-07T15:09:51.567Z
Learning: Hardcoding future timestamps in tests within `DynamoDBKVStoreTest.scala` is acceptable when data is generated and queried within the same time range, ensuring the tests remain valid over time.
Learnt from: piyush-zlai
PR: zipline-ai/chronon#44
File: hub/app/controllers/ModelController.scala:15-18
Timestamp: 2024-10-17T19:46:42.629Z
Learning: References to `MockDataService` in `hub/test/controllers/SearchControllerSpec.scala` and `hub/test/controllers/ModelControllerSpec.scala` are needed for tests and should not be removed.
Learnt from: piyush-zlai
PR: zipline-ai/chronon#53
File: hub/app/controllers/TimeSeriesController.scala:224-224
Timestamp: 2024-10-29T15:21:58.102Z
Learning: In the mocked data implementation in `hub/app/controllers/TimeSeriesController.scala`, potential `NumberFormatException` exceptions due to parsing errors (e.g., when using `val featureId = name.split("_").last.toInt`) are acceptable and will be addressed when adding the concrete backend.
spark/src/test/scala/ai/chronon/spark/test/fetcher/FetcherGeneratedTest.scala (7)
Learnt from: piyush-zlai
PR: zipline-ai/chronon#33
File: cloud_aws/src/test/scala/ai/chronon/integrations/aws/DynamoDBKVStoreTest.scala:175-175
Timestamp: 2024-10-07T15:09:51.567Z
Learning: Hardcoding future timestamps in tests within `DynamoDBKVStoreTest.scala` is acceptable when data is generated and queried within the same time range, ensuring the tests remain valid over time.
Learnt from: piyush-zlai
PR: zipline-ai/chronon#43
File: hub/app/controllers/TimeSeriesController.scala:320-320
Timestamp: 2024-10-14T18:44:24.599Z
Learning: In `hub/app/controllers/TimeSeriesController.scala`, the `generateMockTimeSeriesPercentilePoints` method contains placeholder code that will be replaced with the actual implementation soon.
Learnt from: piyush-zlai
PR: zipline-ai/chronon#44
File: hub/test/store/DynamoDBMonitoringStoreTest.scala:69-86
Timestamp: 2024-10-15T15:33:22.265Z
Learning: In `hub/test/store/DynamoDBMonitoringStoreTest.scala`, the current implementation of the `generateListResponse` method is acceptable as-is, and changes for resource handling and error management are not necessary at this time.
Learnt from: nikhil-zlai
PR: zipline-ai/chronon#70
File: service/src/main/java/ai/chronon/service/ApiProvider.java:6-6
Timestamp: 2024-12-03T04:04:33.809Z
Learning: The import `scala.util.ScalaVersionSpecificCollectionsConverter` in `service/src/main/java/ai/chronon/service/ApiProvider.java` is correct and should not be flagged in future reviews.
Learnt from: piyush-zlai
PR: zipline-ai/chronon#53
File: hub/app/controllers/TimeSeriesController.scala:224-224
Timestamp: 2024-10-29T15:21:58.102Z
Learning: In the mocked data implementation in `hub/app/controllers/TimeSeriesController.scala`, potential `NumberFormatException` exceptions due to parsing errors (e.g., when using `val featureId = name.split("_").last.toInt`) are acceptable and will be addressed when adding the concrete backend.
Learnt from: piyush-zlai
PR: zipline-ai/chronon#44
File: hub/app/controllers/ModelController.scala:15-18
Timestamp: 2024-10-17T19:46:42.629Z
Learning: References to `MockDataService` in `hub/test/controllers/SearchControllerSpec.scala` and `hub/test/controllers/ModelControllerSpec.scala` are needed for tests and should not be removed.
Learnt from: chewy-zlai
PR: zipline-ai/chronon#50
File: spark/src/test/scala/ai/chronon/spark/test/MockKVStore.scala:19-28
Timestamp: 2024-10-31T18:29:45.027Z
Learning: In `MockKVStore` located at `spark/src/test/scala/ai/chronon/spark/test/MockKVStore.scala`, the `multiPut` method is intended to be a simple implementation without dataset existence validation, duplicate validation logic elimination, or actual storage of key-value pairs for verification.
spark/src/test/scala/ai/chronon/spark/test/analyzer/DerivationBootstrapTest.scala (10)
Learnt from: piyush-zlai
PR: zipline-ai/chronon#44
File: hub/app/controllers/ModelController.scala:15-18
Timestamp: 2024-10-17T19:46:42.629Z
Learning: References to `MockDataService` in `hub/test/controllers/SearchControllerSpec.scala` and `hub/test/controllers/ModelControllerSpec.scala` are needed for tests and should not be removed.
Learnt from: nikhil-zlai
PR: zipline-ai/chronon#70
File: service/src/main/java/ai/chronon/service/ApiProvider.java:6-6
Timestamp: 2024-12-03T04:04:33.809Z
Learning: The import `scala.util.ScalaVersionSpecificCollectionsConverter` in `service/src/main/java/ai/chronon/service/ApiProvider.java` is correct and should not be flagged in future reviews.
Learnt from: piyush-zlai
PR: zipline-ai/chronon#53
File: hub/app/controllers/TimeSeriesController.scala:224-224
Timestamp: 2024-10-29T15:21:58.102Z
Learning: In the mocked data implementation in `hub/app/controllers/TimeSeriesController.scala`, potential `NumberFormatException` exceptions due to parsing errors (e.g., when using `val featureId = name.split("_").last.toInt`) are acceptable and will be addressed when adding the concrete backend.
Learnt from: chewy-zlai
PR: zipline-ai/chronon#62
File: spark/src/main/scala/ai/chronon/spark/stats/drift/SummaryUploader.scala:9-10
Timestamp: 2024-11-06T21:54:56.160Z
Learning: In Spark applications, when defining serializable classes, passing an implicit `ExecutionContext` parameter can cause serialization issues. In such cases, it's acceptable to use `scala.concurrent.ExecutionContext.Implicits.global`.
Learnt from: piyush-zlai
PR: zipline-ai/chronon#43
File: hub/app/controllers/TimeSeriesController.scala:320-320
Timestamp: 2024-10-14T18:44:24.599Z
Learning: In `hub/app/controllers/TimeSeriesController.scala`, the `generateMockTimeSeriesPercentilePoints` method contains placeholder code that will be replaced with the actual implementation soon.
Learnt from: piyush-zlai
PR: zipline-ai/chronon#33
File: cloud_aws/src/main/scala/ai/chronon/integrations/aws/DynamoDBKVStoreImpl.scala:29-30
Timestamp: 2024-10-08T16:18:45.669Z
Learning: In the codebase, the `KVStore` implementation provides an implicit `ExecutionContext` in scope, so it's unnecessary to import another.
Learnt from: piyush-zlai
PR: zipline-ai/chronon#33
File: cloud_aws/src/test/scala/ai/chronon/integrations/aws/DynamoDBKVStoreTest.scala:175-175
Timestamp: 2024-10-07T15:09:51.567Z
Learning: Hardcoding future timestamps in tests within `DynamoDBKVStoreTest.scala` is acceptable when data is generated and queried within the same time range, ensuring the tests remain valid over time.
Learnt from: chewy-zlai
PR: zipline-ai/chronon#50
File: spark/src/test/scala/ai/chronon/spark/test/MockKVStore.scala:19-28
Timestamp: 2024-10-31T18:29:45.027Z
Learning: In `MockKVStore` located at `spark/src/test/scala/ai/chronon/spark/test/MockKVStore.scala`, the `multiPut` method is intended to be a simple implementation without dataset existence validation, duplicate validation logic elimination, or actual storage of key-value pairs for verification.
Learnt from: piyush-zlai
PR: zipline-ai/chronon#44
File: hub/app/store/DynamoDBMonitoringStore.scala:98-143
Timestamp: 2024-10-15T15:30:15.514Z
Learning: In the Scala file `hub/app/store/DynamoDBMonitoringStore.scala`, within the `makeLoadedConfs` method, the `.recover` method is correctly applied to the `Try` returned by `response.values` to handle exceptions from the underlying store.
Learnt from: chewy-zlai
PR: zipline-ai/chronon#47
File: online/src/main/scala/ai/chronon/online/MetadataStore.scala:232-0
Timestamp: 2024-10-17T00:12:09.763Z
Learning: In the `KVStore` trait located at `online/src/main/scala/ai/chronon/online/KVStore.scala`, there are two `create` methods: `def create(dataset: String): Unit` and `def create(dataset: String, props: Map[String, Any]): Unit`. The version with `props` ignores the `props` parameter, and the simpler version without `props` is appropriate when `props` are not needed.
spark/src/test/scala/ai/chronon/spark/test/fetcher/FetcherTestUtil.scala (6)
Learnt from: piyush-zlai
PR: zipline-ai/chronon#33
File: cloud_aws/src/test/scala/ai/chronon/integrations/aws/DynamoDBKVStoreTest.scala:175-175
Timestamp: 2024-10-07T15:09:51.567Z
Learning: Hardcoding future timestamps in tests within `DynamoDBKVStoreTest.scala` is acceptable when data is generated and queried within the same time range, ensuring the tests remain valid over time.
Learnt from: piyush-zlai
PR: zipline-ai/chronon#43
File: hub/app/controllers/TimeSeriesController.scala:320-320
Timestamp: 2024-10-14T18:44:24.599Z
Learning: In `hub/app/controllers/TimeSeriesController.scala`, the `generateMockTimeSeriesPercentilePoints` method contains placeholder code that will be replaced with the actual implementation soon.
Learnt from: piyush-zlai
PR: zipline-ai/chronon#53
File: hub/app/controllers/TimeSeriesController.scala:224-224
Timestamp: 2024-10-29T15:21:58.102Z
Learning: In the mocked data implementation in `hub/app/controllers/TimeSeriesController.scala`, potential `NumberFormatException` exceptions due to parsing errors (e.g., when using `val featureId = name.split("_").last.toInt`) are acceptable and will be addressed when adding the concrete backend.
Learnt from: piyush-zlai
PR: zipline-ai/chronon#44
File: hub/test/store/DynamoDBMonitoringStoreTest.scala:69-86
Timestamp: 2024-10-15T15:33:22.265Z
Learning: In `hub/test/store/DynamoDBMonitoringStoreTest.scala`, the current implementation of the `generateListResponse` method is acceptable as-is, and changes for resource handling and error management are not necessary at this time.
Learnt from: chewy-zlai
PR: zipline-ai/chronon#62
File: spark/src/main/scala/ai/chronon/spark/stats/drift/SummaryUploader.scala:9-10
Timestamp: 2024-11-06T21:54:56.160Z
Learning: In Spark applications, when defining serializable classes, passing an implicit `ExecutionContext` parameter can cause serialization issues. In such cases, it's acceptable to use `scala.concurrent.ExecutionContext.Implicits.global`.
Learnt from: piyush-zlai
PR: zipline-ai/chronon#33
File: cloud_aws/src/main/scala/ai/chronon/integrations/aws/DynamoDBKVStoreImpl.scala:29-30
Timestamp: 2024-10-08T16:18:45.669Z
Learning: In the codebase, the `KVStore` implementation provides an implicit `ExecutionContext` in scope, so it's unnecessary to import another.
spark/src/test/scala/ai/chronon/spark/test/analyzer/DerivationLoggingTest.scala (4)
Learnt from: piyush-zlai
PR: zipline-ai/chronon#33
File: cloud_aws/src/test/scala/ai/chronon/integrations/aws/DynamoDBKVStoreTest.scala:175-175
Timestamp: 2024-10-07T15:09:51.567Z
Learning: Hardcoding future timestamps in tests within `DynamoDBKVStoreTest.scala` is acceptable when data is generated and queried within the same time range, ensuring the tests remain valid over time.
Learnt from: piyush-zlai
PR: zipline-ai/chronon#44
File: hub/app/controllers/ModelController.scala:15-18
Timestamp: 2024-10-17T19:46:42.629Z
Learning: References to `MockDataService` in `hub/test/controllers/SearchControllerSpec.scala` and `hub/test/controllers/ModelControllerSpec.scala` are needed for tests and should not be removed.
Learnt from: nikhil-zlai
PR: zipline-ai/chronon#70
File: service/src/main/java/ai/chronon/service/ApiProvider.java:6-6
Timestamp: 2024-12-03T04:04:33.809Z
Learning: The import `scala.util.ScalaVersionSpecificCollectionsConverter` in `service/src/main/java/ai/chronon/service/ApiProvider.java` is correct and should not be flagged in future reviews.
Learnt from: piyush-zlai
PR: zipline-ai/chronon#53
File: hub/app/controllers/TimeSeriesController.scala:224-224
Timestamp: 2024-10-29T15:21:58.102Z
Learning: In the mocked data implementation in `hub/app/controllers/TimeSeriesController.scala`, potential `NumberFormatException` exceptions due to parsing errors (e.g., when using `val featureId = name.split("_").last.toInt`) are acceptable and will be addressed when adding the concrete backend.
🧬 Code Graph Analysis (1)
spark/src/test/scala/ai/chronon/spark/test/fetcher/FetcherUniqueTopKTest.scala (3)
spark/src/main/scala/ai/chronon/spark/catalog/TableUtils.scala (1)
  • sql (293-321)
spark/src/main/scala/ai/chronon/spark/submission/SparkSessionBuilder.scala (1)
  • SparkSessionBuilder (31-206)
spark/src/test/scala/ai/chronon/spark/test/fetcher/FetcherTestUtil.scala (3)
  • FetcherTestUtil (50-890)
  • generateMutationDataWithUniqueTopK (430-565)
  • compareTemporalFetch (150-290)
⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (17)
  • GitHub Check: service_commons_tests
  • GitHub Check: cloud_aws_tests
  • GitHub Check: cloud_gcp_tests
  • GitHub Check: service_tests
  • GitHub Check: online_tests
  • GitHub Check: api_tests
  • GitHub Check: aggregator_tests
  • GitHub Check: flink_tests
  • GitHub Check: scala_compile_fmt_fix
  • GitHub Check: groupby_tests
  • GitHub Check: streaming_tests
  • GitHub Check: analyzer_tests
  • GitHub Check: fetcher_tests
  • GitHub Check: join_tests
  • GitHub Check: spark_tests
  • GitHub Check: batch_tests
  • GitHub Check: enforce_triggered_workflows
🔇 Additional comments (11)
spark/src/test/scala/ai/chronon/spark/test/analyzer/DerivationBootstrapTest.scala (2)

36-36: LGTM - Class rename improves clarity

The rename from DerivationTest to DerivationBootstrapTest better reflects the test's focus on bootstrap functionality.


42-42: Session name updated correctly

The SparkSession builder name correctly matches the new class name.

spark/src/test/scala/ai/chronon/spark/test/fetcher/FetcherTiledTest.scala (1)

26-45: Well-structured focused test

Clean implementation using utility methods for data generation and comparison. The enableTiling = true parameter correctly targets tiled fetch join scenarios.

spark/src/test/scala/ai/chronon/spark/test/fetcher/FetcherUniqueTopKTest.scala (1)

26-44: Focused UniqueTopK test implementation

Properly uses generateMutationDataWithUniqueTopK for targeted struct testing. The test parameters are appropriate for this aggregation type.

spark/src/test/scala/ai/chronon/spark/test/fetcher/FetcherDeterministicTest.scala (1)

26-44: Clean deterministic test implementation

Uses generateMutationData for consistent test data generation. The deterministic focus helps address flaky test concerns.

spark/src/test/scala/ai/chronon/spark/test/fetcher/FetcherGeneratedTest.scala (1)

26-47: Robust generated data test

Uses generateRandomData with temporal partitioning and enables consistency checks. The random data generation should provide sufficient volume to prevent empty partitions.

spark/src/test/scala/ai/chronon/spark/test/fetcher/FetcherTestUtil.scala (2)

149-290: Well-structured test utility method

Comprehensive coverage of temporal fetch join testing with consistency checks.


576-576: Good fix for data volume issue

100x increase addresses empty partition problem.

spark/src/test/scala/ai/chronon/spark/test/analyzer/DerivationLoggingTest.scala (3)

42-181: Solid test implementation

Good coverage of logging scenarios with proper assertions.


183-307: Excellent edge case coverage

Tests clearly document contextual derivation behavior.


309-344: Clean test with clear expectations

SQL-based validation makes test intent obvious.

nikhil-zlai merged commit 6298d39 into main on Jul 18, 2025
21 checks passed
nikhil-zlai deleted the nikhil/flaky_tests branch on July 18, 2025 at 00:32
