refactor: prior to introducing planner #586

nikhil-zlai · 2025-04-04T02:24:54Z

Summary

removed some dead code - we are going to introduce planner in a later PR - this PR refactors code around to make the next change easy.

Checklist

Added Unit Tests
Covered by existing CI
Integration tested
Documentation update

Summary by CodeRabbit

New Features
- Introduced a new configuration framework that enables enhanced pipeline asset management and clearer node definitions.
- Added an updated data model enumeration for improved data categorization.
Refactor
- Standardized data model identifiers across the system (switching from lower-case to uppercase constants) to ensure consistency.
- Updated join table naming logic and reorganized internal components for improved clarity and maintainability.
- Streamlined the orchestration subsystem by removing deprecated modules, enhancing overall system performance and reliability.

coderabbitai · 2025-04-04T02:25:03Z

Walkthrough

The pull request removes the deprecated DataModel enumeration and multiple orchestration/repository files while updating naming conventions across the codebase. References to lowercase “Entities” and “Events” are now replaced with uppercase “ENTITIES” and “EVENTS.” In addition, a new configuration-and-node conversion framework is introduced (ConfToNode.scala), package declarations and import statements are restructured, and Thrift definitions are updated with field reordering and type changes.

Changes

File(s)	Change Summary
`api/src/main/scala/ai/chronon/api/DataModel.scala`, `Extensions.scala`, `AvroCodecFn.scala`, `DataStreamBuilder.scala`, `GroupByResponseHandler.scala`, `Analyzer.scala`, `.../spark/…`, `.../online/…`, `.../planners`	Removed deprecated DataModel enum; renamed constants from `Entities`/`Events` to `ENTITIES`/`EVENTS` across various modules and updated pattern matching and method signatures.
`api/src/main/scala/ai/chronon/api/planner/ConfToNode.scala`	Introduced a new configuration/node conversion framework with case classes, traits, and helper methods for asset and node management.
`orchestration/...` (e.g., RepoIndex.scala, RepoNodes.scala, RepoParser.scala, RepoTypes.scala, VersionUpdate.scala, Logical/Physical node impl files)	Removed orchestration and repository management files including logical and physical node implementations and utility classes.
`api/thrift/api.thrift`, `api/thrift/orchestration.thrift`	Added new Thrift `DataModel` enum; reordered struct fields and updated type of fields (e.g. leftDataModel field type change) in orchestration thrift definitions.
`DependencyResolver.scala`, `RelevantLeftForJoinPart.scala`, `DependencyResolverSpec.scala`	Updated package declarations and import statements (moved from `dependency` to `planner`) to reflect codebase restructuring.
`spark/Join.scala`, `JoinUtils.scala`, `JoinBase.scala`, `JoinDerivationJob.scala`, `LabelJoin.scala`, etc.	Updated join logic and table name computation; standardized constant usage and updated method signatures to leverage new naming conventions and helper methods.

Sequence Diagram(s)

sequenceDiagram
    participant C as Client
    participant F as Config Loader
    participant N as Node Builder
    C->>F: Call from(filePath)
    F->>N: Build assets & nodes
    N-->>F: Return node configuration
    F-->>C: Provide Conf instance

sequenceDiagram
    participant U as User
    participant J as Join Handler
    participant JU as JoinUtils
    U->>J: Initiate join request
    J->>JU: Compute left source table name
    JU->>JU: Validate using ENTITIES/(EVENTS)
    JU-->>J: Return full table name
    J-->>U: Complete join process

Possibly related PRs

adding format based read / write support #12: Updated handling of DataModel constants from lowercase to uppercase.
[WIP] Drift metrics #59: Removed the DataModel enum and standardized constant usage across modules.
Refactor spark to create modular jobs for orchestrator #402: Modified DataModel constant references to improve naming consistency.

Poem

In the code’s realm where changes unfurl,
Constants now shimmer in uppercase swirl.
Deprecated files quietly fade away,
As new nodes and configs lead the way.
Code’s evolution—a quirky, joyful play! 🚀

Warning

Review ran into problems

🔥 Problems

GitHub Actions and Pipeline Checks: Resource not accessible by integration - https://docs.github.com/rest/actions/workflow-runs#list-workflow-runs-for-a-repository.

Please grant the required permissions to the CodeRabbit GitHub App under the organization or repository settings.

🪧 Tips

Chat

There are 3 ways to chat with CodeRabbit:

Review comments: Directly reply to a review comment made by CodeRabbit. Example:
- I pushed a fix in commit <commit_id>, please review it.
- Generate unit testing code for this file.
- Open a follow-up GitHub issue for this discussion.
Files and specific lines of code (under the "Files changed" tab): Tag @coderabbitai in a new review comment at the desired location with your query. Examples:
- @coderabbitai generate unit testing code for this file.
- @coderabbitai modularize this function.
PR comments: Tag @coderabbitai in a new PR comment to ask questions about the PR branch. For the best results, please provide a very specific query, as very limited context is provided in this mode. Examples:
- @coderabbitai gather interesting stats about this repository and render them as a table. Additionally, render a pie chart showing the language distribution in the codebase.
- @coderabbitai read src/utils.ts and generate unit testing code.
- @coderabbitai read the files in the src/scheduler package and generate a class diagram using mermaid and a README in the markdown format.
- @coderabbitai help me debug CodeRabbit configuration file.

Note: Be mindful of the bot's finite context window. It's strongly recommended to break down tasks such as reading entire modules into smaller chunks. For a focused discussion, use review comments to chat about specific files and their changes, instead of using the PR comments.

CodeRabbit Commands (Invoked using PR comments)

@coderabbitai pause to pause the reviews on a PR.
@coderabbitai resume to resume the paused reviews.
@coderabbitai review to trigger an incremental review. This is useful when automatic reviews are disabled for the repository.
@coderabbitai full review to do a full review from scratch and review all the files again.
@coderabbitai summary to regenerate the summary of the PR.
@coderabbitai resolve resolve all the CodeRabbit review comments.
@coderabbitai plan to trigger planning for file edits and PR creation.
@coderabbitai configuration to show the current CodeRabbit configuration for the repository.
@coderabbitai help to get help.

Other keywords and placeholders

Add @coderabbitai ignore anywhere in the PR description to prevent this PR from being reviewed.
Add @coderabbitai summary to generate the high-level summary at a specific location in the PR description.
Add @coderabbitai anywhere in the PR title to generate the title automatically.

CodeRabbit Configuration File (`.coderabbit.yaml`)

You can programmatically configure CodeRabbit by adding a .coderabbit.yaml file to the root of your repository.
Please see the configuration documentation for more information.
If your editor has YAML language server enabled, you can add the path at the top of this file to enable auto-completion and validation: # yaml-language-server: $schema=https://coderabbit.ai/integrations/schema.v2.json

Documentation and Community

Visit our Documentation for detailed information on how to use CodeRabbit.
Join our Discord Community to get help, request features, and share feedback.
Follow us on X/Twitter for updates and announcements.

coderabbitai

Actionable comments posted: 1

🧹 Nitpick comments (9)

api/src/main/scala/ai/chronon/api/Extensions.scala (1)

1213-1214: Function signature updated with unused parameter

A joinOutputTable parameter was added but not used in the method.

Consider removing the unused parameter or utilizing it if it's intended for future functionality.

api/src/main/scala/ai/chronon/api/planner/ConfToNode.scala (8)

1-10: Suggest adding brief documentation.
These lines look fine, but docstrings or scaladocs on Asset could clarify usage.

33-35: Implementation missing in Conf.from(...).
This method is unimplemented; fill it or remove if unused.

37-134: Methods are just placeholders.
All node case classes have ???; ensure correct logic is added before production.

220-220: onlineNodes not implemented.
Fill in online node generation or confirm if it's unnecessary.

223-241: Staging and GroupBy conf incomplete.
Both offline/online methods are ???; clarify final usage or fill logic.

244-252: ModelConf incomplete.
Offline and online node methods are ???; implement if required.

255-261: Empty Dependency definitions.
Trait/Object have no members; remove or expand as needed.

270-272: ConfToNodes object is empty.
Confirm its purpose or remove if it’s not needed.

📜 Review details

Configuration used: CodeRabbit UI
Review profile: CHILL
Plan: Pro (Legacy)

📥 Commits

Reviewing files that changed from the base of the PR and between b566b10 and e808406.

📒 Files selected for processing (53)

api/src/main/scala/ai/chronon/api/DataModel.scala (0 hunks)
api/src/main/scala/ai/chronon/api/Extensions.scala (6 hunks)
api/src/main/scala/ai/chronon/api/planner/ConfToNode.scala (1 hunks)
api/src/main/scala/ai/chronon/api/planner/DependencyResolver.scala (1 hunks)
api/src/main/scala/ai/chronon/api/planner/RelevantLeftForJoinPart.scala (2 hunks)
api/src/test/scala/ai/chronon/api/test/RelevantLeftForJoinPartSpec.scala (1 hunks)
api/thrift/api.thrift (1 hunks)
api/thrift/orchestration.thrift (2 hunks)
flink/src/main/scala/ai/chronon/flink/AvroCodecFn.scala (2 hunks)
online/src/main/scala/ai/chronon/online/DataStreamBuilder.scala (3 hunks)
online/src/main/scala/ai/chronon/online/GroupByServingInfoParsed.scala (1 hunks)
online/src/main/scala/ai/chronon/online/fetcher/GroupByResponseHandler.scala (2 hunks)
online/src/test/scala/ai/chronon/online/test/DataStreamBuilderTest.scala (1 hunks)
orchestration/src/main/scala/ai/chronon/orchestration/RepoIndex.scala (0 hunks)
orchestration/src/main/scala/ai/chronon/orchestration/RepoNodes.scala (0 hunks)
orchestration/src/main/scala/ai/chronon/orchestration/RepoParser.scala (0 hunks)
orchestration/src/main/scala/ai/chronon/orchestration/RepoTypes.scala (0 hunks)
orchestration/src/main/scala/ai/chronon/orchestration/VersionUpdate.scala (0 hunks)
orchestration/src/main/scala/ai/chronon/orchestration/logical/GroupByNodeImpl.scala (0 hunks)
orchestration/src/main/scala/ai/chronon/orchestration/logical/JoinNodeImpl.scala (0 hunks)
orchestration/src/main/scala/ai/chronon/orchestration/logical/LogicalNodeImpl.scala (0 hunks)
orchestration/src/main/scala/ai/chronon/orchestration/logical/ModelNodeImpl.scala (0 hunks)
orchestration/src/main/scala/ai/chronon/orchestration/logical/StagingQueryNodeImpl.scala (0 hunks)
orchestration/src/main/scala/ai/chronon/orchestration/logical/TabularDataNodeImpl.scala (0 hunks)
orchestration/src/main/scala/ai/chronon/orchestration/physical/ComputeNodeRunner.scala (0 hunks)
orchestration/src/main/scala/ai/chronon/orchestration/physical/GroupByBackfill.scala (0 hunks)
orchestration/src/main/scala/ai/chronon/orchestration/physical/GroupByPrepareUpload.scala (0 hunks)
orchestration/src/main/scala/ai/chronon/orchestration/physical/JoinBackfill.scala (0 hunks)
orchestration/src/main/scala/ai/chronon/orchestration/physical/LabelJoin.scala (0 hunks)
orchestration/src/main/scala/ai/chronon/orchestration/physical/StagingQueryNode.scala (0 hunks)
orchestration/src/main/scala/ai/chronon/orchestration/physical/TabularNode.scala (0 hunks)
orchestration/src/main/scala/ai/chronon/orchestration/utils/Config.scala (0 hunks)
orchestration/src/main/scala/ai/chronon/orchestration/utils/SequenceMap.scala (0 hunks)
orchestration/src/test/scala/ai/chronon/orchestration/test/RepoIndexSpec.scala (0 hunks)
orchestration/src/test/scala/ai/chronon/orchestration/test/SequenceMapSpec.scala (0 hunks)
spark/src/main/scala/ai/chronon/spark/Analyzer.scala (2 hunks)
spark/src/main/scala/ai/chronon/spark/BootstrapJob.scala (1 hunks)
spark/src/main/scala/ai/chronon/spark/GroupBy.scala (5 hunks)
spark/src/main/scala/ai/chronon/spark/GroupByUpload.scala (1 hunks)
spark/src/main/scala/ai/chronon/spark/Join.scala (5 hunks)
spark/src/main/scala/ai/chronon/spark/JoinBase.scala (2 hunks)
spark/src/main/scala/ai/chronon/spark/JoinDerivationJob.scala (1 hunks)
spark/src/main/scala/ai/chronon/spark/JoinPartJob.scala (5 hunks)
spark/src/main/scala/ai/chronon/spark/JoinUtils.scala (6 hunks)
spark/src/main/scala/ai/chronon/spark/LabelJoin.scala (4 hunks)
spark/src/main/scala/ai/chronon/spark/MergeJob.scala (4 hunks)
spark/src/main/scala/ai/chronon/spark/SourceJob.scala (3 hunks)
spark/src/main/scala/ai/chronon/spark/stats/CompareJob.scala (2 hunks)
spark/src/main/scala/ai/chronon/spark/streaming/GroupBy.scala (4 hunks)
spark/src/main/scala/ai/chronon/spark/streaming/JoinSourceRunner.scala (1 hunks)
spark/src/test/scala/ai/chronon/spark/test/OnlineUtils.scala (1 hunks)
spark/src/test/scala/ai/chronon/spark/test/join/JoinTest.scala (1 hunks)
spark/src/test/scala/ai/chronon/spark/test/join/ModularJoinTest.scala (5 hunks)

💤 Files with no reviewable changes (23)

orchestration/src/main/scala/ai/chronon/orchestration/physical/LabelJoin.scala
orchestration/src/main/scala/ai/chronon/orchestration/logical/LogicalNodeImpl.scala
orchestration/src/main/scala/ai/chronon/orchestration/logical/GroupByNodeImpl.scala
orchestration/src/main/scala/ai/chronon/orchestration/logical/TabularDataNodeImpl.scala
orchestration/src/main/scala/ai/chronon/orchestration/physical/GroupByBackfill.scala
orchestration/src/main/scala/ai/chronon/orchestration/RepoParser.scala
orchestration/src/test/scala/ai/chronon/orchestration/test/SequenceMapSpec.scala
orchestration/src/main/scala/ai/chronon/orchestration/physical/GroupByPrepareUpload.scala
api/src/main/scala/ai/chronon/api/DataModel.scala
orchestration/src/main/scala/ai/chronon/orchestration/physical/ComputeNodeRunner.scala
orchestration/src/main/scala/ai/chronon/orchestration/RepoNodes.scala
orchestration/src/main/scala/ai/chronon/orchestration/physical/TabularNode.scala
orchestration/src/main/scala/ai/chronon/orchestration/logical/ModelNodeImpl.scala
orchestration/src/main/scala/ai/chronon/orchestration/physical/StagingQueryNode.scala
orchestration/src/main/scala/ai/chronon/orchestration/logical/JoinNodeImpl.scala
orchestration/src/main/scala/ai/chronon/orchestration/physical/JoinBackfill.scala
orchestration/src/main/scala/ai/chronon/orchestration/utils/Config.scala
orchestration/src/main/scala/ai/chronon/orchestration/logical/StagingQueryNodeImpl.scala
orchestration/src/main/scala/ai/chronon/orchestration/RepoIndex.scala
orchestration/src/test/scala/ai/chronon/orchestration/test/RepoIndexSpec.scala
orchestration/src/main/scala/ai/chronon/orchestration/utils/SequenceMap.scala
orchestration/src/main/scala/ai/chronon/orchestration/VersionUpdate.scala
orchestration/src/main/scala/ai/chronon/orchestration/RepoTypes.scala

🧰 Additional context used

🧬 Code Definitions (22)

spark/src/main/scala/ai/chronon/spark/streaming/JoinSourceRunner.scala (2)

online/src/main/scala/ai/chronon/online/GroupByServingInfoParsed.scala (1)

mutationValueChrononSchema (122-124)

api/src/main/scala/ai/chronon/api/Extensions.scala (2)

dataModel (320-325)

dataModel (456-463)

api/src/test/scala/ai/chronon/api/test/RelevantLeftForJoinPartSpec.scala (1)

api/src/main/scala/ai/chronon/api/planner/RelevantLeftForJoinPart.scala (2)

RelevantLeftForJoinPart (15-23)

RelevantLeftForJoinPart (25-91)

online/src/test/scala/ai/chronon/online/test/DataStreamBuilderTest.scala (1)

online/src/main/scala/ai/chronon/online/DataStreamBuilder.scala (2)

DataStream (54-106)

apply (58-105)

spark/src/main/scala/ai/chronon/spark/BootstrapJob.scala (1)

spark/src/main/scala/ai/chronon/spark/JoinUtils.scala (2)

JoinUtils (41-628)

computeFullLeftSourceTableName (625-627)

spark/src/test/scala/ai/chronon/spark/test/join/JoinTest.scala (1)

api/src/main/scala/ai/chronon/api/planner/RelevantLeftForJoinPart.scala (2)

RelevantLeftForJoinPart (15-23)

RelevantLeftForJoinPart (25-91)

api/src/main/scala/ai/chronon/api/planner/DependencyResolver.scala (1)

api/src/main/scala/ai/chronon/api/Extensions.scala (3)

Extensions (43-1252)

SourceOps (319-432)

WindowUtils (88-116)

online/src/main/scala/ai/chronon/online/DataStreamBuilder.scala (3)

api/src/main/scala/ai/chronon/api/DataType.scala (1)

apply (178-178)

api/src/main/scala/ai/chronon/api/Builders.scala (9)

apply (30-32)

apply (42-65)

apply (69-84)

apply (88-103)

apply (143-167)

apply (171-198)

apply (202-208)

apply (212-217)

apply (221-232)

api/src/main/scala/ai/chronon/api/ParametricMacro.scala (1)

api (66-66)

spark/src/main/scala/ai/chronon/spark/streaming/GroupBy.scala (3)

api/src/main/scala/ai/chronon/api/Extensions.scala (4)

dataModel (320-325)

dataModel (456-463)

keys (535-545)

keys (808-821)

spark/src/main/scala/ai/chronon/spark/stats/drift/Summarizer.scala (1)

keys (103-114)

online/src/main/scala/ai/chronon/online/GroupByServingInfoParsed.scala (1)

mutationValueChrononSchema (122-124)

online/src/main/scala/ai/chronon/online/fetcher/GroupByResponseHandler.scala (2)

api/src/main/scala/ai/chronon/api/Extensions.scala (2)

dataModel (320-325)

dataModel (456-463)

online/src/main/scala/ai/chronon/online/GroupByServingInfoParsed.scala (2)

valueAvroCodec (83-83)

mutationValueAvroCodec (126-126)

api/src/main/scala/ai/chronon/api/Extensions.scala (1)

api/src/main/scala/ai/chronon/api/Constants.scala (1)

Constants (23-100)

spark/src/main/scala/ai/chronon/spark/stats/CompareJob.scala (1)

api/src/main/scala/ai/chronon/api/Extensions.scala (2)

dataModel (320-325)

dataModel (456-463)

spark/src/main/scala/ai/chronon/spark/GroupByUpload.scala (1)

spark/src/main/scala/ai/chronon/spark/GroupBy.scala (3)

snapshotEvents (183-193)

snapshotEntities (153-160)

temporalEvents (291-369)

spark/src/main/scala/ai/chronon/spark/GroupBy.scala (2)

api/src/main/scala/ai/chronon/api/Extensions.scala (2)

dataModel (320-325)

dataModel (456-463)

spark/src/main/scala/ai/chronon/spark/GroupByUpload.scala (2)

snapshotEntities (65-80)

snapshotEvents (82-86)

api/src/main/scala/ai/chronon/api/planner/RelevantLeftForJoinPart.scala (1)

api/src/main/scala/ai/chronon/api/Extensions.scala (3)

Extensions (43-1252)

SourceOps (319-432)

StringOps (656-666)

spark/src/main/scala/ai/chronon/spark/JoinDerivationJob.scala (1)

spark/src/main/scala/ai/chronon/spark/JoinUtils.scala (2)

JoinUtils (41-628)

computeFullLeftSourceTableName (625-627)

spark/src/main/scala/ai/chronon/spark/JoinPartJob.scala (4)

spark/src/main/scala/ai/chronon/spark/JoinUtils.scala (3)

JoinUtils (41-628)

genBloomFilterIfNeeded (337-374)

shiftDays (569-594)

api/src/main/scala/ai/chronon/api/Extensions.scala (3)

dataModel (320-325)

dataModel (456-463)

toPartitionRange (1246-1250)

spark/src/main/scala/ai/chronon/spark/GroupBy.scala (2)

snapshotEvents (183-193)

snapshotEntities (153-160)

api/src/main/scala/ai/chronon/api/DataRange.scala (1)

toPartitionRange (30-32)

spark/src/main/scala/ai/chronon/spark/Analyzer.scala (1)

api/src/main/scala/ai/chronon/api/PartitionSpec.scala (2)

minus (50-50)

minus (54-57)

spark/src/main/scala/ai/chronon/spark/LabelJoin.scala (3)

api/src/main/scala/ai/chronon/api/Extensions.scala (2)

dataModel (320-325)

dataModel (456-463)

spark/src/main/scala/ai/chronon/spark/GroupBy.scala (1)

snapshotEntities (153-160)

spark/src/main/scala/ai/chronon/spark/GroupByUpload.scala (1)

snapshotEntities (65-80)

spark/src/test/scala/ai/chronon/spark/test/join/ModularJoinTest.scala (2)

spark/src/main/scala/ai/chronon/spark/JoinUtils.scala (2)

JoinUtils (41-628)

computeFullLeftSourceTableName (625-627)

api/src/main/scala/ai/chronon/api/planner/RelevantLeftForJoinPart.scala (4)

RelevantLeftForJoinPart (15-23)

RelevantLeftForJoinPart (25-91)

partTableName (35-53)

fullPartTableName (55-64)

spark/src/main/scala/ai/chronon/spark/MergeJob.scala (2)

api/src/main/scala/ai/chronon/api/planner/RelevantLeftForJoinPart.scala (2)

RelevantLeftForJoinPart (15-23)

RelevantLeftForJoinPart (25-91)

api/src/main/scala/ai/chronon/api/Extensions.scala (2)

dataModel (320-325)

dataModel (456-463)

spark/src/main/scala/ai/chronon/spark/Join.scala (2)

api/src/main/scala/ai/chronon/api/planner/RelevantLeftForJoinPart.scala (3)

RelevantLeftForJoinPart (15-23)

RelevantLeftForJoinPart (25-91)

partTableName (35-53)

spark/src/main/scala/ai/chronon/spark/JoinUtils.scala (2)

JoinUtils (41-628)

computeFullLeftSourceTableName (625-627)

api/src/main/scala/ai/chronon/api/planner/ConfToNode.scala (5)

api/src/main/scala/ai/chronon/api/Extensions.scala (6)

Extensions (43-1252)

MetadataOps (118-165)

SourceOps (319-432)

table (374-374)

from (252-293)

outputTable (121-121)

spark/src/main/scala/ai/chronon/spark/GroupBy.scala (2)

spark (393-397)

from (484-578)

spark/src/main/scala/ai/chronon/spark/JoinUtils.scala (2)

JoinUtils (41-628)

computeLeftSourceTableName (606-619)

spark/src/main/scala/ai/chronon/spark/GroupByUpload.scala (2)

GroupByUpload (56-124)

GroupByUpload (126-274)

api/src/main/scala/ai/chronon/api/planner/RelevantLeftForJoinPart.scala (3)

RelevantLeftForJoinPart (15-23)

RelevantLeftForJoinPart (25-91)

partTableName (35-53)

🔇 Additional comments (89)

online/src/main/scala/ai/chronon/online/fetcher/GroupByResponseHandler.scala (2)

85-85: Enum constant renaming looks good.

Updated to use DataModel.ENTITIES instead of DataModel.Entities for consistency.

135-137: Pattern matching updated correctly.

Changed enum values to uppercase constants (EVENTS, ENTITIES) to follow standard naming conventions.

api/src/main/scala/ai/chronon/api/planner/DependencyResolver.scala (2)

1-1: Package ref updated.
New package correctly reflects the module reorganization.

6-6: Import location revised.
convertUnits is now imported from its updated location.

spark/src/main/scala/ai/chronon/spark/Analyzer.scala (4)

21-22: Updated imports for new DataModel naming convention.

Correct update to imports using the new DataModel case objects ENTITIES and EVENTS.

26-26: Updated import path.

Import path correctly updated.

437-439: Updated pattern matching for DataModel constants.

Pattern matching correctly updated to use ENTITIES instead of the deprecated Entities constant.

440-444: Updated EVENTS pattern matching cases.

Pattern matching cases properly updated to use the new EVENTS constant naming convention.

spark/src/main/scala/ai/chronon/spark/MergeJob.scala (4)

15-15: Import change to use ENTITIES constant.

Updated import statement to use ENTITIES constant instead of Entities enum.

26-26: Import path adjustment.

Import path for RelevantLeftForJoinPart properly specified.

91-91: Updated condition to use ENTITIES constant.

Condition correctly updated to maintain the same logic while using the new constant name.

108-108: Updated condition to use ENTITIES constant.

Condition in joinWithLeft method properly updated to use the new ENTITIES constant.

spark/src/test/scala/ai/chronon/spark/test/join/ModularJoinTest.scala (6)

6-6: Import updated for package restructuring.

Import now specifies planner package which aligns with organizational changes.

201-201: Method name now reflects its full functionality.

Changed to computeFullLeftSourceTableName which accurately describes that it includes namespace information.

280-280: Updated to use planner namespace.

Correctly qualified RelevantLeftForJoinPart methods with planner prefix to match package restructuring.

Also applies to: 282-282

295-295: Simplified dataModel parameter usage.

Removed unnecessary toString() call since setLeftDataModel now accepts dataModel directly.

305-306: Updated to use planner namespace.

Correctly qualified RelevantLeftForJoinPart methods with planner prefix to match package restructuring.

314-314: Simplified dataModel parameter usage.

Removed unnecessary toString() call since setLeftDataModel now accepts dataModel directly.

spark/src/main/scala/ai/chronon/spark/streaming/JoinSourceRunner.scala (4)

76-79: Updated DataModel enumeration values to uppercase

Changed pattern matching from Events/Entities to EVENTS/ENTITIES, maintaining consistency with the broader codebase changes.

80-83: Consistent DataModel enumeration usage

Modified case pattern matching to use uppercase constants (ENTITIES/EVENTS), aligning with similar changes throughout the codebase.

76-79: Updated pattern matching to use new uppercase data model constants.

Constants renamed from lowercase to uppercase for consistency.

80-83: Updated pattern matching to use new uppercase data model constants.

Constants renamed from lowercase to uppercase for consistency.

online/src/main/scala/ai/chronon/online/GroupByServingInfoParsed.scala (2)

137-140: Updated DataModel enumeration values

Changed lowercase Events/Entities to uppercase EVENTS/ENTITIES in pattern matching, maintaining consistency with the broader enum renaming effort.

138-140: Updated pattern matching to use new uppercase data model constants.

Constants renamed from lowercase to uppercase for consistency.

spark/src/test/scala/ai/chronon/spark/test/join/JoinTest.scala (2)

33-33: Updated import path for RelevantLeftForJoinPart

Import path now correctly reflects the class's new location in the ai.chronon.api.planner package.

33-33: Updated import to reflect new package structure.

RelevantLeftForJoinPart moved from ai.chronon.api to ai.chronon.api.planner.

flink/src/main/scala/ai/chronon/flink/AvroCodecFn.scala (2)

59-59: Updated to consistent naming convention.

Changed from DataModel.Events to DataModel.EVENTS to match the standardized uppercase enum values.

75-76: Updated to consistent naming convention.

Changed from DataModel.Events to DataModel.EVENTS to match the standardized uppercase enum values.

spark/src/main/scala/ai/chronon/spark/SourceJob.scala (3)

3-3: Updated import for new enum value.

Direct import of the specific enum value EVENTS instead of the whole DataModel class.

29-33: Updated to consistent naming convention.

Changed from source.dataModel == Events to source.dataModel == EVENTS.

56-60: Updated to consistent naming convention.

Changed from source.dataModel == Events to source.dataModel == EVENTS.

spark/src/main/scala/ai/chronon/spark/GroupByUpload.scala (1)

236-240: Updated to consistent naming convention.

All pattern matching cases updated to use uppercase enum values (EVENTS, ENTITIES).

spark/src/test/scala/ai/chronon/spark/test/OnlineUtils.scala (2)

56-62: Updated to consistent naming convention.

Changed from DataModel.Entities to DataModel.ENTITIES.

63-66: Updated to consistent naming convention.

Changed from DataModel.Events to DataModel.EVENTS.

spark/src/main/scala/ai/chronon/spark/LabelJoin.scala (5)

21-22: Updated import statements to use new DataModel constants

Import statements correctly updated from Entities/Events to ENTITIES/EVENTS to align with API changes.

62-63: Assert correctly updated to use EVENTS constant

Assertion updated to use the new EVENTS constant.

70-71: Updated assertion to use EVENTS constant

Consistent update of assertion condition.

86-87: Updated assertion for ENTITIES constant

Maintaining consistent usage of new constant names.

245-249: Updated pattern matching cases

Pattern matching cases correctly modified to use EVENTS and ENTITIES constants.

spark/src/main/scala/ai/chronon/spark/streaming/GroupBy.scala (5)

55-62: Updated pattern matching for dataModel

Pattern matching correctly updated to use the new ENTITIES and EVENTS constants.

72-75: Updated dataModel case matching

Case matching correctly modified to use the new constant names.

147-148: Updated conditional check for ENTITIES

Conditional check correctly updated to use the new constant name.

153-155: Updated pattern matching in value columns definition

Pattern matching correctly updated to use the new constant names.

164-166: Updated valueZSchema pattern matching

Pattern matching for valueZSchema correctly updated to use new constants.

spark/src/main/scala/ai/chronon/spark/stats/CompareJob.scala (2)

21-22: Updated import statement

Import statement correctly updated to use new EVENTS constant.

177-178: Updated conditional check in getJoinKeys

Conditional check correctly updated to use the new EVENTS constant.

spark/src/main/scala/ai/chronon/spark/JoinUtils.scala (7)

20-22: Updated import statements

Import statements correctly updated to use the new EVENTS constant.

77-81: Updated conditional check for dataModel

Conditional check correctly updated to use the new EVENTS constant.

107-111: Updated conditional in leftDfFromSource

Conditional check correctly updated to use new constant name.

575-579: Updated shiftDays conditional check

Conditional correctly updated to use the new EVENTS constant.

587-593: Updated rightRange calculation

Conditional check for calculating rightRange correctly updated.

606-619: Refactored computeLeftSourceTableName

Modified to return only the table name without namespace.

621-627: Added computeFullLeftSourceTableName method

New method correctly combines namespace with table name for complete reference.

api/thrift/api.thrift (1)

497-501: Added DataModel enum in Thrift

New enum with uppercase constants aligns with standardized naming convention across codebase.

spark/src/main/scala/ai/chronon/spark/JoinDerivationJob.scala (1)

26-26: Updated method to use full table name

Changed from computeLeftSourceTableName to computeFullLeftSourceTableName to use qualified table name with namespace.

api/src/test/scala/ai/chronon/api/test/RelevantLeftForJoinPartSpec.scala (1)

5-5: Updated import path

Reflects RelevantLeftForJoinPart's move to api.planner package.

online/src/test/scala/ai/chronon/online/test/DataStreamBuilderTest.scala (1)

60-60: Updated to use uppercase DataModel constant

Changed from DataModel.Events to DataModel.EVENTS to match new enum naming.

spark/src/main/scala/ai/chronon/spark/JoinBase.scala (2)

22-22: Updated import to use new constant name

Import updated from Entities to ENTITIES to align with standardized constant naming.

77-77: Updated condition check to use the new constant

Updated condition to use ENTITIES constant instead of Entities for consistency.

spark/src/main/scala/ai/chronon/spark/BootstrapJob.scala (1)

32-32: Enhanced table name construction logic

Changed from computeLeftSourceTableName to computeFullLeftSourceTableName to include namespace in the table name.

api/src/main/scala/ai/chronon/api/planner/RelevantLeftForJoinPart.scala (3)

1-1: Moved to appropriate package

Relocated class to ai.chronon.api.planner package for better organization.

4-6: Updated imports

Removed specific extension imports and added wildcard import for base package.

79-79: Updated to new constant naming

Updated case match to use EVENTS instead of Events for consistency.

online/src/main/scala/ai/chronon/online/DataStreamBuilder.scala (3)

58-58: Updated default parameter to new constant

Changed default value from DataModel.Events to DataModel.EVENTS.

73-77: Updated pattern matching to use new constants

Updated data model constants in pattern matching to align with new naming convention.

88-91: Updated conditional data model check

Modified pattern matching to use the new uppercase constants consistently.

api/src/main/scala/ai/chronon/api/Extensions.scala (1)

322-323: LGTM: Data model constant renaming

Constants renamed from Entities/Events to uppercase format.

spark/src/main/scala/ai/chronon/spark/GroupBy.scala (3)

34-35: LGTM: Imports updated to use constant names

Updated imports to reference renamed constants.

435-436: LGTM: Updated DataModel reference

Updated reference to use new constant name.

739-742: LGTM: Updated pattern matching with new constant names

Pattern match now uses imported uppercase constants.

spark/src/main/scala/ai/chronon/spark/JoinPartJob.scala (3)

5-6: LGTM: Updated imports for DataModel

Imports modified to reference new constant names.

62-62: LGTM: Using node.leftDataModel directly

Replaced local variable with direct property access.

219-232: LGTM: Improved pattern matching

Updated triple-tuple pattern match with direct property access and new constant names.

spark/src/main/scala/ai/chronon/spark/Join.scala (6)

20-20: Updated import to use new DataModel constant name.

The import has been correctly updated from Entities to ENTITIES, matching the standardized naming convention in the codebase.

23-23: Explicitly importing planner package.

Import now specifically references the planner package rather than relying on wildcard import, improving clarity.

206-206: Updated data model comparison to use ENTITIES constant.

Condition properly updated to check against the new uppercase constant name.

338-338: Updated RelevantLeftForJoinPart reference to use planner package.

Correctly updated to reference the class from its new location in the planner package.

361-361: Changed to use computeFullLeftSourceTableName.

Now using method that includes namespace in the table name, ensuring proper table qualification.

378-378: Updated leftDataModel setter.

Direct assignment of data model without conversion, matching the updated type in JoinPartNode.

api/thrift/orchestration.thrift (7)

179-183: Repositioned metaData field to first position.

Standardized struct field ordering with metaData as first field.

186-188: Repositioned metaData field to first position.

Standardized struct field ordering with metaData as first field.

191-193: Repositioned metaData field to first position.

Standardized struct field ordering with metaData as first field.

196-198: Repositioned metaData field to first position.

Standardized struct field ordering with metaData as first field.

201-206: Repositioned metaData field and updated leftDataModel type.

Standardized struct field ordering and changed leftDataModel from string to api.DataModel enum type.

209-211: Repositioned metaData field to first position.

Standardized struct field ordering with metaData as first field.

219-219: Added LabelPartNode to NodeUnion.

Added LabelPartNode as the 6th member of the NodeUnion, allowing the use of this node type.

api/src/main/scala/ai/chronon/api/planner/ConfToNode.scala (3)

11-22: Data classes look good.
They are concise and well-structured for holding table/asset info.

24-31: Interface clarity is fine.
Conf clearly defines required fields for any implementing configuration.

135-139: JoinConf structure is clear.
Constructor and basic overrides for json and metaData are straightforward.

coderabbitai · 2025-04-04T02:33:56Z

api/src/main/scala/ai/chronon/api/planner/ConfToNode.scala

+    override def offlineNodes: Seq[Node] = {
+
+      def metaDataCopy(): MetaData = join.metaData.deepCopy()
+      def joinCopy(): Join = join.deepCopy()
+
+      require(join.left != null, s"Left side of join ${join.metaData.getName} is null.")
+
+      val leftSourceTableName = JoinUtils.computeLeftSourceTableName(join)
+      val sourceMetadata = metaDataCopy().setName(leftSourceTableName)
+      val sourceNode = SourceWithFilter(new SourceWithFilterNode()
+            .setSource(join.left)
+            .setMetaData(sourceMetadata))
+
+      require(join.metaData != null, s"Join metaData is null for join ${join.metaData.getName}.")
+
+      val bootstrapNodeOpt = Option(join.bootstrapParts)
+        .filterNot(_.isEmpty)
+        .map{ parts =>
+            val result = new JoinBootstrapNode()
+            result.setJoin(join)
+
+            result.setMetaData(metaDataCopy().setName(join.metaData.getName + ".bootstrap"))
+            JoinBootstrap(result)
+          }
+
+
+      val joinPartNodeOpt = for(
+        parts <- Option(join.joinParts).toSeq;
+        part <- parts.toScala
+      ) yield {
+          val partOutputTable = RelevantLeftForJoinPart.partTableName(join, part)
+          val result = new JoinPartNode()
+          result.setLeftSourceTable(sourceMetadata.outputTable)
+          result.setJoinPart(part)
+          val joinNodes = for(
+            source <- part.groupBy.sources.toScala if source.isSetJoinSource;
+            joinSource = source.getJoinSource
+            effectiveSource =
+          ) yield {
+            val
+            JoinConf(joinSource.getJoin).offlineNodes
+          }
+          result.setLeftDataModel(join.left.dataModel)
+          result.setMetaData(metaDataCopy().setName(partOutputTable))
+          JoinPart(result)
+      }
+
+      val mergeNode = {
+        val result = new JoinMergeNode()
+        val joinCopyObj = joinCopy()
+        joinCopyObj.unsetDerivations()
+        joinCopyObj.unsetLabelParts() // remove join parts to avoid double counting
+        result.setJoin(joinCopyObj)
+        result.setMetaData(metaDataCopy().setName(join.metaData.getName + ".merge"))
+        JoinMerge(result)
+      }
+
+      val derivationNode = {
+        val result = new JoinDerivationNode()
+        result.setJoin(join)
+        result.setMetaData(metaDataCopy().setName(join.metaData.getName + ".derivations"))
+        JoinDerivation(result)
+      }
+
+      val labelPartNodeOpt = Option(join.labelParts).map { labelParts =>
+
+        val result = new LabelPartNode()
+        result.setJoin(join)
+        result.setMetaData(metaDataCopy().setName(join.metaData.getName + ".label_parts"))
+        LabelPart(result)
+      }
+
+      (bootstrapNodeOpt ++
+        Seq(sourceNode) ++
+        joinPartNodeOpt ++
+        Seq(mergeNode) ++
+        Seq(derivationNode) ++
+        labelPartNodeOpt).toSeq
+    }


⚠️ Potential issue

Incomplete logic & potential syntax error.
The offline node building is comprehensive but lines 178-181 appear broken. Ensure valid code and properly handle derived offline nodes.

- val - JoinConf(joinSource.getJoin).offlineNodes + // Example fix: gather nodes or remove incomplete snippet + val subNodes = JoinConf(joinSource.getJoin).offlineNodes

Committable suggestion skipped: line range outside the PR's diff.

coderabbitai

Actionable comments posted: 0

♻️ Duplicate comments (1)

api/src/main/scala/ai/chronon/api/planner/ConfToNode.scala (1)

175-179: Incomplete code snippet
Similar to a previous review comment: lines are commented out and appear broken. Either remove or complete them.

🧹 Nitpick comments (4)

api/src/main/scala/ai/chronon/api/planner/ConfToNode.scala (4)

29-31: Implement Conf.from.
Currently returns ???; provide a concrete implementation for loading configs from filePath.

41-129: Fill in the ??? methods.
All node subclasses return ???; finalize or provide stubs to avoid runtime errors.

218-218: Implement onlineNodes logic.
It's unimplemented; define how online nodes are built or return an empty sequence if not applicable.

250-251: Add usage or remove trait Dependency.
It is currently empty; clarify its role or remove it if unnecessary.

📜 Review details

Configuration used: CodeRabbit UI
Review profile: CHILL
Plan: Pro (Legacy)

📥 Commits

Reviewing files that changed from the base of the PR and between e808406 and bc59828.

📒 Files selected for processing (30)

api/src/main/scala/ai/chronon/api/DataModel.scala (0 hunks)
api/src/main/scala/ai/chronon/api/Extensions.scala (6 hunks)
api/src/main/scala/ai/chronon/api/planner/ConfToNode.scala (1 hunks)
api/src/main/scala/ai/chronon/api/planner/DependencyResolver.scala (1 hunks)
api/src/main/scala/ai/chronon/api/planner/RelevantLeftForJoinPart.scala (3 hunks)
api/src/test/scala/ai/chronon/api/test/DependencyResolverSpec.scala (1 hunks)
api/src/test/scala/ai/chronon/api/test/RelevantLeftForJoinPartSpec.scala (1 hunks)
api/thrift/api.thrift (1 hunks)
api/thrift/orchestration.thrift (2 hunks)
flink/src/main/scala/ai/chronon/flink/AvroCodecFn.scala (2 hunks)
online/src/main/scala/ai/chronon/online/DataStreamBuilder.scala (3 hunks)
online/src/main/scala/ai/chronon/online/GroupByServingInfoParsed.scala (1 hunks)
online/src/main/scala/ai/chronon/online/fetcher/GroupByResponseHandler.scala (2 hunks)
online/src/test/scala/ai/chronon/online/test/DataStreamBuilderTest.scala (1 hunks)
orchestration/src/main/scala/ai/chronon/orchestration/RepoIndex.scala (0 hunks)
orchestration/src/main/scala/ai/chronon/orchestration/RepoNodes.scala (0 hunks)
orchestration/src/main/scala/ai/chronon/orchestration/RepoParser.scala (0 hunks)
orchestration/src/main/scala/ai/chronon/orchestration/RepoTypes.scala (0 hunks)
orchestration/src/main/scala/ai/chronon/orchestration/VersionUpdate.scala (0 hunks)
orchestration/src/main/scala/ai/chronon/orchestration/logical/GroupByNodeImpl.scala (0 hunks)
orchestration/src/main/scala/ai/chronon/orchestration/logical/JoinNodeImpl.scala (0 hunks)
orchestration/src/main/scala/ai/chronon/orchestration/logical/LogicalNodeImpl.scala (0 hunks)
orchestration/src/main/scala/ai/chronon/orchestration/logical/ModelNodeImpl.scala (0 hunks)
orchestration/src/main/scala/ai/chronon/orchestration/logical/StagingQueryNodeImpl.scala (0 hunks)
orchestration/src/main/scala/ai/chronon/orchestration/logical/TabularDataNodeImpl.scala (0 hunks)
orchestration/src/main/scala/ai/chronon/orchestration/physical/ComputeNodeRunner.scala (0 hunks)
orchestration/src/main/scala/ai/chronon/orchestration/physical/GroupByBackfill.scala (0 hunks)
orchestration/src/main/scala/ai/chronon/orchestration/physical/GroupByPrepareUpload.scala (0 hunks)
orchestration/src/main/scala/ai/chronon/orchestration/physical/JoinBackfill.scala (0 hunks)
orchestration/src/main/scala/ai/chronon/orchestration/physical/LabelJoin.scala (0 hunks)

💤 Files with no reviewable changes (17)

orchestration/src/main/scala/ai/chronon/orchestration/physical/GroupByPrepareUpload.scala
orchestration/src/main/scala/ai/chronon/orchestration/RepoNodes.scala
orchestration/src/main/scala/ai/chronon/orchestration/logical/LogicalNodeImpl.scala
orchestration/src/main/scala/ai/chronon/orchestration/physical/LabelJoin.scala
orchestration/src/main/scala/ai/chronon/orchestration/physical/GroupByBackfill.scala
orchestration/src/main/scala/ai/chronon/orchestration/logical/JoinNodeImpl.scala
orchestration/src/main/scala/ai/chronon/orchestration/physical/ComputeNodeRunner.scala
orchestration/src/main/scala/ai/chronon/orchestration/logical/ModelNodeImpl.scala
orchestration/src/main/scala/ai/chronon/orchestration/logical/StagingQueryNodeImpl.scala
orchestration/src/main/scala/ai/chronon/orchestration/RepoIndex.scala
orchestration/src/main/scala/ai/chronon/orchestration/physical/JoinBackfill.scala
orchestration/src/main/scala/ai/chronon/orchestration/logical/TabularDataNodeImpl.scala
orchestration/src/main/scala/ai/chronon/orchestration/RepoParser.scala
api/src/main/scala/ai/chronon/api/DataModel.scala
orchestration/src/main/scala/ai/chronon/orchestration/logical/GroupByNodeImpl.scala
orchestration/src/main/scala/ai/chronon/orchestration/VersionUpdate.scala
orchestration/src/main/scala/ai/chronon/orchestration/RepoTypes.scala

✅ Files skipped from review due to trivial changes (1)

api/src/test/scala/ai/chronon/api/test/DependencyResolverSpec.scala

🚧 Files skipped from review as they are similar to previous changes (10)

online/src/main/scala/ai/chronon/online/GroupByServingInfoParsed.scala
api/thrift/api.thrift
flink/src/main/scala/ai/chronon/flink/AvroCodecFn.scala
online/src/main/scala/ai/chronon/online/fetcher/GroupByResponseHandler.scala
online/src/test/scala/ai/chronon/online/test/DataStreamBuilderTest.scala
api/src/main/scala/ai/chronon/api/Extensions.scala
api/src/main/scala/ai/chronon/api/planner/RelevantLeftForJoinPart.scala
online/src/main/scala/ai/chronon/online/DataStreamBuilder.scala
api/src/test/scala/ai/chronon/api/test/RelevantLeftForJoinPartSpec.scala
api/thrift/orchestration.thrift

🧰 Additional context used

🧬 Code Definitions (2)

api/src/main/scala/ai/chronon/api/planner/DependencyResolver.scala (1)

api/src/main/scala/ai/chronon/api/Extensions.scala (4)

Extensions (40-1262)

SourceOps (331-444)

WindowUtils (85-126)

convertUnits (116-123)

api/src/main/scala/ai/chronon/api/planner/ConfToNode.scala (4)

api/src/main/scala/ai/chronon/api/Extensions.scala (8)

Extensions (40-1262)

MetadataOps (128-177)

SourceOps (331-444)

table (386-386)

from (264-305)

outputTable (131-131)

dataModel (332-337)

dataModel (468-475)

api/src/main/scala/ai/chronon/api/ThriftJsonCodec.scala (2)

ThriftJsonCodec (41-137)

toPrettyJsonStr (56-60)

api/src/main/scala/ai/chronon/api/planner/RelevantLeftForJoinPart.scala (2)

RelevantLeftForJoinPart (16-24)

RelevantLeftForJoinPart (26-92)

api/src/main/scala/ai/chronon/api/Builders.scala (1)

joinSource (132-139)

⏰ Context from checks skipped due to timeout of 90000ms (18)

GitHub Check: streaming_tests
GitHub Check: spark_tests
GitHub Check: streaming_tests
GitHub Check: join_tests
GitHub Check: spark_tests
GitHub Check: join_tests
GitHub Check: groupby_tests
GitHub Check: batch_tests
GitHub Check: fetcher_tests
GitHub Check: fetcher_tests
GitHub Check: batch_tests
GitHub Check: non_spark_tests
GitHub Check: groupby_tests
GitHub Check: analyzer_tests
GitHub Check: non_spark_tests
GitHub Check: analyzer_tests
GitHub Check: scala_compile_fmt_fix
GitHub Check: enforce_triggered_workflows

🔇 Additional comments (3)

api/src/main/scala/ai/chronon/api/planner/DependencyResolver.scala (2)

1-1: Package moved to planner namespace

Package structure rearrangement. Matches PR's structural changes.

5-5: Explicit import of convertUnits

More specific import improves clarity. Only imports what's needed.

api/src/main/scala/ai/chronon/api/planner/ConfToNode.scala (1)

20-27: Refine the Conf trait's contract.
These methods are placeholders; ensure documentation or usage examples clarify expected inputs/outputs.

## Summary - #586 did a refactor on the naming of internal tables - we need to update the Driver code to account for this. ## Checklist - [ ] Added Unit Tests - [ ] Covered by existing CI - [ ] Integration tested - [ ] Documentation update  ## Summary by CodeRabbit - **Bug Fixes** - Improved handling of table name parsing by changing the delimiter from a dot to a double underscore when extracting namespace and table name. - Updated the method for computing source output table names to enhance accuracy in table identification.   Co-authored-by: Thomas Chow <[email protected]>

## Summary removed some dead code - we are going to introduce planner in a later PR - this PR refactors code around to make the next change easy. ## Checklist - [ ] Added Unit Tests - [x] Covered by existing CI - [ ] Integration tested - [ ] Documentation update  ## Summary by CodeRabbit - **New Features** - Introduced a new configuration framework that enables enhanced pipeline asset management and clearer node definitions. - Added an updated data model enumeration for improved data categorization. - **Refactor** - Standardized data model identifiers across the system (switching from lower-case to uppercase constants) to ensure consistency. - Updated join table naming logic and reorganized internal components for improved clarity and maintainability. - Streamlined the orchestration subsystem by removing deprecated modules, enhancing overall system performance and reliability.

## Summary - #586 did a refactor on the naming of internal tables - we need to update the Driver code to account for this. ## Checklist - [ ] Added Unit Tests - [ ] Covered by existing CI - [ ] Integration tested - [ ] Documentation update  ## Summary by CodeRabbit - **Bug Fixes** - Improved handling of table name parsing by changing the delimiter from a dot to a double underscore when extracting namespace and table name. - Updated the method for computing source output table names to enhance accuracy in table identification.   Co-authored-by: Thomas Chow <[email protected]>

## Summary removed some dead code - we are going to introduce planner in a later PR - this PR refactors code around to make the next change easy. ## Checklist - [ ] Added Unit Tests - [x] Covered by existing CI - [ ] Integration tested - [ ] Documentation update  ## Summary by CodeRabbit - **New Features** - Introduced a new configuration framework that enables enhanced pipeline asset management and clearer node definitions. - Added an updated data model enumeration for improved data categorization. - **Refactor** - Standardized data model identifiers across the system (switching from lower-case to uppercase constants) to ensure consistency. - Updated join table naming logic and reorganized internal components for improved clarity and maintainability. - Streamlined the orchestration subsystem by removing deprecated modules, enhancing overall system performance and reliability.

## Summary - #586 did a refactor on the naming of internal tables - we need to update the Driver code to account for this. ## Checklist - [ ] Added Unit Tests - [ ] Covered by existing CI - [ ] Integration tested - [ ] Documentation update  ## Summary by CodeRabbit - **Bug Fixes** - Improved handling of table name parsing by changing the delimiter from a dot to a double underscore when extracting namespace and table name. - Updated the method for computing source output table names to enhance accuracy in table identification.   Co-authored-by: Thomas Chow <[email protected]>

## Summary removed some dead code - we are going to introduce planner in a later PR - this PR refactors code around to make the next change easy. ## Checklist - [ ] Added Unit Tests - [x] Covered by existing CI - [ ] Integration tested - [ ] Documentation update  ## Summary by CodeRabbit - **New Features** - Introduced a new configuration framework that enables enhanced pipeline asset management and clearer node definitions. - Added an updated data model enumeration for improved data categorization. - **Refactor** - Standardized data model identifiers across the system (switching from lower-case to uppercase constants) to ensure consistency. - Updated join table naming logic and reorganized internal components for improved clarity and maintainability. - Streamlined the orchestration subsystem by removing deprecated modules, enhancing overall system performance and reliability.

## Summary - #586 did a refactor on the naming of internal tables - we need to update the Driver code to account for this. ## Checklist - [ ] Added Unit Tests - [ ] Covered by existing CI - [ ] Integration tested - [ ] Documentation update  ## Summary by CodeRabbit - **Bug Fixes** - Improved handling of table name parsing by changing the delimiter from a dot to a double underscore when extracting namespace and table name. - Updated the method for computing source output table names to enhance accuracy in table identification.   Co-authored-by: Thomas Chow <[email protected]>

## Summary removed some dead code - we are going to introduce planner in a later PR - this PR refactors code around to make the next change easy. ## Checklist - [ ] Added Unit Tests - [x] Covered by existing CI - [ ] Integration tested - [ ] Documentation update  ## Summary by CodeRabbit - **New Features** - Introduced a new configuration framework that enables enhanced pipeline asset management and clearer node definitions. - Added an updated data model enumeration for improved data categorization. - **Refactor** - Standardized data model identifiers across the system (switching from lower-case to uppercase constants) to ensure consistency. - Updated join table naming logic and reorganized internal components for improved clarity and maintainability. - Streamlined the orchestration subsystem by removing deprecated modules, enhancing overall system performance and reliability.

## Summary - #586 did a refactor on the naming of internal tables - we need to update the Driver code to account for this. ## Checklist - [ ] Added Unit Tests - [ ] Covered by existing CI - [ ] Integration tested - [ ] Documentation update  ## Summary by CodeRabbit - **Bug Fixes** - Improved handling of table name parsing by changing the delimiter from a dot to a double underscore when extracting namespace and table name. - Updated the method for computing source output table names to enhance accuracy in table identification.   Co-authored-by: Thomas Chow <[email protected]>

## Summary removed some dead code - we are going to introduce planner in a later PR - this PR refactors code around to make the next change easy. ## Cheour clientslist - [ ] Added Unit Tests - [x] Covered by existing CI - [ ] Integration tested - [ ] Documentation update  ## Summary by CodeRabbit - **New Features** - Introduced a new configuration framework that enables enhanced pipeline asset management and clearer node definitions. - Added an updated data model enumeration for improved data categorization. - **Refactor** - Standardized data model identifiers across the system (switching from lower-case to uppercase constants) to ensure consistency. - Updated join table naming logic and reorganized internal components for improved clarity and maintainability. - Streamlined the orchestration subsystem by removing deprecated modules, enhancing overall system performance and reliability.

## Summary - #586 did a refactor on the naming of internal tables - we need to update the Driver code to account for this. ## Cheour clientslist - [ ] Added Unit Tests - [ ] Covered by existing CI - [ ] Integration tested - [ ] Documentation update  ## Summary by CodeRabbit - **Bug Fixes** - Improved handling of table name parsing by changing the delimiter from a dot to a double underscore when extracting namespace and table name. - Updated the method for computing source output table names to enhance accuracy in table identification.   Co-authored-by: Thomas Chow <[email protected]>

coderabbitai bot reviewed Apr 4, 2025

View reviewed changes

nikhil-zlai added 4 commits April 7, 2025 14:59

WIP - agent

dbddb7f

WIP - planner

e573823

rebase

5838cf6

compile fix

bc59828

nikhil-zlai force-pushed the nikhil/planner branch from e808406 to bc59828 Compare April 7, 2025 22:44

coderabbitai bot reviewed Apr 7, 2025

View reviewed changes

nikhil-zlai changed the title ~~WIP Nikhil/planner~~ Planner - 1: Refactor code to introduce planner Apr 7, 2025

nikhil-zlai changed the title ~~Planner - 1: Refactor code to introduce planner~~ refactor: prior to introducing planner Apr 7, 2025

varant-zlai approved these changes Apr 7, 2025

View reviewed changes

nikhil-zlai merged commit 73039b9 into main Apr 7, 2025
21 checks passed

nikhil-zlai deleted the nikhil/planner branch April 7, 2025 23:36

tchow-zlai mentioned this pull request Apr 14, 2025

fix: get the full table name properly from Driver #641

Merged

4 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

refactor: prior to introducing planner #586

refactor: prior to introducing planner #586

Uh oh!

nikhil-zlai commented Apr 4, 2025 •

edited

Loading

Uh oh!

coderabbitai bot commented Apr 4, 2025 •

edited

Loading

Review ran into problems

Chat

CodeRabbit Commands (Invoked using PR comments)

Other keywords and placeholders

CodeRabbit Configuration File (`.coderabbit.yaml`)

Documentation and Community

Uh oh!

coderabbitai bot left a comment

Uh oh!

coderabbitai bot Apr 4, 2025

Uh oh!

coderabbitai bot left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

refactor: prior to introducing planner #586

refactor: prior to introducing planner #586

Uh oh!

Conversation

nikhil-zlai commented Apr 4, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Checklist

Summary by CodeRabbit

Uh oh!

coderabbitai bot commented Apr 4, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Walkthrough

Changes

Sequence Diagram(s)

Possibly related PRs

Poem

Review ran into problems

Chat

CodeRabbit Commands (Invoked using PR comments)

Other keywords and placeholders

CodeRabbit Configuration File (.coderabbit.yaml)

Documentation and Community

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot Apr 4, 2025

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

nikhil-zlai commented Apr 4, 2025 •

edited

Loading

coderabbitai bot commented Apr 4, 2025 •

edited

Loading

CodeRabbit Configuration File (`.coderabbit.yaml`)