@codope
Member

@codope codope commented May 20, 2022

What is the purpose of the pull request

(For example: This pull request adds quick-start document.)

Brief change log

(for example:)

  • Modify AnnotationLocation checkstyle rule in checkstyle.xml

Verify this pull request

(Please pick either of the following options)

This pull request is a trivial rework / code cleanup without any test coverage.

(or)

This pull request is already covered by existing tests, such as (please describe tests).

(or)

This change added tests and can be verified as follows:

(example:)

  • Added integration tests for end-to-end.
  • Added HoodieClientWriteTest to verify the change.
  • Manually verified the change by running a job locally.

Committer checklist

  • Has a corresponding JIRA in PR title & commit

  • Commit message is descriptive of the change

  • CI is green

  • Necessary doc changes done or have another open PR

  • For large changes, please consider breaking it into sub-tasks under an umbrella JIRA.

@nsivabalan
Contributor

@codope: what's the follow-up on this?

@nsivabalan nsivabalan added the priority:blocker Production down; release blocker label Jun 9, 2022
@codope
Member Author

codope commented Jul 12, 2022

> @codope : whats the follow up on this

I need to rebase and fix the tests. I will get to that this week.

@codope codope force-pushed the HUDI-4071-config-defaults branch from 8c5174c to cd339e3 Compare July 21, 2022 13:11
@codope codope changed the title [WIP][HUDI-4071] Change defaults for some of the configs [HUDI-4071] Change defaults for some of the configs Jul 22, 2022
  .key("hoodie.embed.timeline.server.async")
- .defaultValue("false")
+ .defaultValue("true")
  .withDocumentation("Controls whether or not, the requests to the timeline server are processed in asynchronous fashion, "
Contributor

What exactly is the effect of this param? I'm worried it could cause data loss like #6179.

Member Author

I see. The intent is to improve throughput by sending requests to the timeline server asynchronously. Internally, we ran a long-running test (30+ commits) with this config and did not see any data loss. Our validation was based on record counts. However, if you have any concerns, let me know. We can keep it false by default for now, since data loss issues are more critical.

Contributor

@danny0405 danny0405 Jul 25, 2022

30+ commits is too few to reproduce the issue. In #6179, we ran about 2000+ commits to reproduce the problem. I would suggest running a test at the same scale before switching the flag.
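
To illustrate why the number of commits matters for count-based validation, here is a minimal sketch. It does not use any Hudi API: `TableUnderTest`, `InMemoryTable`, and `CountValidation` are hypothetical names, and the in-memory fake simply drops one commit at a chosen position to stand in for a rare fault. The commit numbers are illustrative, not taken from #6179.

```java
/** Hypothetical stand-in for a table under test; not a Hudi API. */
interface TableUnderTest {
    void commit(int numRecords); // write one commit of insert-only records
    long count();                // total records visible to a reader
}

/** In-memory fake, used only so the sketch runs on its own. */
class InMemoryTable implements TableUnderTest {
    private final int dropAtCommit; // simulate a lost write at this commit; -1 for never
    private long records = 0;
    private int commits = 0;

    InMemoryTable(int dropAtCommit) {
        this.dropAtCommit = dropAtCommit;
    }

    @Override
    public void commit(int numRecords) {
        commits++;
        if (commits != dropAtCommit) {
            records += numRecords;
        }
    }

    @Override
    public long count() {
        return records;
    }
}

class CountValidation {
    /** Returns the 1-based index of the first commit whose count is off, or -1 if all match. */
    static int firstMismatch(TableUnderTest table, int numCommits, int recordsPerCommit) {
        long expected = 0;
        for (int i = 1; i <= numCommits; i++) {
            table.commit(recordsPerCommit);
            expected += recordsPerCommit;
            if (table.count() != expected) {
                return i;
            }
        }
        return -1;
    }

    public static void main(String[] args) {
        // A fault at commit 1500 is invisible to a 30-commit run...
        System.out.println(firstMismatch(new InMemoryTable(1500), 30, 100));
        // ...but is caught by a 2000-commit run.
        System.out.println(firstMismatch(new InMemoryTable(1500), 2000, 100));
    }
}
```

In this sketch the short run reports no mismatch while the long run flags the faulty commit, which is the concern raised above about validating with only 30+ commits.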

- BULK_INSERT_SORT_MODE from GLOBAL_SORT to NONE
- RECONCILE_SCHEMA from false to true
- Match ROLLBACK_USING_MARKERS_ENABLE in spark sql as spark datasource
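
For jobs that should not be affected by default changes like these, one option is to set the values explicitly. The sketch below is plain Java with no Hudi dependency: the two keys are copied from the diff in this PR, the "false" values are assumed to be the pre-PR defaults based on the order of the diff lines, and `PinnedOptions` and `resolve` are hypothetical helpers that only model "explicit value wins over library default".

```java
import java.util.HashMap;
import java.util.Map;

class PinnedOptions {
    /** Options pinned to the assumed pre-PR defaults (keys taken from the diff). */
    static Map<String, String> pinned() {
        Map<String, String> opts = new HashMap<>();
        opts.put("hoodie.embed.timeline.server.async", "false");
        opts.put("hoodie.datasource.write.reconcile.schema", "false");
        return opts;
    }

    /** Hypothetical resolver: an explicit value wins, otherwise the library default applies. */
    static String resolve(Map<String, String> userOpts, String key, String libraryDefault) {
        return userOpts.getOrDefault(key, libraryDefault);
    }

    public static void main(String[] args) {
        String key = "hoodie.datasource.write.reconcile.schema";
        // Even if the library default flips to "true", the pinned value is used.
        System.out.println(resolve(pinned(), key, "true"));
        // A job that did not pin the key follows whatever the default is.
        System.out.println(resolve(new HashMap<>(), key, "true"));
    }
}
```

The resulting map could be passed wherever the job already supplies its write options; how that is wired up depends on the engine and is not shown here.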
@hudi-bot
Collaborator

CI report:

Bot commands @hudi-bot supports the following commands:
  • @hudi-bot run azure re-run the last Azure build

  public static final ConfigProperty<Boolean> RECONCILE_SCHEMA = ConfigProperty
  .key("hoodie.datasource.write.reconcile.schema")
- .defaultValue(false)
+ .defaultValue(true)
Member

  .withPath(basePath)
  .withIndexConfig(HoodieIndexConfig.newBuilder.withIndexType(IndexType.BLOOM).build)
- .withRollbackUsingMarkers(false)
+ .withRollbackUsingMarkers(HoodieWriteConfig.ROLLBACK_USING_MARKERS_ENABLE.defaultValue.toBoolean)
Member

@xushiyan xushiyan closed this Jul 25, 2022
Labels

priority:blocker Production down; release blocker

5 participants