
KAFKA-6647 KafkaStreams.cleanUp creates .lock file in directory it tries to clean #5650

Closed
wants to merge 3 commits

Conversation

tedyu
Contributor

@tedyu tedyu commented Sep 14, 2018

Specify StandardOpenOption#DELETE_ON_CLOSE when creating the FileChannel.

Move lock file up one level.
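A minimal, self-contained sketch of the first part of the change (not the actual Kafka code — `LockFileDemo` and its methods are hypothetical; only `StandardOpenOption.DELETE_ON_CLOSE` and the `.lock` file name come from the PR): opening the lock file with `DELETE_ON_CLOSE` means closing the channel removes the file, so `cleanUp` no longer leaves a fresh `.lock` behind in the directory it just cleaned.

```java
import java.io.File;
import java.io.IOException;
import java.io.UncheckedIOException;
import java.nio.channels.FileChannel;
import java.nio.channels.FileLock;
import java.nio.file.Files;
import java.nio.file.StandardOpenOption;

public class LockFileDemo {
    static final String LOCK_FILE_NAME = ".lock"; // same name StateDirectory uses

    // Lock and unlock a task directory, then report whether the lock file
    // was left behind. With DELETE_ON_CLOSE it is removed when the channel closes.
    public static boolean lockFileLeftBehind(File taskDirectory) {
        File lockFile = new File(taskDirectory, LOCK_FILE_NAME);
        try (FileChannel channel = FileChannel.open(
                lockFile.toPath(),
                StandardOpenOption.CREATE,
                StandardOpenOption.WRITE,
                StandardOpenOption.DELETE_ON_CLOSE)) { // deleted when channel closes
            FileLock lock = channel.tryLock();
            if (lock != null) {
                lock.release();
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return lockFile.exists();
    }

    // Helper for demos/tests: a throwaway directory standing in for a task dir.
    static File tempTaskDir() {
        try {
            return Files.createTempDirectory("task-0_0").toFile();
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        System.out.println(lockFileLeftBehind(tempTaskDir())); // prints false
    }
}
```

This also explains the test-count changes discussed below: directories no longer contain a lingering `.lock` file after the channel is closed.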

Committer Checklist (excluded from commit message)

  • Verify design and implementation
  • Verify test coverage and CI build status
  • Verify documentation (including upgrade notes)

@mjsax
Member

mjsax commented Oct 3, 2018

@guozhangwang I was thinking about your compatibility concerns. Could we fix it with the following approach: we encode which lock structure to use in the rebalance protocol (we can simply bump the version) -- if at least one instance is on the old version, we still use the old locks -- after all instances are on the new version, we switch from the old lock files to the new lock files (for this, the code must hold the old lock, get the new lock, then release the old lock).

Thoughts?
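The handoff described above can be sketched as follows (a toy illustration under assumptions -- `LockHandoff`, `switchLocks`, and the file names are all hypothetical, not Kafka code): the new lock is acquired while the old one is still held, and only then is the old lock released, so the state directory is never unprotected during the switch.

```java
import java.io.IOException;
import java.io.UncheckedIOException;
import java.nio.channels.FileChannel;
import java.nio.channels.FileLock;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.StandardOpenOption;

public class LockHandoff {
    // Switch from the old lock file to the new one without a window in which
    // neither lock is held. Returns false if either lock cannot be acquired.
    public static boolean switchLocks(Path oldLockFile, Path newLockFile) {
        try (FileChannel oldCh = FileChannel.open(oldLockFile,
                 StandardOpenOption.CREATE, StandardOpenOption.WRITE);
             FileChannel newCh = FileChannel.open(newLockFile,
                 StandardOpenOption.CREATE, StandardOpenOption.WRITE)) {
            FileLock oldLock = oldCh.tryLock();
            if (oldLock == null) {
                return false;                 // someone else holds the old lock
            }
            FileLock newLock = newCh.tryLock(); // grab new lock while old is held
            if (newLock == null) {
                oldLock.release();
                return false;
            }
            oldLock.release();                // only now give up the old lock
            newLock.release();                // a real impl would keep holding it
            return true;
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) throws IOException {
        Path dir = Files.createTempDirectory("state-dir");
        System.out.println(switchLocks(dir.resolve("task.lock"), dir.resolve(".lock")));
    }
}
```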


try (
    final FileChannel channel = FileChannel.open(
        new File(taskDirectory, StateDirectory.LOCK_FILE_NAME).toPath(),
Member


Why do we remove this test? It seems we should update the FileChannel here to use the new lock file name?

@@ -196,15 +176,14 @@ public void shouldCleanUpTaskStateDirectoriesThatAreNotCurrentlyLocked() throws
directory.directoryForTask(new TaskId(2, 0));

List<File> files = Arrays.asList(appDir.listFiles());
-        assertEquals(3, files.size());
+        assertEquals(1, files.size());
Member


Why this? Shouldn't directory.lock(task0); and directory.lock(task1); have created a lock file each?

Contributor Author


Will dig some more on these tests once Guozhang confirms the plan.

Contributor Author


This was due to specifying StandardOpenOption.DELETE_ON_CLOSE in the FileChannel.open call.

Contributor Author


@mjsax
I want to get your opinion on whether StandardOpenOption.DELETE_ON_CLOSE should be kept in the PR.
This would affect how the test is modified.

Member


Sorry for the late reply. The last few weeks were a little crazy and I did not find time earlier.

I cannot remember why we want to add the DELETE_ON_CLOSE option. Can you refresh my memory?

Also, I am not sure why this option reduced the file count. I understand that the task directories are not created any longer; however, we moved both lock files up the hierarchy, so shouldn't the count stay the same?

Also, did you see this older comment: #5650 (comment)? For a clean upgrade path, addressing this issue is important.

@guozhangwang
Contributor

@guozhangwang I was thinking about your compatibility concerns. Could we fix it with the following approach: we encode which lock structure to use in the rebalance protocol (we can simply bump the version) -- if at least one instance is on the old version, we still use the old locks -- after all instances are on the new version, we switch from the old lock files to the new lock files (for this, the code must hold the old lock, get the new lock, then release the old lock).

Just to clarify: this decision is made on the leader side when assigning partitions, right? If so, that sounds good to me.

@mjsax
Member

mjsax commented Oct 5, 2018

Yes, the consumer group leader collects all consumer versions, and downgrades via version probing if necessary.

@tedyu
Contributor Author

tedyu commented Nov 5, 2018

I ran the tests that failed in the Java 11 build locally, and they passed.

@tedyu
Contributor Author

tedyu commented Nov 5, 2018

Regarding the consumer group leader collecting consumer versions, a few more pointers would be appreciated.

Thanks

@tedyu
Contributor Author

tedyu commented Nov 5, 2018

The failed tests in the JDK 8 run were not related to this PR.

@mjsax
Member

mjsax commented Nov 6, 2018

The rebalance version thing is basically based on KIP-268 https://cwiki.apache.org/confluence/display/KAFKA/KIP-268%3A+Simplify+Kafka+Streams+Rebalance+Metadata+Upgrade

We can exploit this by bumping the version number to indicate a change (i.e., we need to bump the number in SubscriptionInfo and AssignmentInfo and update StreamsPartitionAssignor#assign() accordingly so it handles the new version). The actual metadata does not need to change for this. With version probing, the AssignmentInfo will contain the used metadata version, and as long as we receive a downgraded metadata version number, we use the old locking scheme. Only if we receive the new metadata version do we switch to the new locking scheme. Thus, StreamsPartitionAssignor#onAssignment() must evaluate the metadata version and provide this information to the StateDirectory class (I guess a boolean flag, "useOldLocking", should be sufficient).

Does this make sense? I can provide more details if necessary. Hope it's a good starting point for you to dig into it.
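The idea above can be modeled in a few lines (a toy sketch, not Kafka code -- `LockingSchemeProbe`, `NEW_LOCKING_VERSION`, and the integer-version API are all assumptions for illustration; only the class names SubscriptionInfo, AssignmentInfo, and StreamsPartitionAssignor come from the discussion): the leader downgrades the metadata version if any instance is old, and the onAssignment handler derives the `useOldLocking` flag from the version it receives.

```java
public class LockingSchemeProbe {
    static final int NEW_LOCKING_VERSION = 5; // hypothetical bumped version

    private Boolean useOldLocking = null;     // unset until the first assignment

    // Stand-in for StreamsPartitionAssignor#onAssignment(): evaluate the
    // metadata version shipped in AssignmentInfo and record the lock scheme.
    public void onAssignment(int receivedMetadataVersion) {
        useOldLocking = receivedMetadataVersion < NEW_LOCKING_VERSION;
    }

    // The flag that would be forwarded to StateDirectory.
    public boolean useOldLocking() {
        if (useOldLocking == null) {
            throw new IllegalStateException("locking scheme not set yet");
        }
        return useOldLocking;
    }
}
```

For example, receiving version 4 (a downgraded number) would keep the old locking scheme, while version 5 would switch to the new one.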

@tedyu
Contributor Author

tedyu commented Nov 17, 2018

Thanks for the hint.
Looking at StreamsPartitionAssignor#onAssignment(), I am trying to find out how the locking scheme can be passed to the StateDirectory class.

@tedyu
Contributor Author

tedyu commented Nov 17, 2018

Currently, taskManager#taskCreator is not accessible to the onAssignment method.
Should a getter for stateDirectory be added to TaskManager for passing the lock scheme (stateDirectory is a field of AbstractTaskCreator)?

@tedyu
Contributor Author

tedyu commented Nov 17, 2018

StateDirectory is constructed and passed to StreamThread.
If the lock method is called before assignment, only the old lock scheme can be used, right?

@mjsax
Member

mjsax commented Nov 20, 2018

If the lock method is called before assignment, only the old lock scheme can be used, right?

I think this should never happen, because we only lock a task directory after the task was assigned. Maybe we can add a guard in StateDirectory to avoid bugs: initialize a variable that tracks the locking schema to "UNKNOWN" and set it to the concrete locking schema in StreamsPartitionAssignor#onAssignment() -- if lock is called while the schema is UNKNOWN, we throw an IllegalStateException.

For the first question, it seems OK to me to add a method to TaskManager to set the locking schema and let the TaskManager "forward" it -- this way we would not pass the StateDirectory into StreamsPartitionAssignor#onAssignment().

Does this help?
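The guard suggested above can be sketched like this (a hypothetical stand-in, not the real StateDirectory class -- `GuardedStateDirectory`, `LockingSchema`, and `setLockingSchema` are all illustrative names): locking is refused with an IllegalStateException until the schema has been set from onAssignment.

```java
public class GuardedStateDirectory {
    public enum LockingSchema { UNKNOWN, OLD, NEW }

    private volatile LockingSchema schema = LockingSchema.UNKNOWN;

    // Would be called (forwarded via TaskManager) from
    // StreamsPartitionAssignor#onAssignment().
    public void setLockingSchema(LockingSchema newSchema) {
        schema = newSchema;
    }

    public boolean lock(String taskId) {
        if (schema == LockingSchema.UNKNOWN) {
            throw new IllegalStateException(
                "lock(" + taskId + ") called before a locking schema was assigned");
        }
        // ... real file locking would happen here, using the chosen schema
        return true;
    }
}
```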

@jukkakarvanen
Contributor

@mjsax , @bbejeck

I have been looking into KAFKA-6647 with the two-locking-policies approach, and I have a draft version available:
trunk...jukkakarvanen:KAFKA-6647_MultipleLockings

Some questions:
There seems to be one shared StateDirectory per KafkaStreams instance, so multiple threads and tasks share it.

Can this useOldLocking field be one per StateDirectory, or should it be set per TaskId or StreamThread?

Where is this locking of StateDirectory used?
I found it in AbstractTask#registerStateStores, which locks the task; in closeStateManager, which unlocks it; and in task removal in StateDirectory. Based on Matthias's comment above, onAssignment should always happen before locking, i.e., before registerStateStores or removal. In my current version, I made useOldLocking a map keyed by taskId, with logic that throws an error if the locking policy is not set for a task.
This revealed at least one failing scenario, in ResetIntegrationTest, where KafkaStreams is created twice and the cleanup of the second KafkaStreams tries to remove -- and thereby lock -- tasks based on the directory structure, without their locking policy having been set.

If a rebalance is happening and a new onAssignment arrives, is closeStateManager called before it, or can the task already be locked?
This relates to whether we need to migrate new locks to old locks, which locks need to be migrated at the same time, and whether old locks need to be migrated to new locks.
What are the scenarios where these migrations need to happen?

  • New lock to old lock: an application using the old Streams API connects
  • Old lock to new lock: all applications using the old Streams API have disconnected

Regarding AssignmentInfo and SubscriptionInfo:
Is it possible that some assignments are handled only by new apps, and therefore get a newer version and different locking than assignments shared between old and new applications?

@mjsax
Member

mjsax commented Dec 29, 2022

Closing this stale PR. The corresponding Jira ticket is marked as "resolved" already.

@mjsax mjsax closed this Dec 29, 2022