Simplify Test and Dependency Scheduling #2759

thomhurst · 2025-08-04T21:44:45Z

No description provided.

…mplified scheduling This major redesign simplifies test processing by: 1. **Eager Dependency Execution**: Tests now execute their dependencies immediately when they run, rather than complex pre-execution dependency coordination. Dependencies are stored as properties within tests and executed first when needed. 2. **Task Reuse for Dependencies**: Multiple tests depending on the same test now reuse the same execution task, stored on TestContext.ExecutionTask, ensuring dependencies aren't started more than once. 3. **Proceed-on-Failure Support**: Added ProceedOnFailure property to TestDependency and updated DependsOnAttribute to preserve this flag, allowing tests to continue when dependencies fail if explicitly configured. 4. **Simplified Scheduler Logic**: Replaced complex channel-based producer-consumer system with simple queues: - SimpleTestQueues: 4 queue types for different constraint patterns - SimpleConsumerManager: Simple polling loop with natural completion detection - Removed 575 lines of complex channel coordination code, replaced with 342 lines of straightforward queue logic (40% reduction) 5. **Natural Queue Drainage**: Eliminated counting logic and external signaling. Consumers naturally exit when queues are empty and no tests are running. 6. **Removed Redundant Logic**: - Eliminated dependency graph setup and completion tracking - Removed complex routing logic - all tests route immediately - Simplified constraint handling with direct queue-based distribution The new architecture is more synchronous, less prone to race conditions, and significantly easier to understand and maintain. Tests execute dependencies eagerly, reuse execution tasks efficiently, and the scheduler focuses purely on constraint-based work distribution. 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <[email protected]>

…uler - Deleted ChannelConsumerManager and ProducerConsumerTestScheduler classes. - Simplified test scheduling by removing outdated channel-based producer-consumer logic.

This commit fixes critical issues discovered during testing: 1. **Object Disposal Double-Decrementing Bug**: - Removed redundant `DecrementAndDisposeTrackedObjectsAsync` method - ObjectTracker already handles disposal automatically via TestContext.Events.OnDispose - Was causing objects to be disposed too early when shared between dependency and main tests - Fixed "Expected fixture.IsDisposed to be equal to False but found True" error 2. **Priority and Order Scheduling Fixes**: - Fixed priority ordering by sorting within constraint groups instead of globally - Fixed NotInParallel Order attribute handling by sorting Order first, then Priority - Implemented proper sorting for each constraint type: * Parallel tests: Priority only (can run concurrently) * Global NotInParallel: Order first, then Priority * Keyed NotInParallel: Order first, then Priority within each key * Parallel Groups: Preserve existing order logic - Used simple upfront sorting approach instead of complex priority queue logic 3. **Improved Queue Management**: - Process and enqueue each constraint group separately with proper sorting - Maintain FIFO order within queues after proper sorting - Cleaner separation of concerns between sorting and queueing These fixes ensure that: - Shared objects between dependency and main tests are properly managed - High priority tests execute before lower priority tests - NotInParallel tests with Order attributes execute in the correct sequence - The simplified scheduler respects all original TUnit constraints and behaviors 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <[email protected]>

Changed parameters dictionary from Dictionary<string, string> to Dictionary<string, List<string>> to allow multiple values for the same parameter key. This resolves the "An item with the same key has already been added" error when the same parameter is passed multiple times. 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <[email protected]>

…cquisition The previous implementation had a race condition where multiple consumers could dequeue tests from the same constraint key, but only one could acquire the constraint lock, causing other tests to be lost forever. This fix implements atomic constraint acquisition by: - Using queue.Peek() to check tests without removing them - Acquiring constraints before dequeuing - Only dequeuing if constraint acquisition succeeds - Using proper locking to make the entire operation atomic Also changed AssertOrder from a test with Order=6 to an [After(Class)] hook to properly validate test execution order. This ensures NotInParallel tests execute sequentially in the correct order without any tests being lost due to race conditions. 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <[email protected]>

claude · 2025-08-04T21:45:01Z

Claude finished @thomhurst's task —— View job

Code Review: Simplify Test and Dependency Scheduling

I've completed a comprehensive review of this pull request which replaces the complex ProducerConsumer scheduler with a simpler queue-based approach. Here's my detailed feedback:

Progress:

Read repository setup instructions from CLAUDE.md
Analyze the architectural changes from ProducerConsumer to SimpleQueue scheduling
Review code quality and best practices in new scheduling classes
Identify potential bugs or issues in the implementation
Assess performance implications of the scheduling changes
Check for security concerns in the new implementation
Evaluate test coverage and identify testing gaps
Provide comprehensive review feedback via GitHub comment

🔍 Overall Assessment

This is a significant architectural improvement that simplifies the test scheduling system while maintaining functionality. The refactor removes ~727 lines of complex channel-based code and replaces it with ~708 lines of cleaner queue-based logic.

✅ Positive Changes

Architecture Simplification

Removed complex channel infrastructure: Eliminated ProducerConsumerTestScheduler and ChannelConsumerManager (727 lines)
Cleaner abstractions: New SimpleQueueTestScheduler, SimpleTestQueues, and SimpleConsumerManager are more readable
Better separation of concerns: Each class has a focused responsibility

Code Quality Improvements

Consistent null checking: Proper argument validation throughout
Good error handling: Appropriate try-catch blocks with cleanup logic
Clear naming: Method and variable names are descriptive
Immutable data structures: TestExecutionData uses required properties

⚠️ Issues Identified

1. Critical Race Condition in `SimpleTestQueues.cs:63-156`

private TestExecutionData? TryDequeueTestInternal(...)
{
    // Potential race: Multiple threads could pass constraint check simultaneously
    if (!runningConstraints.Contains(key)) // Thread A checks
    {
        // Thread B could acquire same constraint here
        lock (lockObj) // Then both try to acquire lock
        {
            // Both threads might process same constraint
        }
    }
}

Risk: Tests with same constraint could run concurrently when they shouldn't.

Recommendation: Move constraint checking inside the lock or use atomic operations.

2. Inconsistent Locking Strategy in `SimpleTestQueues.cs`

The class mixes different locking approaches:

ConcurrentQueue<T> for unconstrained tests (lock-free)
Queue<T> with locks for constrained tests
ConcurrentDictionary<string, object> for lock objects

Recommendation: Consider using ConcurrentQueue<T> consistently or document why different approaches are needed.

3. Potential Memory Leak in `SimpleConsumerManager.cs:88`

await Task.Delay(50, cancellationToken); // Fixed 50ms delay

Issues:

Fixed delay could cause unnecessary CPU usage
No exponential backoff for empty queues
Could delay test completion in edge cases

Recommendation: Implement exponential backoff or use SemaphoreSlim for better signaling.

4. Unused Method in `SimpleConsumerManager.cs:119-181`

The TryAcquireConstraintsAsync method is defined but never called, creating dead code.

Recommendation: Remove or integrate this method if it's intended for future use.

5. Constraint Acquisition Logic Duplication

Similar constraint checking logic appears in multiple places:

SimpleTestQueues.TryDequeueTestInternal() (lines 96-126)
SimpleConsumerManager.TryAcquireConstraintsAsync() (lines 135-159)

Recommendation: Extract to a shared ConstraintManager class.

🚀 Performance Considerations

Improvements

Reduced complexity: Simpler queue operations vs. channel management
Better cache locality: Fewer indirections in hot paths
Lower memory overhead: Removed complex channel infrastructure

Concerns

Polling overhead: 50ms delay in consumer loop could impact latency
Lock contention: Multiple queues with different locking strategies
O(n) constraint checking: Linear search through running constraints

Recommendation: Consider using BlockingCollection<T> or producer/consumer pattern with signaling.

🔒 Security Assessment

Low Risk: No security concerns identified. The code:

Properly handles cancellation tokens
Has appropriate exception handling
Doesn't expose sensitive data
Uses safe collection types

🧪 Test Coverage Analysis

Current Coverage

Basic scheduling scenarios appear covered by existing tests
Dependency execution logic has comprehensive tests
Parallel constraint handling has test cases

Gaps Identified

Race condition testing: No tests for concurrent constraint acquisition
Edge case coverage: Empty queues, rapid scheduling/descheduling
Performance testing: No load tests for new scheduler
Timeout handling: Limited coverage of timeout scenarios

Recommendation: Add integration tests specifically for the new scheduler implementation.

🔧 Specific Code Suggestions

SimpleTestQueues.cs:86-87

// Current: Potential race condition
if (!runningConstraints.Contains(key))

// Suggested: Atomic constraint checking
lock (constraintLock)
{
    if (!runningConstraints.Contains(key))
    {
        // Constraint acquisition logic here
    }
}

SimpleConsumerManager.cs:26-28

// Current: Confusing logic
var parallelism = _configuration.Strategy == ParallelismStrategy.Adaptive
    ? _configuration.MaxParallelism  
    : _configuration.MaxParallelism;

// Suggested: Clarify or implement actual adaptive logic
var parallelism = _configuration.MaxParallelism;

📋 Action Items

High Priority

Fix race condition in constraint acquisition logic
Remove dead code (TryAcquireConstraintsAsync)
Add integration tests for new scheduler

Medium Priority

Implement exponential backoff for consumer polling
Consolidate constraint checking logic
Add performance benchmarks

Low Priority

Document design decisions in code comments
Consider using BlockingCollection<T> for better signaling
Add more comprehensive timeout testing

🎯 Verdict

Approve with conditions: This is a solid architectural improvement that simplifies the codebase significantly. The identified issues are fixable and don't prevent the core functionality from working. The race condition should be addressed before merging, but the overall direction is excellent.

The refactor successfully reduces complexity while maintaining feature parity, making the codebase more maintainable and easier to understand.

Resolved the race condition where multiple consumers could dequeue tests but only one could acquire constraints, causing tests to be lost forever. Key changes: - Replaced separate dequeue + constraint acquisition with atomic operation - Added TryDequeueTestWithConstraints method that combines both operations - Fixed global NotInParallel constraint checking logic - Ensures tests are only dequeued when constraints can be successfully acquired This fixes: - OrderedTests now execute in correct sequential order (First->Fifth) - NotInParallel tests no longer run in parallel when they shouldn't - Eliminates lost tests due to failed constraint acquisition 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <[email protected]>

Added atomic check-and-set operation for ExecutionTask assignment to prevent tests from being executed multiple times when dependency resolution races with normal test execution. Key changes: - Added _executionTaskLock field to SingleTestExecutor - Used lock with null-coalescing assignment: dependency.Context.ExecutionTask ??= Task.Run(...) - Prevents duplicate test execution that was causing ObjectBag duplicate key errors - Ensures each test runs exactly once, either via normal scheduling or dependency execution This fixes the "An item with the same key has already been added" errors that occurred when Test1/Test2 were executed both as standalone tests and as dependencies of Test3. 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <[email protected]>

…tion paths Completely eliminated duplicate test execution by ensuring both regular and dependency execution use the same ExecutionTask mechanism. Key changes: - Modified ExecuteTestAsync to check for existing ExecutionTask before creating new one - Both regular and dependency execution now use the same ExecuteTestInternalAsync method - Removed Task.Run to preserve AsyncLocal flow (TestContext.Current) - Added proper test state handling for edge cases This finally resolves the "An item with the same key has already been added" errors by ensuring each test runs exactly once, regardless of whether it's executed as a standalone test or as a dependency of another test. 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <[email protected]>

Fixed async method return type mismatch in GetOrCreateDependencyExecutionTaskAsync. The method now properly awaits the task outside the lock to prevent deadlocks while ensuring only one execution task is created per dependency test. 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <[email protected]>

claude · 2025-08-04T22:49:59Z

Claude finished @thomhurst's task —— View job

Code Review: Simplify Test and Dependency Scheduling

I've completed a comprehensive review of this pull request which replaces the complex ProducerConsumer scheduler with a simpler queue-based approach. Here's my detailed feedback:

Progress:

Read repository setup instructions from CLAUDE.md
Analyze the architectural changes from ProducerConsumer to SimpleQueue scheduling
Review code quality and best practices in new scheduling classes
Identify potential bugs or issues in the implementation
Assess performance implications of the scheduling changes
Check for security concerns in the new implementation
Evaluate test coverage and identify testing gaps
Provide comprehensive review feedback via GitHub comment

🔍 Overall Assessment

This is a significant architectural improvement that simplifies the test scheduling system while maintaining functionality. The refactor removes ~727 lines of complex channel-based code and replaces it with ~571 lines of cleaner queue-based logic.

✅ Positive Changes

Architecture Simplification

Removed complex channel infrastructure: Eliminated ProducerConsumerTestScheduler and ChannelConsumerManager (727 lines)
Cleaner abstractions: New SimpleQueueTestScheduler, SimpleTestQueues, and SimpleConsumerManager are more readable
Better separation of concerns: Each class has a focused responsibility

Code Quality Improvements

Consistent null checking: Proper argument validation throughout
Good error handling: Appropriate try-catch blocks with cleanup logic
Clear naming: Method and variable names are descriptive
Immutable data structures: TestExecutionData uses required properties

⚠️ Issues Identified

1. Critical Race Condition in `SimpleTestQueues.cs:80-88`

// Check if any other parallel group is running
var otherGroupRunning = runningConstraintKeys.Any(x => 
    x.Key.StartsWith("__parallel_group_") && x.Key != groupKey && x.Value > 0);
    
if (!otherGroupRunning && queue.TryDequeue(out var groupTest))
{
    // Race: Another thread could start a conflicting group here
    runningConstraintKeys.AddOrUpdate(groupKey, 1, (k, v) => v + 1);
    return groupTest;
}

Risk: Multiple parallel groups could start simultaneously when they shouldn't.

Recommendation: Check and acquire constraints atomically using a lock around the entire check-and-acquire operation.

2. Potential Memory Leak in `SimpleConsumerManager.cs:87`

await Task.Delay(50, cancellationToken); // Fixed 50ms delay

Issues:

Fixed delay could cause unnecessary CPU usage under high load
No exponential backoff for empty queues
Could delay test completion in edge cases

Recommendation: Implement exponential backoff or use SemaphoreSlim for better signaling.

3. Dead Code in `SimpleConsumerManager.cs:109-114`

The GetCurrentRunningConstraints method is defined but never used, creating dead code.

Recommendation: Remove this method unless it's intended for future use.

4. Confusing Logic in `SimpleConsumerManager.cs:26-28`

var parallelism = _configuration.Strategy == ParallelismStrategy.Adaptive
    ? _configuration.MaxParallelism  
    : _configuration.MaxParallelism;

Both branches return the same value, making this logic confusing.

Recommendation: Simplify to var parallelism = _configuration.MaxParallelism; or implement actual adaptive logic.

🚀 Performance Considerations

Improvements

Reduced complexity: Simpler queue operations vs. channel management
Better cache locality: Fewer indirections in hot paths
Lower memory overhead: Removed complex channel infrastructure

Concerns

Polling overhead: 50ms delay in consumer loop could impact latency
Lock contention: Multiple queues with different locking strategies
O(n) constraint checking: Linear search through running constraints

Recommendation: Consider using BlockingCollection<T> or producer/consumer pattern with signaling.

🔒 Security Assessment

Low Risk: No security concerns identified. The code:

Properly handles cancellation tokens
Has appropriate exception handling
Doesn't expose sensitive data
Uses safe collection types

🧪 Test Coverage Analysis

Current Coverage

Basic scheduling scenarios appear covered by existing tests
NotInParallelTests and KeyedNotInParallelTests verify constraint behavior
Dependency execution logic has comprehensive tests
Parallel constraint handling has test cases

Gaps Identified

Race condition testing: No tests for concurrent constraint acquisition
Edge case coverage: Empty queues, rapid scheduling/descheduling
Performance testing: No load tests for new scheduler
Timeout handling: Limited coverage of timeout scenarios

Recommendation: Add integration tests specifically for the new scheduler implementation.

🔧 Specific Code Suggestions

SimpleTestQueues.cs:80-88

// Current: Potential race condition
var otherGroupRunning = runningConstraintKeys.Any(x => 
    x.Key.StartsWith("__parallel_group_") && x.Key != groupKey && x.Value > 0);
if (!otherGroupRunning && queue.TryDequeue(out var groupTest))

// Suggested: Atomic constraint checking
lock (_parallelGroupLock) // Add a dedicated lock for parallel groups
{
    var otherGroupRunning = runningConstraintKeys.Any(x => 
        x.Key.StartsWith("__parallel_group_") && x.Key != groupKey && x.Value > 0);
    if (!otherGroupRunning && queue.TryDequeue(out var groupTest))
    {
        runningConstraintKeys.AddOrUpdate(groupKey, 1, (k, v) => v + 1);
        return groupTest;
    }
}

SimpleConsumerManager.cs:87

// Current: Fixed delay
await Task.Delay(50, cancellationToken);

// Suggested: Exponential backoff or signaling
private int _backoffMs = 10;
await Task.Delay(Math.Min(_backoffMs, 1000), cancellationToken);
_backoffMs = Math.Min(_backoffMs * 2, 1000); // Reset on successful dequeue

📋 Action Items

High Priority

Fix race condition in parallel group constraint acquisition
Remove dead code (GetCurrentRunningConstraints)
Add integration tests for new scheduler under concurrent load

Medium Priority

Implement exponential backoff for consumer polling
Add performance benchmarks comparing old vs new scheduler
Document design decisions in code comments

Low Priority

Consider using BlockingCollection<T> for better signaling
Add more comprehensive timeout testing
Clarify adaptive parallelism logic

🎯 Verdict

Approve with conditions: This is a solid architectural improvement that simplifies the codebase significantly. The identified issues are fixable and don't prevent the core functionality from working. The race condition should be addressed before merging, but the overall direction is excellent.

The refactor successfully reduces complexity while maintaining feature parity, making the codebase more maintainable and easier to understand.

- Added ExecutionLock to TestContext for thread-safe coordination - Both scheduler and dependency execution paths now use same lock - Unified both paths to use single ExecuteTestInternalAsync method - Prevents Test1 from running twice (once as scheduled test, once as dependency) - Simplified logic removes duplicate execution paths and potential race conditions 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <[email protected]>

claude · 2025-08-04T22:57:52Z

Claude finished @thomhurst's task —— View job

Code Review: Simplify Test and Dependency Scheduling

I've completed a comprehensive review of this pull request which replaces the complex ProducerConsumer scheduler with a simpler queue-based approach. Here's my detailed feedback:

Progress:

Read repository setup instructions from CLAUDE.md
Analyze the architectural changes from ProducerConsumer to SimpleQueue scheduling
Review code quality and best practices in new scheduling classes
Identify potential bugs or issues in the implementation
Assess performance implications of the scheduling changes
Check for security concerns in the new implementation
Evaluate test coverage and identify testing gaps
Provide comprehensive review feedback

🔍 Overall Assessment

This is a significant architectural improvement that simplifies the test scheduling system while maintaining functionality. The refactor removes complex channel-based code and replaces it with cleaner queue-based logic, making the codebase more maintainable.

✅ Positive Changes

Architecture Simplification

Removed complex channel infrastructure: Eliminated ProducerConsumerTestScheduler and ChannelConsumerManager (727 lines deleted)
Cleaner abstractions: New SimpleQueueTestScheduler, SimpleTestQueues, and SimpleConsumerManager are more readable
Better separation of concerns: Each class has a focused responsibility
Improved dependency execution: Unified execution paths with proper locking in SingleTestExecutor.cs:44-53

Code Quality Improvements

Consistent null checking: Proper argument validation throughout
Good error handling: Appropriate try-catch blocks with cleanup logic in SimpleConsumerManager.cs:74-81
Clear naming: Method and variable names are descriptive
Immutable data structures: TestExecutionData uses required properties

⚠️ Issues Identified

1. Race Condition in `SimpleTestQueues.cs:80-88`

// Check if any other parallel group is running
var otherGroupRunning = runningConstraintKeys.Any(x => 
    x.Key.StartsWith("__parallel_group_") && x.Key != groupKey && x.Value > 0);
    
if (!otherGroupRunning && queue.TryDequeue(out var groupTest))
{
    // Race: Another thread could start a conflicting group here
    runningConstraintKeys.AddOrUpdate(groupKey, 1, (k, v) => v + 1);
    return groupTest;
}

Risk: Multiple parallel groups could start simultaneously when they shouldn't.

Recommendation: Check and acquire constraints atomically using a lock around the entire check-and-acquire operation.

2. Performance Issue in `SimpleConsumerManager.cs:87`

await Task.Delay(50, cancellationToken); // Fixed 50ms delay

Issues:

Fixed delay could cause unnecessary CPU usage under high load
No exponential backoff for empty queues
Could delay test completion in edge cases

Recommendation: Implement exponential backoff or use SemaphoreSlim for better signaling.

3. Dead Code in `SimpleConsumerManager.cs:109-114`

The GetCurrentRunningConstraints method is defined but never used.

Recommendation: Remove this method unless it's intended for future use.

4. Confusing Logic in `SimpleConsumerManager.cs:26-28`

var parallelism = _configuration.Strategy == ParallelismStrategy.Adaptive
    ? _configuration.MaxParallelism  
    : _configuration.MaxParallelism;

Both branches return the same value, making this logic confusing.

Recommendation: Simplify to var parallelism = _configuration.MaxParallelism; or implement actual adaptive logic.

🚀 Performance Considerations

Improvements

Reduced complexity: Simpler queue operations vs. channel management
Better cache locality: Fewer indirections in hot paths
Lower memory overhead: Removed complex channel infrastructure

Concerns

Polling overhead: 50ms delay in consumer loop could impact latency
Lock contention: Multiple queues with different locking strategies
O(n) constraint checking: Linear search through running constraints in SimpleTestQueues.cs:80-81

Recommendation: Consider using BlockingCollection<T> or producer/consumer pattern with signaling.

🔒 Security Assessment

Low Risk: No security concerns identified. The code:

Properly handles cancellation tokens
Has appropriate exception handling
Doesn't expose sensitive data
Uses safe collection types

🧪 Test Coverage Analysis

Current Coverage

Basic scheduling scenarios appear covered by existing tests
NotInParallelTests and KeyedNotInParallelTests verify constraint behavior
Dependency execution logic has comprehensive tests with proper locking
Parallel constraint handling has test cases

Gaps Identified

Race condition testing: No tests for concurrent constraint acquisition
Edge case coverage: Empty queues, rapid scheduling/descheduling
Performance testing: No load tests for new scheduler
Timeout handling: Limited coverage of timeout scenarios

Recommendation: Add integration tests specifically for the new scheduler implementation.

🔧 Specific Code Suggestions

SimpleTestQueues.cs:80-88

// Current: Potential race condition
var otherGroupRunning = runningConstraintKeys.Any(x => 
    x.Key.StartsWith("__parallel_group_") && x.Key != groupKey && x.Value > 0);
if (!otherGroupRunning && queue.TryDequeue(out var groupTest))

// Suggested: Atomic constraint checking
lock (_parallelGroupLock) // Add a dedicated lock for parallel groups
{
    var otherGroupRunning = runningConstraintKeys.Any(x => 
        x.Key.StartsWith("__parallel_group_") && x.Key != groupKey && x.Value > 0);
    if (!otherGroupRunning && queue.TryDequeue(out var groupTest))
    {
        runningConstraintKeys.AddOrUpdate(groupKey, 1, (k, v) => v + 1);
        return groupTest;
    }
}

SimpleConsumerManager.cs:87

// Current: Fixed delay
await Task.Delay(50, cancellationToken);

// Suggested: Exponential backoff
private int _backoffMs = 10;
await Task.Delay(Math.Min(_backoffMs, 1000), cancellationToken);
_backoffMs = Math.Min(_backoffMs * 2, 1000); // Reset on successful dequeue

📋 Action Items

High Priority

Fix race condition in parallel group constraint acquisition
Remove dead code (GetCurrentRunningConstraints)
Add integration tests for new scheduler under concurrent load

Medium Priority

Implement exponential backoff for consumer polling
Add performance benchmarks comparing old vs new scheduler
Document design decisions in code comments

Low Priority

Consider using BlockingCollection<T> for better signaling
Add more comprehensive timeout testing
Clarify adaptive parallelism logic

🎯 Verdict

Approve with conditions: This is a solid architectural improvement that simplifies the codebase significantly. The identified issues are fixable and don't prevent the core functionality from working. The race condition should be addressed before merging, but the overall direction is excellent.

The refactor successfully reduces complexity while maintaining feature parity, making the codebase more maintainable and easier to understand.

Fixed a critical bug where tests skipped due to failed dependencies were having their state reset to Running, causing "Test is still running" errors. The issue was in ExecuteTestInternalAsync: 1. ExecuteDependenciesAsync would mark a test as Skipped if dependencies failed 2. But execution continued and reset the state to Running 3. This left tests in an invalid Running state that never completed Now we check if the test was skipped after executing dependencies and return early to preserve the Skipped state. 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <[email protected]>

claude · 2025-08-05T09:50:08Z

Claude finished @thomhurst's task —— View job

Code Review: Simplify Test and Dependency Scheduling

I've completed a comprehensive review of this architectural refactor that replaces the complex ProducerConsumer scheduler with a simpler queue-based approach.

Progress:

Read repository setup instructions from CLAUDE.md
Analyze the architectural changes from ProducerConsumer to SimpleQueue scheduling
Review code quality and best practices in new scheduling classes
Identify potential bugs or issues in the implementation
Assess performance implications of the scheduling changes
Check for security concerns in the new implementation
Evaluate test coverage and identify testing gaps
Provide comprehensive review feedback

🔍 Overall Assessment

This is a significant architectural improvement that successfully simplifies the test scheduling system while maintaining functionality. The refactor removes complex channel-based code and replaces it with cleaner queue-based logic, making the codebase more maintainable and easier to understand.

✅ Positive Changes

Architecture Simplification

Removed complex channel infrastructure: Eliminated ProducerConsumerTestScheduler and ChannelConsumerManager (727 lines deleted)
Cleaner abstractions: New SimpleQueueTestScheduler, SimpleTestQueues, and SimpleConsumerManager are more readable and focused
Better separation of concerns: Each class has a clear, focused responsibility
Improved dependency execution: Unified execution paths with proper locking in SingleTestExecutor.cs:44-61

Code Quality Improvements

Consistent null checking: Proper argument validation throughout (SimpleQueueTestScheduler.cs:31-35)
Good error handling: Appropriate try-catch blocks with cleanup logic (SimpleConsumerManager.cs:70-81)
Clear naming: Method and variable names are descriptive and self-documenting
Immutable data structures: TestExecutionData uses required properties for better safety
Proper constraint management: Atomic constraint acquisition and release

⚠️ Issues Identified

1. Race Condition in `SimpleTestQueues.cs:80-88`

// Check if any other parallel group is running
var otherGroupRunning = runningConstraintKeys.Any(x => 
    x.Key.StartsWith("__parallel_group_") && x.Key != groupKey && x.Value > 0);
    
if (!otherGroupRunning && queue.TryDequeue(out var groupTest))
{
    // Race: Another thread could start a conflicting group here
    runningConstraintKeys.AddOrUpdate(groupKey, 1, (k, v) => v + 1);
    return groupTest;
}

Risk: Multiple parallel groups could start simultaneously when they shouldn't, violating the constraint that only one parallel group can run at a time.

Recommendation: Check and acquire constraints atomically using a lock around the entire check-and-acquire operation.

2. Performance Issue in `SimpleConsumerManager.cs:87`

await Task.Delay(50, cancellationToken); // Fixed 50ms delay

Issues:

Fixed delay could cause unnecessary CPU usage under high load
No exponential backoff for empty queues
Could delay test completion in edge cases

Recommendation: Implement exponential backoff or use SemaphoreSlim for better signaling.

3. Dead Code in `SimpleConsumerManager.cs:109-114`

The GetCurrentRunningConstraints method is defined but never used, creating dead code.

Recommendation: Remove this method unless it's intended for future use.

4. Confusing Logic in `SimpleConsumerManager.cs:26-28`

var parallelism = _configuration.Strategy == ParallelismStrategy.Adaptive
    ? _configuration.MaxParallelism  
    : _configuration.MaxParallelism;

Both branches return the same value, making this logic confusing and suggesting incomplete adaptive logic implementation.

Recommendation: Either implement actual adaptive logic or simplify to var parallelism = _configuration.MaxParallelism;.

🚀 Performance Considerations

Improvements

Reduced complexity: Simpler queue operations vs. channel management
Better cache locality: Fewer indirections in hot paths
Lower memory overhead: Removed complex channel infrastructure
More efficient constraint checking: Better structured logic in SimpleTestQueues.cs:92-121

Concerns

Polling overhead: 50ms delay in consumer loop could impact latency in scenarios with rapid test completion
O(n) constraint checking: Linear search through running constraints in SimpleTestQueues.cs:80-81
Lock contention: Multiple queues with different locking strategies could cause contention

Recommendation: Consider using BlockingCollection<T> or producer/consumer pattern with proper signaling mechanisms.

🔒 Security Assessment

Low Risk: No security concerns identified. The code:

Properly handles cancellation tokens
Has appropriate exception handling
Doesn't expose sensitive data
Uses safe collection types
Follows secure coding practices

🧪 Test Coverage Analysis

Current Coverage ✅

Basic scheduling scenarios covered by existing tests
NotInParallelTests and KeyedNotInParallelTests verify constraint behavior with proper timing checks
Dependency execution logic has comprehensive tests with proper locking
Parallel constraint handling has test cases with appropriate delays to verify ordering

Gaps Identified ⚠️

Race condition testing: No tests specifically for concurrent constraint acquisition scenarios
Edge case coverage: Limited testing of empty queues, rapid scheduling/descheduling
Performance testing: No load tests for new scheduler under high concurrency
Timeout handling: Limited coverage of timeout scenarios in new implementation
Error recovery: Limited testing of error scenarios in consumer management

Recommendation: Add integration tests specifically for the new scheduler implementation, particularly focusing on concurrent access patterns.

🔧 Specific Code Suggestions

Fix Race Condition in `SimpleTestQueues.cs:80-88`

// Current: Potential race condition
var otherGroupRunning = runningConstraintKeys.Any(x => 
    x.Key.StartsWith("__parallel_group_") && x.Key != groupKey && x.Value > 0);
if (!otherGroupRunning && queue.TryDequeue(out var groupTest))

// Suggested: Atomic constraint checking
private readonly object _parallelGroupLock = new();

lock (_parallelGroupLock) // Add dedicated lock for parallel groups
{
    var otherGroupRunning = runningConstraintKeys.Any(x => 
        x.Key.StartsWith("__parallel_group_") && x.Key != groupKey && x.Value > 0);
    if (!otherGroupRunning && queue.TryDequeue(out var groupTest))
    {
        runningConstraintKeys.AddOrUpdate(groupKey, 1, (k, v) => v + 1);
        return groupTest;
    }
}

Improve Consumer Polling in `SimpleConsumerManager.cs:87`

// Current: Fixed delay
await Task.Delay(50, cancellationToken);

// Suggested: Exponential backoff
private int _backoffMs = 10;
await Task.Delay(Math.Min(_backoffMs, 1000), cancellationToken);
_backoffMs = Math.Min(_backoffMs * 2, 1000); // Reset on successful dequeue

📋 Action Items

High Priority 🔴

Fix race condition in parallel group constraint acquisition (SimpleTestQueues.cs:80-88)
Remove dead code (GetCurrentRunningConstraints method)
Add integration tests for new scheduler under concurrent load

Medium Priority 🟡

Implement exponential backoff for consumer polling to reduce CPU usage
Add performance benchmarks comparing old vs new scheduler performance
Document design decisions in code comments for future maintainers

Low Priority 🟢

Consider using BlockingCollection<T> for better signaling instead of polling
Add more comprehensive timeout testing for edge cases
Clarify or implement adaptive parallelism logic

🎯 Verdict

Approve with conditions: This is a solid architectural improvement that significantly simplifies the codebase. The identified issues are fixable and don't prevent the core functionality from working correctly. The race condition should be addressed before merging, but the overall direction is excellent.

The refactor successfully reduces complexity while maintaining feature parity, making the codebase more maintainable and easier to understand. The previous complex channel-based approach has been replaced with a much cleaner queue-based system that achieves the same goals with less code and complexity.

Added try-catch around execution task awaiting to ensure tests that throw exceptions during execution get proper failed state instead of being left in Running state. This helps handle edge cases where test execution fails unexpectedly. 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <[email protected]>

thomhurst and others added 6 commits August 4, 2025 21:51

refactor: remove ChannelConsumerManager and ProducerConsumerTestSched…

eeed71d

…uler - Deleted ChannelConsumerManager and ProducerConsumerTestScheduler classes. - Simplified test scheduling by removing outdated channel-based producer-consumer logic.

Merge branch 'main' into feature/improve-dependency-scheduling

82ed04e

thomhurst had a problem deploying to Pull Requests August 4, 2025 21:44 — with GitHub Actions Failure

thomhurst enabled auto-merge (squash) August 4, 2025 21:44

thomhurst and others added 4 commits August 4, 2025 23:49

thomhurst had a problem deploying to Pull Requests August 4, 2025 22:49 — with GitHub Actions Failure

thomhurst had a problem deploying to Pull Requests August 4, 2025 22:57 — with GitHub Actions Failure

thomhurst had a problem deploying to Pull Requests August 5, 2025 09:49 — with GitHub Actions Failure

This was referenced Oct 13, 2025

Bump TUnit.Engine from 0.19.86 to 0.67.19 ImmediatePlatform/Immediate.Cache#78

Closed

chore(deps): Bump the tunit group with 1 update dailydevops/fluentvalue#160

Closed

chore(deps): Bump the tunit group with 1 update dailydevops/fluentvalue#162

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Simplify Test and Dependency Scheduling #2759

Simplify Test and Dependency Scheduling #2759

Uh oh!

thomhurst commented Aug 4, 2025

Uh oh!

claude bot commented Aug 4, 2025 •

edited

Loading

Uh oh!

claude bot commented Aug 4, 2025 •

edited

Loading

Uh oh!

claude bot commented Aug 4, 2025 •

edited

Loading

Uh oh!

claude bot commented Aug 5, 2025 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Simplify Test and Dependency Scheduling #2759

Simplify Test and Dependency Scheduling #2759

Uh oh!

Conversation

thomhurst commented Aug 4, 2025

Uh oh!

claude bot commented Aug 4, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Code Review: Simplify Test and Dependency Scheduling

Progress:

🔍 Overall Assessment

✅ Positive Changes

Architecture Simplification

Code Quality Improvements

⚠️ Issues Identified

1. Critical Race Condition in SimpleTestQueues.cs:63-156

2. Inconsistent Locking Strategy in SimpleTestQueues.cs

3. Potential Memory Leak in SimpleConsumerManager.cs:88

4. Unused Method in SimpleConsumerManager.cs:119-181

5. Constraint Acquisition Logic Duplication

🚀 Performance Considerations

Improvements

Concerns

🔒 Security Assessment

🧪 Test Coverage Analysis

Current Coverage

Gaps Identified

🔧 Specific Code Suggestions

SimpleTestQueues.cs:86-87

SimpleConsumerManager.cs:26-28

📋 Action Items

High Priority

Medium Priority

Low Priority

🎯 Verdict

Uh oh!

claude bot commented Aug 4, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Code Review: Simplify Test and Dependency Scheduling

Progress:

🔍 Overall Assessment

✅ Positive Changes

Architecture Simplification

Code Quality Improvements

⚠️ Issues Identified

1. Critical Race Condition in SimpleTestQueues.cs:80-88

2. Potential Memory Leak in SimpleConsumerManager.cs:87

3. Dead Code in SimpleConsumerManager.cs:109-114

4. Confusing Logic in SimpleConsumerManager.cs:26-28

🚀 Performance Considerations

Improvements

Concerns

🔒 Security Assessment

🧪 Test Coverage Analysis

Current Coverage

Gaps Identified

🔧 Specific Code Suggestions

SimpleTestQueues.cs:80-88

SimpleConsumerManager.cs:87

📋 Action Items

High Priority

Medium Priority

Low Priority

🎯 Verdict

Uh oh!

claude bot commented Aug 4, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Code Review: Simplify Test and Dependency Scheduling

Progress:

🔍 Overall Assessment

✅ Positive Changes

Architecture Simplification

Code Quality Improvements

⚠️ Issues Identified

1. Race Condition in SimpleTestQueues.cs:80-88

2. Performance Issue in SimpleConsumerManager.cs:87

3. Dead Code in SimpleConsumerManager.cs:109-114

4. Confusing Logic in SimpleConsumerManager.cs:26-28

🚀 Performance Considerations

claude bot commented Aug 4, 2025 •

edited

Loading

1. Critical Race Condition in `SimpleTestQueues.cs:63-156`

2. Inconsistent Locking Strategy in `SimpleTestQueues.cs`

3. Potential Memory Leak in `SimpleConsumerManager.cs:88`

4. Unused Method in `SimpleConsumerManager.cs:119-181`

claude bot commented Aug 4, 2025 •

edited

Loading

1. Critical Race Condition in `SimpleTestQueues.cs:80-88`

2. Potential Memory Leak in `SimpleConsumerManager.cs:87`

3. Dead Code in `SimpleConsumerManager.cs:109-114`

4. Confusing Logic in `SimpleConsumerManager.cs:26-28`

claude bot commented Aug 4, 2025 •

edited

Loading

1. Race Condition in `SimpleTestQueues.cs:80-88`

2. Performance Issue in `SimpleConsumerManager.cs:87`

3. Dead Code in `SimpleConsumerManager.cs:109-114`

4. Confusing Logic in `SimpleConsumerManager.cs:26-28`

claude bot commented Aug 5, 2025 •

edited

Loading

1. Race Condition in `SimpleTestQueues.cs:80-88`

2. Performance Issue in `SimpleConsumerManager.cs:87`

3. Dead Code in `SimpleConsumerManager.cs:109-114`

4. Confusing Logic in `SimpleConsumerManager.cs:26-28`

Fix Race Condition in `SimpleTestQueues.cs:80-88`

Improve Consumer Polling in `SimpleConsumerManager.cs:87`