linux-system-roles · spetrosi · May 25, 2026 · May 20, 2026 · May 20, 2026 · May 20, 2026
diff --git a/.claude/skills/analyze-quarterly-metrics/SKILL.md b/.claude/skills/analyze-quarterly-metrics/SKILL.md
@@ -0,0 +1,175 @@
+---
+name: analyze-quarterly-metrics
+description: This skill should be used when the user asks to "analyze quarterly metrics", "analyze the quarter", "generate quarterly report", "quarterly analysis", mentions a specific quarter like "2026-Q2", or discusses trends, risks, and growth opportunities for Linux System Roles metrics.
+---
+
+# Quarterly Metrics Analysis Skill
+
+This skill analyzes quarterly metrics data and generates a FOCUSED report (~400-500 words) with key metrics, highlights, top downloaded roles, concerns, and actionable recommendations.
+
+## What to do
+
+1. **Determine if the quarter is complete or in-progress:**
+   - Get today's date and compare to the quarter being analyzed
+   - If analyzing current or future quarter: Add a disclaimer that data is PARTIAL/INCOMPLETE
+   - Adjust your interpretation: low numbers may just mean "not much time has passed yet", not a crisis
+   - For partial quarters: focus on trends and rates rather than absolute numbers
+
+2. **Load the raw data** for the specified quarter:
+   - Read the summary CSV files: `data/github_prs_summary.csv`, `data/github_issues_summary.csv`, `data/galaxy_legacy_summary.csv`, `data/galaxy_collections_summary.csv`
+   - Extract data for the specified quarter and the previous quarter for comparison
+   - If `data/{{quarter}}/galaxy_legacy.csv` exists, read it for top downloaded roles and growth analysis
+
+3. **Calculate key metrics** from the raw data:
+   - **PR Merge Rate**: PRs Merged / (PRs Created - PRs Open) × 100 (excludes PRs still under review)
+   - **External Acceptance Rate**: External PRs Merged / (External PRs Created - External PRs Open) × 100
+   - **External Contribution %**: External PRs Created / PRs Created × 100
+   - **Issue Resolution Rate**: Issues Closed / Issues Created × 100
+   - **QoQ Growth rates**: Compare current quarter to previous quarter
+   - **Fastest growing roles**: If per-role data exists, identify top gainers by comparing to previous quarter
+
+4. **Analyze the data** and generate a FOCUSED report with these sections:
+
+   ### Executive Summary (2-3 sentences)
+   - High-level overview: what's the overall health? One key win, one key concern.
+   - Note if data is partial (for incomplete quarters)
+
+   ### Key Metrics (Table format)
+   - Comparison table: Current vs Previous Quarter
+   - PRs Created, Merged, Merge Rate
+   - External PRs, External Acceptance Rate, External % of total
+   - Issues Created, Closed, Resolution Rate
+   - Galaxy Legacy Downloads (QoQ change)
+   - Galaxy Collections Downloads (QoQ change)
+
+   ### Highlights (2-3 bullets)
+   - Notable achievements and positive changes
+   - Metrics that improved
+   - Balance the concerns with wins
+
+   ### Top Downloaded Roles (Top 5 list)
+   - List the 5 most downloaded roles with cumulative totals
+   - If per-role data exists for current and previous quarter, also show fastest growing/declining roles
+
+   ### Top Concerns (3-4 bullets)
+   - Most critical issues requiring attention
+   - Use specific numbers and percentages
+   - Only urgent/high-impact issues
+
+   ### Recommendations (3-4 bullets)
+   - Most important actionable items
+   - Be specific and brief
+   - Focus on high-impact actions
+
+   **TARGET LENGTH**: ~400-500 words. Scannable but informative.
+
+5. **Be specific with numbers**: Always cite actual metrics, percentages, and comparisons. Don't use vague language like "significant" without quantifying it.
+
+6. **Be smart about partial quarters**:
+   - If the quarter is incomplete, DO NOT flag low absolute numbers as risks
+   - Focus on rates (merge rate, acceptance rate) rather than volumes for partial data
+   - Only flag things as concerns if they represent actual problems, not just "we're only 2 weeks into the quarter"
+   - Make it clear in the Executive Summary if data is partial
+
+7. **Validate the report for accuracy**:
+   - Re-read all numeric claims in your draft
+   - For each number: verify it matches the correct CSV column/row
+   - For each comparison: verify the direction (higher/lower) matches the math
+   - Check for contradictions: "up from X, but below X" is impossible
+   - If you find errors, fix them before saving
+
+8. **Automatically save the report**:
+   - Save to `reports/{{quarter}}-analysis.md`
+   - Create the reports directory if it doesn't exist
+   - **Safe to overwrite**: If the report file already exists, overwrite it (reports are tracked in git, so previous versions are preserved)
+   - Show the user a brief summary and notify where the full file was saved
+
+## Guidelines
+
+- **Balanced and scannable**: Target ~400-500 words, readable in 2-3 minutes
+- **Detect partial quarters**: Note if data is incomplete in the Executive Summary
+- **Compare to previous quarter**: QoQ changes only, skip historical deep-dives
+- **Balance concerns with wins**: Include both Highlights and Top Concerns sections
+- **Show what matters**: Top downloaded roles help prioritize work
+- **Only critical issues**: If it's not urgent or high-impact, skip it from concerns
+- **Be actionable**: Every recommendation must be specific and implementable
+- **Don't cry wolf on partial data**: For incomplete quarters, only flag true risks (bad rates, declining trends), not low volumes
+- **Use numbers**: Always cite specific metrics, no vague language
+
+### Data Accuracy and Validation
+
+**CRITICAL: Verify all numbers before making claims**
+
+1. **Use correct data sources**:
+   - When discussing a specific collection (e.g., fedora.linux_system_roles), use that collection's column, NOT the Total Downloads column
+   - When discussing totals, clearly state "total across all collections"
+   - Double-check: Does the number in your sentence match the CSV cell you're referencing?
+
+2. **Verify comparison direction**:
+   - If A > B: use "higher than", "above", "exceeds", "increased from"
+   - If A < B: use "lower than", "below", "decreased from", "down from"
+   - If A ≈ B: use "similar to", "comparable to", "roughly equal to"
+   - **NEVER** say "A is well below B" when A > B or vice versa
+
+3. **Validate calculations**:
+   - QoQ growth = ((Current - Previous) / Previous) × 100
+   - If growth is positive, use "increased" or "up"; if negative, use "decreased" or "down"
+   - Check sign: positive growth means increase, negative growth means decrease
+
+4. **Before finalizing the report**:
+   - Re-read each numeric claim
+   - Verify the number matches the data source (correct CSV column/row)
+   - Verify the comparison direction matches the math (higher/lower/equal)
+   - Check for internal contradictions (e.g., "up from X, but well below X")
+
+## Example invocations
+
+**Via skill invocation:**
+User types: `/analyze-quarterly-metrics 2026-Q2`
+
+**Via natural language:**
+- "Analyze the quarterly metrics for 2026-Q2"
+- "Generate a quarterly report for Q2 2026"
+- "What do the metrics show for this quarter?"
+
+You should:
+1. Extract the quarter from the user's request or args (format: YYYY-QN, e.g., 2026-Q2)
+2. Determine if the quarter is complete or in-progress based on today's date
+3. Read the raw metrics data from CSV files for that quarter and previous quarter
+4. Read per-role Galaxy data if available for top downloaded roles analysis
+5. Calculate derived metrics (merge rates, growth rates, etc.)
+6. Generate a focused report (~400-500 words): Executive Summary, Key Metrics, Highlights, Top Downloaded Roles, Top Concerns, Recommendations
+7. Validate: verify all numbers match data sources
+8. Save to `reports/{{quarter}}-analysis.md`
+9. Show the full report to user and notify where file was saved
+
+## Important
+
+- **Check if quarter is complete**: Compare today's date to quarter end (Q1: Jan-Mar, Q2: Apr-Jun, Q3: Jul-Sep, Q4: Oct-Dec)
+- For partial quarters: add disclaimer in Executive Summary
+- **Calculate metrics from CSV data** - don't rely on pre-computed files
+- **Read per-role data** if `data/{{quarter}}/galaxy_legacy.csv` exists for top downloaded roles
+- **Balanced report**: ~400-500 words, 6 sections (Summary, Metrics, Highlights, Top Roles, Concerns, Recommendations)
+- **Auto-save** to `reports/{{quarter}}-analysis.md` (safe to overwrite, tracked in git)
+- Show full report to user and file location
+
+## Metric Calculation Formulas
+
+Use these formulas when calculating metrics from the raw CSV data:
+
+**PR Metrics:**
+- Merge Rate = (PRs Merged) / (PRs Created - PRs Open) × 100
+  - Excludes PRs still under review from the calculation
+- External Acceptance = (External PRs Merged) / (External PRs Created - External PRs Open) × 100  
+  - Excludes external PRs still under review
+- External % = (External PRs Created) / (PRs Created) × 100
+- QoQ Growth = ((Current - Previous) / Previous) × 100
+
+**Issue Metrics:**
+- Resolution Rate = (Issues Closed) / (Issues Created) × 100
+- External % = (External Issues Created) / (Issues Created) × 100
+- QoQ Growth = ((Current - Previous) / Previous) × 100
+
+**Galaxy Metrics:**
+- Legacy QoQ Growth = ((Current Total - Previous Total) / Previous Total) × 100
+- Collections QoQ Growth = ((Current Total - Previous Total) / Previous Total) × 100
diff --git a/.github/workflows/quarterly-metrics.yml b/.github/workflows/quarterly-metrics.yml
@@ -0,0 +1,171 @@
+name: Quarterly Metrics Report
+
+on:
+  # Scheduled: Run on the last day of each quarter at 9am UTC
+  schedule:
+    - cron: '0 9 31 3,12 *'  # March 31, December 31 at 9am UTC
+    - cron: '0 9 30 6,9 *'   # June 30, September 30 at 9am UTC
+
+  # Manual trigger with optional parameters
+  workflow_dispatch:
+    inputs:
+      quarter:
+        description: 'Quarter (e.g., 2025-Q1)'
+        required: false
+        type: string
+      date_range:
+        description: 'Date range (e.g., 2025-01-01..2025-03-31)'
+        required: false
+        type: string
+
+jobs:
+  generate-report:
+    runs-on: ubuntu-latest
+
+    permissions:
+      contents: write        # Required to commit and push changes
+      pull-requests: write   # Required to create pull requests
+
+    steps:
+      - name: Checkout repository
+        uses: actions/checkout@v6
+
+      - name: Set up Python
+        uses: actions/setup-python@v6
+        with:
+          python-version: '3.11'
+          cache: 'pip'
+
+      - name: Install Python dependencies
+        run: |
+          pip install -r requirements.txt
+
+      - name: Install GitHub CLI
+        run: |
+          # gh CLI is pre-installed on ubuntu-latest runners
+          gh --version
+
+      - name: Determine quarter and date range
+        id: quarter
+        env:
+          INPUT_QUARTER: ${{ github.event.inputs.quarter }}
+          INPUT_DATE_RANGE: ${{ github.event.inputs.date_range }}
+        run: |
+          # Use input if provided, otherwise calculate current quarter
+          if [ -n "$INPUT_QUARTER" ]; then
+            QUARTER="$INPUT_QUARTER"
+          else
+            # Calculate quarter from current date
+            YEAR=$(date +%Y)
+            MONTH=$(date +%m)
+            Q=$(( (MONTH - 1) / 3 + 1 ))
+            QUARTER="${YEAR}-Q${Q}"
+          fi
+          echo "quarter=${QUARTER}" >> $GITHUB_OUTPUT
+
+          # Determine date range
+          if [ -n "$INPUT_DATE_RANGE" ]; then
+            DATE_RANGE="$INPUT_DATE_RANGE"
+          else
+            # Auto-calculate date range based on quarter
+            YEAR=$(echo ${QUARTER} | cut -d'-' -f1)
+            Q_NUM=$(echo ${QUARTER} | cut -d'Q' -f2)
+
+            case ${Q_NUM} in
+              1)
+                DATE_RANGE="${YEAR}-01-01..${YEAR}-03-31"
+                ;;
+              2)
+                DATE_RANGE="${YEAR}-04-01..${YEAR}-06-30"
+                ;;
+              3)
+                DATE_RANGE="${YEAR}-07-01..${YEAR}-09-30"
+                ;;
+              4)
+                DATE_RANGE="${YEAR}-10-01..${YEAR}-12-31"
+                ;;
+            esac
+          fi
+          echo "date_range=${DATE_RANGE}" >> $GITHUB_OUTPUT
+
+      - name: Collect GitHub statistics
+        env:
+          GITHUB_TOKEN: ${{ secrets.GITHUB_TOKEN }}
+          QUARTER: ${{ steps.quarter.outputs.quarter }}
+          DATE_RANGE: ${{ steps.quarter.outputs.date_range }}
+        run: bash scripts/collect_all_github_stats.sh
+
+      - name: Collect Galaxy statistics
+        env:
+          GALAXY_API_KEY: ${{ secrets.GALAXY_API_KEY }}
+          QUARTER: ${{ steps.quarter.outputs.quarter }}
+        run: python3 scripts/collect_galaxy_stats.py
+
+      - name: Update quarterly summary files
+        env:
+          QUARTER: ${{ steps.quarter.outputs.quarter }}
+        run: python3 scripts/update_quarterly_summary.py
+
+      - name: Generate graphs
+        env:
+          QUARTER: ${{ steps.quarter.outputs.quarter }}
+        run: python3 scripts/generate_graphs.py
+
+      - name: Create Pull Request with results
+        env:
+          GH_TOKEN: ${{ secrets.GITHUB_TOKEN }}
+          QUARTER: ${{ steps.quarter.outputs.quarter }}
+        run: |
+          git config user.name "github-actions[bot]"
+          git config user.email "github-actions[bot]@users.noreply.github.com"
+
+          git add data/ reports/
+
+          # Check if there are changes to commit
+          if git diff --staged --quiet; then
+            echo "No changes to commit"
+            exit 0
+          fi
+
+          # Create or switch to branch for this quarter
+          BRANCH_NAME="metrics/${QUARTER}"
+          if git ls-remote --exit-code --heads origin "$BRANCH_NAME" >/dev/null 2>&1; then
+            echo "Branch $BRANCH_NAME exists, updating it"
+            git fetch origin "$BRANCH_NAME"
+            git checkout -B "$BRANCH_NAME" "origin/$BRANCH_NAME"
+          else
+            echo "Creating new branch $BRANCH_NAME"
+            git checkout -b "$BRANCH_NAME"
+          fi
+
+          # Commit changes
+          git commit -m "Add metrics for ${QUARTER}" -m "Generated by GitHub Actions workflow" -m "- Data: data/${QUARTER}/" -m "- Graphs: reports/images/"
+
+          # Push the branch
+          git push --set-upstream origin "$BRANCH_NAME"
+
+          # Create Pull Request if it doesn't exist
+          PR_URL=$(gh pr list --head "$BRANCH_NAME" --base main --json url -q '.[0].url')
+          if [ -n "$PR_URL" ]; then
+            echo "PR already exists: $PR_URL"
+          else
+            gh pr create \
+              --title "Quarterly Metrics - ${QUARTER}" \
+              --body "Automated metrics collection for ${QUARTER}. **Data:** \`data/${QUARTER}/\` **Graphs:** \`reports/images/\` After merging, generate analysis: \`/analyze-quarterly-metrics ${QUARTER}\`" \
+              --base main \
+              --head "$BRANCH_NAME"
+          fi
+
+      - name: Create workflow summary
+        env:
+          GH_TOKEN: ${{ secrets.GITHUB_TOKEN }}
+          QUARTER: ${{ steps.quarter.outputs.quarter }}
+        run: |
+          echo "## Quarterly Metrics - ${QUARTER}" >> $GITHUB_STEP_SUMMARY
+          echo "" >> $GITHUB_STEP_SUMMARY
+          PR_URL=$(gh pr view metrics/${QUARTER} --json url -q .url 2>/dev/null || echo "")
+          if [ -n "$PR_URL" ]; then
+            echo "**[Review Pull Request](${PR_URL})**" >> $GITHUB_STEP_SUMMARY
+          else
+            echo "Data: \`data/${QUARTER}/\`" >> $GITHUB_STEP_SUMMARY
+          fi
diff --git a/.gitignore b/.gitignore
@@ -0,0 +1,2 @@
+__pycache__/
+venv/