Simplify how our FatalError type works and introduce a severity concept #58094

jasonmalinowski · 2021-12-03T01:37:10Z

Our FatalError type had two "handlers", the "fatal handler" and "non fatal handler." If you called any ReportAndCatch method, that would be counted as "non fatal" because you're catching the exception and presumably dealing with it. If you called any ReportAndPropagate, that would call the "fatal" handler.

This was confusing from a few perspectives. In the IDE, both the "fatal" and "non-fatal" handlers actually did the same thing: they both sent a non-fatal report via telemetry and kept the process alive. At first, I was tempted to use fatal vs. non-fatal as a way to report different severities via telemetry, but the concept doesn't jibe well there. Sometimes we were calling ReportAndCatch when the feature absolutely broke and the user would have known; in that case we want the severity to be high. Other times we're catching an exception and gracefully dealing with it, so the severity should be lower. Similarly, propagating an exception usually means it's a severe problem, but if it's a background job that has a higher-up error handler, we may not want to treat it as high severity. In the end, the conclusion was that the fatal vs. non-fatal split didn't make sense in relation to severity. Since severity doesn't correlate with whether you want exceptions to flow or not, it's easier to add severity as a separate concept when reporting, and unify the code so everything goes through a single reporting path.

The compiler story is a bit different and code reviewers should read the commit message of the last commit in this PR for the tactical fix we're doing for now.
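As a rough illustration of the "single reporting path plus severity" idea in the description, here is a minimal sketch. Every name in it (`SketchSeverity`, `SketchFatalError`, `SetHandler`) is hypothetical and is not the actual FatalError API; the true/false return values are an assumption based on how the methods are used inside exception filters.

```csharp
#nullable enable
using System;

// Hypothetical names throughout; this is not the real FatalError type.
internal enum SketchSeverity { Uncategorized, Diagnostic, General, Critical }

internal static class SketchFatalError
{
    // One handler for every report, instead of separate fatal/non-fatal handlers.
    private static Action<Exception, SketchSeverity>? s_handler;

    public static void SetHandler(Action<Exception, SketchSeverity> handler)
        => s_handler = handler;

    // Reports, then returns true so that an exception filter enters the catch block.
    public static bool ReportAndCatch(
        Exception exception, SketchSeverity severity = SketchSeverity.Uncategorized)
    {
        s_handler?.Invoke(exception, severity);
        return true;
    }

    // Reports, then returns false so that the exception keeps propagating.
    public static bool ReportAndPropagate(
        Exception exception, SketchSeverity severity = SketchSeverity.Uncategorized)
    {
        s_handler?.Invoke(exception, severity);
        return false;
    }
}
```

In this sketch, whether the exception flows is decided only by which method the caller picks, and the severity is decided only by the argument, so the two choices no longer depend on each other.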

jasonmalinowski · 2021-12-16T01:27:24Z

src/Compilers/Core/Portable/Operations/ControlFlowGraph.cs

+#pragma warning restore CS0618 // ReportIfNonFatalAndCatchUnlessCanceled is obsolete
            {
                // Log a Non-fatal-watson and then ignore the crash in the attempt of getting flow graph.
                Debug.Assert(false, "\n" + e.ToString());


This Debug.Assert prevents the compiler from flagging that the "return null" below is actually disallowed by the nullable annotations.

src/Compilers/Core/Portable/SourceGeneration/GeneratorDriver.cs

jasonmalinowski · 2021-12-16T01:30:29Z

...io/Core/Def/Implementation/LanguageService/AbstractLanguageService`2.IVsLanguageDebugInfo.cs

            {
                return LanguageDebugInfo.GetLanguageID(pBuffer, iLine, iCol, out pguidLanguageID);
            }
-            catch (Exception e) when (FatalError.ReportAndCatch(e) && false)


These were all doing ReportAndCatch because they wanted the "non fatal" behavior (which, mind you, was the case either way) but still wanted to propagate so that the COM error handling still happens. We can call the regular method now.
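To make the `&& false` trick above concrete, here is a small sketch of the two filter shapes. The helper methods are hypothetical stand-ins that only record the exception, and the assumption that the replacement is a propagate-style call returning false is mine; the diff above shows only the removed line.

```csharp
using System;
using System.Collections.Generic;

internal static class FilterSketch
{
    public static readonly List<string> Log = new List<string>();

    // Hypothetical stand-ins: both record the exception and differ only in the returned bool.
    private static bool ReportAndCatch(Exception e) { Log.Add("reported: " + e.Message); return true; }
    private static bool ReportAndPropagate(Exception e) { Log.Add("reported: " + e.Message); return false; }

    // Old shape: "&& false" forces the filter to fail, so the exception still propagates
    // after being reported.
    public static void OldShape()
    {
        try { throw new InvalidOperationException("old"); }
        catch (Exception e) when (ReportAndCatch(e) && false)
        {
            // Never entered: the filter is always false.
        }
    }

    // New shape: the propagate-style call already returns false, so no "&& false" is needed.
    public static void NewShape()
    {
        try { throw new InvalidOperationException("new"); }
        catch (Exception e) when (ReportAndPropagate(e))
        {
            // Never entered: the filter returned false.
        }
    }
}
```

In both shapes the report happens inside the filter and the exception then continues up the stack, which is what lets the COM error handling described above still run.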

jasonmalinowski · 2021-12-16T01:31:32Z

src/VisualStudio/Core/Def/Interactive/VsInteractiveWindowPackage.cs

+            Action<Exception> fatalHandler = e => FaultReporter.ReportFault(e, VisualStudio.Telemetry.FaultSeverity.Critical, forceDump: false);
+            Action<Exception> nonFatalHandler = e => FaultReporter.ReportFault(e, VisualStudio.Telemetry.FaultSeverity.General, forceDump: false);
+
            SetErrorHandlers(typeof(IInteractiveWindow).Assembly, fatalHandler, nonFatalHandler);
            SetErrorHandlers(typeof(IVsInteractiveWindow).Assembly, fatalHandler, nonFatalHandler);


There is a separate piece of work (outside the scope of this PR) to remove the reflection against the interactive window package. The interactive window package should just be doing its own thing directly.

jasonmalinowski · 2021-12-16T01:45:32Z

src/Compilers/Core/Portable/InternalUtilities/FatalError.cs

+
+    internal enum ErrorSeverity
+    {
+        Uncategorized,


"Uncategorized" is the default for now, but the VS APIs recommends not using that for anything new. My plan is to merge this PR, do another cycle or two getting everything else annotated, and then removing "Uncategorized" entirely along with the defaults of the parameters. I don't want to bloat this PR too much annotating everything, and also doing the switch to remove the defaults will break in-flight PRs trying to call the Report APIs so we need to do that quickly.

src/Compilers/Core/Portable/Compilation/SemanticModel.cs

src/Compilers/Core/Portable/DiagnosticAnalyzer/AnalyzerExecutor.cs

src/Compilers/Core/Portable/DiagnosticAnalyzer/AnalyzerDriver.cs

src/Compilers/Core/Portable/Operations/ControlFlowGraph.cs

src/Compilers/Core/Portable/SourceGeneration/GeneratorDriver.cs

src/Compilers/Core/Portable/SourceGeneration/UserFunction.cs

src/Compilers/Core/Portable/InternalUtilities/FatalError.cs

333fred

Compiler changes LGTM (commit 7)

...atures/Core/Implementation/IntelliSense/QuickInfo/QuickInfoSourceProvider.QuickInfoSource.cs

tmat · 2021-12-20T21:53:53Z

src/Compilers/Core/Portable/InternalUtilities/FatalError.cs

+#else
+        public
+#endif
+            static bool ReportWithDumpAndCatch(Exception exception, ErrorSeverity severity = ErrorSeverity.Uncategorized)


Where do we use this?

I guess the one use got removed, so we could get rid of this.

Oh right: I wanted to keep it around until we have confidence that the dump telemetry requests are working again, as this is the backup until they are.

This hasn't directly used Watson in quite some time.

Our FatalError type had two "handlers", the "fatal handler" and "non fatal handler." If you called any ReportAndCatch method, that would be counted as "non fatal" because you're catching the exception and presumably dealing with it. If you called any ReportAndPropagate, that would call the "fatal" handler.

This was confusing from a few perspectives. In the IDE, both the "fatal" and "non-fatal" handlers actually did the same thing: they both sent a non-fatal report via telemetry and kept the process alive. At first, I was tempted to use fatal vs. non-fatal as a way to report different severities via telemetry, but the concept doesn't jibe well there. Sometimes we were calling ReportAndCatch when the feature absolutely broke and the user would have known; in that case we want the severity to be high. Other times we're catching an exception and gracefully dealing with it, so the severity should be lower. Similarly, propagating an exception usually means it's a severe problem, but if it's a background job that has a higher-up error handler, we may not want to treat it as high severity. In the end, the conclusion was that the fatal vs. non-fatal split didn't make sense in relation to severity. Since severity doesn't correlate with whether you want exceptions to flow or not, it's easier to add severity as a separate concept when reporting, and unify the code so everything goes through a single reporting path.

The compiler story was a bit different: the compiler would only set the "fatal" handler; the non-fatal handler was never set. This meant that a call to ReportAndPropagate would crash the process, while ReportAndCatch would keep it going. This seems sensible at first glance, except that the ReportAndCatch calls in the compiler were often swallowing exceptions and returning placeholder results from the SemanticModel; in that case we're OK with the compiler crashing as well, since the compiler is the source of those issues.

My logic for severities was this:
1. If it's anything in the core workspace that will break all features (like parsing, semantic model, etc.), it's Critical.
2. If the fault is going to prevent an explicit user gesture from working, it's Critical. So an exception breaking go to definition, a refactoring, etc., is Critical. We failed to do what the user told us.
3. Situations where we failed to do part of a feature are General. If we triggered completion but a single provider failed, we are missing content, but there's a good chance the user was still able to get what they wanted.
4. Errors while computing tags for features are General, since it's possible we'll catch up if the user had broken code.
5. Situations where we're updating background stuff and the user might never notice are Diagnostic.
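The five rules above could be written down as a lookup, roughly like this. It is only an illustrative sketch: the `FaultKind` categories and the `SeverityRules.Classify` helper are hypothetical and do not exist in the PR, where the severity is instead chosen at each call site. The enum ordering is also an assumption.

```csharp
using System;

// Severity names follow the commit message; the ordering here is an assumption.
internal enum ErrorSeveritySketch { Uncategorized, Diagnostic, General, Critical }

// Hypothetical categories, one per rule in the list above.
internal enum FaultKind
{
    CoreWorkspace,        // rule 1: parsing, semantic model, ...
    ExplicitUserGesture,  // rule 2: go to definition, a refactoring, ...
    PartialFeature,       // rule 3: e.g. a single completion provider failed
    Tagging,              // rule 4: errors while computing tags
    BackgroundUpdate      // rule 5: background work the user may never notice
}

internal static class SeverityRules
{
    public static ErrorSeveritySketch Classify(FaultKind kind) => kind switch
    {
        FaultKind.CoreWorkspace => ErrorSeveritySketch.Critical,
        FaultKind.ExplicitUserGesture => ErrorSeveritySketch.Critical,
        FaultKind.PartialFeature => ErrorSeveritySketch.General,
        FaultKind.Tagging => ErrorSeveritySketch.General,
        FaultKind.BackgroundUpdate => ErrorSeveritySketch.Diagnostic,
        _ => ErrorSeveritySketch.Uncategorized // not yet annotated
    };
}
```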

When I started down the path of unifying our fatal/non-fatal handlers, we didn't believe there was anywhere the compiler was catching exceptions while reporting them, because in the command line compiler case that would simply be equivalent to catching and silently swallowing the issue. Upon further investigation, there were a handful of places where the compiler was doing exactly that! Worse, those are places where, if an exception is thrown, we'd return invalid nulls from some APIs, which would probably make the caller crash.

We agreed that we need to remove those catches. Some of them seem to be in service of our IOperation APIs, which at some point must have been fairly unstable, and we didn't want that causing stability issues. But by now, any issues should be worked out, and if they're not, they're causing analyzers to not analyze code, which may be a problem. However, since we don't know if there are surprises lurking, we're going to keep the existing behavior for now and then remove those try/catches at the start of the 17.2 cycle, where we'll have sufficient time to react.

This commit does the following then:
1. Makes all the ReportAndCatch* APIs private if we're compiling at the compiler layer, to prevent any future use of them.
2. Adds an [Obsolete] API for the current behavior under a new name.
3. Moves the callers over to it.

Once the 17.2 cycle starts we will:
1. Delete these new APIs.
2. Clean up the scoping around ReportAndCatch to simply #if !COMPILERCORE around the whole thing.
3. Remove the flag added that's set by the IDE.

At that point, the command line compiler has no way to catch and report non-fatal errors that wouldn't be fatal if it's the command line compiler. If we decide we need that, we can change this.
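The three steps of this commit can be pictured with the following sketch. The `COMPILERCORE` symbol, the CS0618 pragma, and the method name `ReportIfNonFatalAndCatchUnlessCanceled` appear elsewhere in this PR, but the method bodies, the cancellation check (inferred only from the name), and the `Caller` class are assumptions made for illustration.

```csharp
#nullable enable
using System;

internal static class FatalErrorSketch
{
    // Step 1: private when built at the compiler layer, public otherwise.
#if COMPILERCORE
    private
#else
    public
#endif
        static bool ReportAndCatch(Exception exception)
    {
        Console.Error.WriteLine("reported: " + exception.Message);
        return true;
    }

    // Step 2: the existing behavior under a new, obsolete name.
    // Assumption: cancellation is never reported or caught.
    [Obsolete("Temporary shim for existing compiler callers; slated for removal.")]
    public static bool ReportIfNonFatalAndCatchUnlessCanceled(Exception exception)
    {
        if (exception is OperationCanceledException)
            return false;

        return ReportAndCatch(exception);
    }
}

internal static class Caller
{
    // Step 3: an existing caller moved to the obsolete API, suppressing CS0618 locally.
    public static string? TryCompute(Func<string> compute)
    {
        try
        {
            return compute();
        }
#pragma warning disable CS0618 // ReportIfNonFatalAndCatchUnlessCanceled is obsolete
        catch (Exception e) when (FatalErrorSketch.ReportIfNonFatalAndCatchUnlessCanceled(e))
#pragma warning restore CS0618
        {
            return null;
        }
    }
}
```

Any new call to the obsolete method produces a CS0618 warning unless it is explicitly suppressed, which is what discourages further use until the shim is deleted.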

ghost added Needs UX Triage Area-IDE labels Dec 3, 2021

runfoapp bot mentioned this pull request Dec 3, 2021

Microsoft.CodeAnalysis.CSharp.CommandLine.UnitTests.CommandLineTests.ArgumentParsing is flaky #58077

Open

jasonmalinowski self-assigned this Dec 3, 2021

jasonmalinowski force-pushed the include-severities-in-faults branch from 963acdb to 89dd980 Compare December 3, 2021 22:58

jasonmalinowski removed the Needs UX Triage label Dec 9, 2021

jasonmalinowski force-pushed the include-severities-in-faults branch from 89dd980 to 9e7f6ab Compare December 16, 2021 01:19

ghost added the Needs UX Triage label Dec 16, 2021

jasonmalinowski marked this pull request as ready for review December 16, 2021 01:20

jasonmalinowski requested review from a team as code owners December 16, 2021 01:20

jasonmalinowski requested a review from a team December 16, 2021 01:20

jasonmalinowski removed the Needs UX Triage label Dec 16, 2021

jasonmalinowski requested review from 333fred and jaredpar December 16, 2021 01:26

jasonmalinowski commented Dec 16, 2021

src/Compilers/Core/Portable/SourceGeneration/GeneratorDriver.cs

jasonmalinowski commented Dec 16, 2021

jasonmalinowski requested review from AbhitejJohn and ryzngard December 16, 2021 01:32

jasonmalinowski commented Dec 16, 2021

runfoapp bot mentioned this pull request Dec 16, 2021

[Flaky Test] Roslyn.VisualStudio.IntegrationTests.CSharp.CSharpCodeActions.GenerateMethodInClosedFile times out #57722

Closed

chsienki mentioned this pull request Dec 16, 2021

UserFunctionException --> OperationCanceledException in SourceGeneration/Nodes/SyntaxReceiverInputNode.cs:line 81 #58290

Closed

333fred reviewed Dec 16, 2021

jasonmalinowski mentioned this pull request Dec 16, 2021

Delete FatalError.ReportIfNonFatalAndCatchUnlessCanceled #58375

Closed

ghost added the Needs UX Triage label Dec 16, 2021

333fred approved these changes Dec 16, 2021

jasonmalinowski removed the Needs UX Triage label Dec 16, 2021

jasonmalinowski added the Area-Compilers label Dec 16, 2021

ghost added the Needs UX Triage label Dec 16, 2021

JoeRobich added UX Review Not Required and removed Needs UX Triage labels Dec 17, 2021

dibarbet approved these changes Dec 17, 2021

...atures/Core/Implementation/IntelliSense/QuickInfo/QuickInfoSourceProvider.QuickInfoSource.cs Outdated

runfoapp bot mentioned this pull request Dec 17, 2021

[Flaky Test] CSharp.CSharpCodeActions.AddUsingExactMatchBeforeRenameTracking #57982

Closed

333fred approved these changes Dec 20, 2021

tmat reviewed Dec 20, 2021

chsienki approved these changes Jan 4, 2022

jasonmalinowski added 5 commits January 4, 2022 11:13

Delete dead code

42df516

Rename WatsonReporter to FaultReporter

69e50f0

This hasn't directly used Watson in quite some time.

jasonmalinowski force-pushed the include-severities-in-faults branch from 9f0ea98 to 409f385 Compare January 4, 2022 19:28

jasonmalinowski enabled auto-merge January 4, 2022 19:29

jasonmalinowski merged commit cb30fc7 into dotnet:main Jan 4, 2022

ghost added this to the Next milestone Jan 4, 2022

jasonmalinowski deleted the include-severities-in-faults branch January 4, 2022 21:49

jasonmalinowski mentioned this pull request Jan 4, 2022

Allow non-fatal error reports to include FaultSeverity #57699

Closed

Cosifne modified the milestones: Next, 17.1.P3 Jan 5, 2022

chsienki mentioned this pull request Jan 13, 2022

Filter cancellation exceptions in generator driver #58843

Merged
