@mdbirnstiehl
Contributor

This PR closes #4345 and updates information on how Streams calculates data quality.

@github-actions
Contributor

github-actions bot commented Jan 8, 2026

✅ Vale Linting Results

No issues found on modified lines!


The Vale linter checks documentation changes against the Elastic Docs style guide.

To use Vale locally or report issues, refer to Elastic style guide for Vale.

@github-actions
Contributor

github-actions bot commented Jan 8, 2026

🔍 Preview links for changed docs

@mdbirnstiehl marked this pull request as ready for review on January 8, 2026, 21:44
@mdbirnstiehl requested a review from a team as a code owner on January 8, 2026, 21:44
Contributor

@nastasha-solomon left a comment


Just some light suggestions. Take em or leave em!


## Failure store
* **Degraded documents:** Documents from the last backing index of the stream with the `ignored` property, usually because of malformed fields or exceeding the limit of total fields when `ignore_above` is set to `false`. This component shows:
* Total number of degraded documents.
Contributor

Don't think these list items need periods since they're not complete sentences.

Suggested change
- * Total number of degraded documents.
+ * Total number of degraded documents

Contributor Author

I normally follow the Microsoft guideline where if the introductory sentence is a fragment, and the list items complete the sentence, then use punctuation, but I'm not sure if we have guidance in that particular instance in our style guide.

Member

Our official guidance is:

Use parallel construction for bulleted lists (i.e., all sentence fragments or all complete sentences, rather than a mix of the two).

A period should be added to the end of each bullet point when the bullet points contain complete sentences and/or two or more sentences. When using short fragments, bullets should not include punctuation.

https://brand.elastic.co/302f66895/p/194a3b-writing-style-guide/t/446788

Member

But that's the brand guide and I have my own brand(on) guide that I follow

Contributor Author

It gets a little hazy when there's an item that has a complete sentence (refer to...) but I don't want to rewrite all of these bullets as complete sentences. I have learned in the past to use periods on all in that situation, but it's all a style guide thang, so who knows.

Member

🚢 👍

* **Failed documents:** Documents that were rejected during ingestion because of mapping conflicts or pipeline failures. This component shows:
* Total number of failed documents that correspond with this stream from within the specified time range in the date picker. Refer to [Failure store](#streams-data-quality-failure) for more information.
* Percentage of failed documents relative to the total document count from the stream's last backing index.
* The data quality status (**Good**, **Degraded**, **Poor**).
Contributor

Suggested change
- * The data quality status (**Good**, **Degraded**, **Poor**).
+ * The data quality status (**Good**, **Degraded**, **Poor**)

* **Degraded:** Either the **Degraded documents** percentage or the **Failed documents** percentage is greater than 0 and less than or equal to 3.
* **Poor:** Either the **Degraded documents** percentage or the **Failed documents** percentage is greater than 3.
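
The thresholds in the two bullets above can be sketched in a few lines of Python. This is only an illustration, not how Streams implements the check: the function names `percentage` and `quality_status` are hypothetical, the **Good** case (both percentages equal to 0) is an assumption because the quoted lines don't define it, and letting **Poor** win when one percentage is above 3 and the other is not is also an assumption.

```python
def percentage(part: int, total: int) -> float:
    """Return `part` as a percentage of `total` (0.0 when `total` is 0)."""
    if total <= 0:
        return 0.0
    return 100.0 * part / total


def quality_status(degraded_pct: float, failed_pct: float) -> str:
    """Map the two percentages to a data quality status.

    Assumptions (not confirmed by the quoted text):
    - "Good" means both percentages are 0.
    - "Poor" takes precedence when either percentage is above 3.
    """
    worst = max(degraded_pct, failed_pct)
    if worst > 3:
        return "Poor"
    if worst > 0:
        return "Degraded"
    return "Good"


if __name__ == "__main__":
    # Hypothetical counts: 12 failed documents out of 1,000 total.
    failed = percentage(12, 1000)
    print(quality_status(degraded_pct=0.0, failed_pct=failed))
```

Taking the larger of the two percentages keeps the "either ... or" wording of the bullets in one comparison per status.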

## Failure store [streams-data-quality-failure]
Contributor

Is this section needed if there's already a whole page dedicated to explaining a failure store?


For example, for a stream called `my-stream`, Streams fetches all documents from the `my-stream::failures` index from within the specified time range in the date picker.
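
To make the `my-stream::failures` example concrete, here is a minimal Python sketch that builds a search path and request body for a stream's failure store, limited to a time range. The quoted text doesn't say how Streams issues this query, so treat everything here as an assumption: the helper name `failure_store_search` is hypothetical, the filter on `@timestamp` assumes that field carries the time used by the date picker, and the code only constructs the request without sending it.

```python
from datetime import datetime, timezone
from typing import Any, Dict, Tuple


def failure_store_search(
    stream: str, start: datetime, end: datetime
) -> Tuple[str, Dict[str, Any]]:
    """Build a search path and body for a stream's failure store.

    Hypothetical helper: it only assembles the request, it does not send it.
    """
    # The `::failures` selector targets the failure store of the data stream.
    path = f"/{stream}::failures/_search"
    body = {
        "size": 0,                 # only the count is needed, not the hits
        "track_total_hits": True,  # ask for an exact total
        "query": {
            "range": {
                # Assumption: the date picker range applies to `@timestamp`.
                "@timestamp": {
                    "gte": start.isoformat(),
                    "lt": end.isoformat(),
                }
            }
        },
    }
    return path, body


if __name__ == "__main__":
    start = datetime(2026, 1, 8, tzinfo=timezone.utc)
    end = datetime(2026, 1, 9, tzinfo=timezone.utc)
    path, body = failure_store_search("my-stream", start, end)
    print(path)
    print(body)
```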

### Required permissions
Contributor

Contributor Author

Maybe it's worth making a snippet. I would like to have everything a user needs to use Streams without sending them to another page if possible, so I put a brief description, but maybe it's not necessary.

Member

++ to the snippet idea. You could snippetize the intro too?

Member

This paragraph, that is:

A failure store is a secondary set of indices inside a data stream, dedicated to storing failed documents. A failed document is any document that, without the failure store enabled, would cause an ingest pipeline exception or that has a structure that conflicts with a data stream's mappings. In the absence of the failure store, a failed document would cause the indexing operation to fail, with an error message returned in the operation response.

Development

Successfully merging this pull request may close these issues.

[Streams] Update docs for data quality