Skip to content
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
7 changes: 7 additions & 0 deletions packages/filestream/_dev/build/docs/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -3,6 +3,13 @@
WARNING: Migrating from the "Custom Logs (Deprecated)" to "Custom Logs
(Filestream)" will cause files to be re-ingested because the state is not migrated.

IMPORTANT: The Filestream integration will only start ingesting files
**when they are 1024 bytes in size or larger**. This can be adjusted by
setting "Fingerprint length", however it will influence how files are
identified. Refer to the
[fingerprint](https://www.elastic.co/docs/reference/beats/filebeat/filebeat-input-filestream#filebeat-input-filestream-file-identity-fingerprint)
documentation for more details.

In future releases it's expected to have an automated way to migrate the state. However, this is not possible at the moment.

The current best option for minimizing the data duplication while migrating to "Custom Logs (Filestream)" is to use the 'Ignore Older' or 'Exclude Files' options.
Expand Down
5 changes: 5 additions & 0 deletions packages/filestream/changelog.yml
Original file line number Diff line number Diff line change
@@ -1,4 +1,9 @@
# newer versions go on top
- version: "1.1.4"
changes:
- description: Add warning about only ingesting files >= 1024 bytes
type: enhancement
link: https://github.com/elastic/integrations/pull/14209
- version: "1.1.3"
changes:
- description: Correct the readme
Expand Down
20 changes: 10 additions & 10 deletions packages/filestream/data_stream/generic/manifest.yml
Original file line number Diff line number Diff line change
Expand Up @@ -98,7 +98,7 @@ streams:
title: Enable symlinks
description: |
The symlinks option allows Elastic Agent to harvest symlinks in addition to regular files. When harvesting symlinks, Elastic Agent opens and reads the original file even though it reports the path of the symlink.
** Because this option may lead to data loss, it is disabled by default. **
**Because this option may lead to data loss, it is disabled by default.**
required: false
show_user: false
- name: resend_on_touch
Expand All @@ -112,7 +112,7 @@ streams:
type: text
title: Check Interval
description: |
How often Elastic Agent checks for new files in the paths that are specified for harvesting. For example Specify 1s to scan the directory as frequently as possible without causing Elastic Agent to scan too frequently. ** We do not recommend to set this value <1s. **
How often Elastic Agent checks for new files in the paths that are specified for harvesting. For example Specify 1s to scan the directory as frequently as possible without causing Elastic Agent to scan too frequently. **We do not recommend to set this value <1s.**
required: false
show_user: false
- name: ignore_older
Expand Down Expand Up @@ -142,7 +142,7 @@ streams:
type: bool
title: Close on State Changed Renamed
description: |
** Only use this option if you understand that data loss is a potential side effect. **
**Only use this option if you understand that data loss is a potential side effect.**
When this option is enabled, Elastic Agent closes the file handler when a file is renamed. This happens, for example, when rotating files. By default, the harvester stays open and keeps reading the file because the file handler does not depend on the file name.
required: false
show_user: false
Expand All @@ -157,15 +157,15 @@ streams:
type: bool
title: Close Reader EOF
description: |
** Only use this option if you understand that data loss is a potential side effect. **
**Only use this option if you understand that data loss is a potential side effect.**
When this option is enabled, Elastic Agent closes a file as soon as the end of a file is reached. This is useful when your files are only written once and not updated from time to time. For example, this happens when you are writing every single log event to a new file. This option is disabled by default.
required: false
show_user: false
- name: close_reader_after_interval
type: text
title: Close Reader After Interval
description: |
** Only use this option if you understand that data loss is a potential side effect. Another side effect is that multiline events might not be completely sent before the timeout expires. **
**Only use this option if you understand that data loss is a potential side effect. Another side effect is that multiline events might not be completely sent before the timeout expires.**
This option is particularly useful in case the output is blocked, which makes Elastic Agent keep open file handlers even for files that were deleted from the disk.
For more information see the [documentation](https://www.elastic.co/guide/en/beats/filebeat/current/filebeat-input-filestream.html#filebeat-input-filestream-close-timeout).
required: false
Expand All @@ -175,7 +175,7 @@ streams:
title: Clean Inactive
default: -1
description: |
** Only use this option if you understand that data loss is a potential side effect. **
**Only use this option if you understand that data loss is a potential side effect.**
When this option is enabled, Elastic Agent removes the state of a file after the specified period of inactivity has elapsed.
E.g: "30m", Valid time units are "ns", "us" (or "µs"), "ms", "s", "m", "h". By default cleaning inactive states is disabled, -1 is used to disable it.
required: false
Expand All @@ -185,7 +185,7 @@ streams:
title: Clean Removed
description: |
When this option is enabled, Elastic Agent cleans files from the registry if they cannot be found on disk anymore under the last known name.
** You must disable this option if you also disable Close Removed. **
**You must disable this option if you also disable Close Removed.**
required: false
show_user: false
- name: harvester_limit
Expand All @@ -209,16 +209,16 @@ streams:
title: Backoff Max
description: |
The maximum time for Elastic Agent to wait before checking a file again after EOF is reached. The default is 10s.
** Requirement: Set Backoff Max to be greater than or equal to Backoff Init and less than or equal to Check Interval (Backoff Init <= Backoff Max <= Check Interval). **
**Requirement: Set Backoff Max to be greater than or equal to Backoff Init and less than or equal to Check Interval (Backoff Init <= Backoff Max <= Check Interval).**
required: false
show_user: false
- name: fingerprint
title: Fingerprint file identity
type: bool
default: true
description: |
** Changing file_identity methods between runs may result in
duplicated events in the output. **
**Changing file_identity methods between runs may result in
duplicated events in the output.**
Uses a fingerprint generated from the first few bytes (1k is
the default, this can be configured via Fingerprint offset
and length) to identify a file instead inode + device ID.
Expand Down
7 changes: 7 additions & 0 deletions packages/filestream/docs/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -3,6 +3,13 @@
WARNING: Migrating from the "Custom Logs (Deprecated)" to "Custom Logs
(Filestream)" will cause files to be re-ingested because the state is not migrated.

IMPORTANT: The Filestream integration will only start ingesting files
**when they are 1024 bytes in size or larger**. This can be adjusted by
setting "Fingerprint length", however it will influence how files are
identified. Refer to the
[fingerprint](https://www.elastic.co/docs/reference/beats/filebeat/filebeat-input-filestream#filebeat-input-filestream-file-identity-fingerprint)
documentation for more details.

In future releases it's expected to have an automated way to migrate the state. However, this is not possible at the moment.

The current best option for minimizing the data duplication while migrating to "Custom Logs (Filestream)" is to use the 'Ignore Older' or 'Exclude Files' options.
Expand Down
2 changes: 1 addition & 1 deletion packages/filestream/manifest.yml
Original file line number Diff line number Diff line change
Expand Up @@ -3,7 +3,7 @@ name: filestream
title: Custom Logs (Filestream)
description: Collect log data using filestream with Elastic Agent.
type: integration
version: 1.1.3
version: 1.1.4
conditions:
kibana:
version: "^8.15.0 || ^9.0.0"
Expand Down