Skip to content

docs(external): Clarify docs for Azure Event Hubs (Kafka) input #23280

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
wants to merge 1 commit into
base: master
Choose a base branch
from

Conversation

nemobis
Copy link
Contributor

@nemobis nemobis commented Jun 27, 2025

Some issues were a bit maddening to debug until we realised some simple mistakes in configs we forgot to check.

Summary

Vector configuration

How did you test this PR?

Change Type

  • Bug fix
  • New feature
  • Non-functional (chore, refactoring, docs)
  • Performance

Is this a breaking change?

  • Yes
  • No

Does this PR include user facing changes?

  • Yes. Please add a changelog fragment based on our guidelines.
  • No. A maintainer will apply the no-changelog label to this PR.

References

Notes

  • Please read our Vector contributor resources.
  • Do not hesitate to use @vectordotdev/vector to reach out to us regarding this PR.
  • Some CI checks run only after we manually approve them.
    • We recommend adding a pre-push hook, please see this template.
    • Alternatively, we recommend running the following locally before pushing to the remote branch:
      • cargo fmt --all
      • cargo clippy --workspace --all-targets -- -D warnings
      • cargo nextest run --workspace (alternatively, you can run cargo test --all)
  • After a review is requested, please avoid force pushes to help us review incrementally.
    • Feel free to push as many commits as you want. They will be squashed into one before merging.
    • For example, you can run git merge origin master and git push.
  • If this PR introduces changes Vector dependencies (modifies Cargo.lock), please
    run cargo vdev build licenses to regenerate the license inventory and commit the changes (if any). More details here.

@nemobis nemobis requested a review from a team as a code owner June 27, 2025 06:36
@github-actions github-actions bot added the domain: external docs Anything related to Vector's external, public documentation label Jun 27, 2025
@nemobis
Copy link
Contributor Author

nemobis commented Jun 27, 2025

(The automatic spellcheck is wrong.)

Some issues were a bit maddening to debug until we realised some
simple mistakes in configs we forgot to check.
@nemobis nemobis force-pushed the federicol-kafka-groups branch from daeab4e to 9d03a86 Compare June 27, 2025 06:43
Copy link
Member

@pront pront left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thank you @nemobis. Left a few comments.

@@ -505,7 +505,11 @@ base: components: sources: kafka: configuration: {
}
}
group_id: {
description: "The consumer group name to be used to consume events from Kafka."
description: """
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This file is generated and we don't modify by hand. Instead, this new description should be a /// comment on the group_id in src/sources/kafka.rs. Then you can make generate-component-docs to generate this file.

@@ -103,6 +104,9 @@ components: _kafka: {
- `tls.enabled` - Set to `true`.
- `tls.ca_file` - The certificate authority file.
- `tls.verify_certificate` - Set to `true`.
If you see AllBrokerDown errors every 5 minutes (the default interval between rebootstrap), that may
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

To fix the error, add rebootstrap in the allow.txt.

@@ -103,6 +104,9 @@ components: _kafka: {
- `tls.enabled` - Set to `true`.
- `tls.ca_file` - The certificate authority file.
- `tls.verify_certificate` - Set to `true`.
If you see AllBrokerDown errors every 5 minutes (the default interval between rebootstrap), that may
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This can be slightly rephrased.

If you see `AllBrokerDown` errors every five minutes (the default “rebootstrap” interval), it usually just means there are no new records to fetch. If you were expecting new data, double-check your group_id; if you’re actually hitting network hiccups, verify your keepalive, socket timeout and other retry/interval settings.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
If you see AllBrokerDown errors every 5 minutes (the default interval between rebootstrap), that may
If you see `AllBrokerDown` errors every five minutes (the default “rebootstrap” interval), it usually means there are no new records to fetch. If you are expecting new data, double-check your `group_id`; if you’re actually hitting network hiccups, verify your `keepalive`, socket timeout and other retry or interval settings.

@pront pront changed the title docs: Clarify docs for Azure Event Hubs (Kafka) input docs(external): Clarify docs for Azure Event Hubs (Kafka) input Jun 27, 2025
@pront pront added the no-changelog Changes in this PR do not need user-facing explanations in the release changelog label Jun 27, 2025
@@ -103,6 +104,9 @@ components: _kafka: {
- `tls.enabled` - Set to `true`.
- `tls.ca_file` - The certificate authority file.
- `tls.verify_certificate` - Set to `true`.
If you see AllBrokerDown errors every 5 minutes (the default interval between rebootstrap), that may
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
If you see AllBrokerDown errors every 5 minutes (the default interval between rebootstrap), that may
If you see `AllBrokerDown` errors every five minutes (the default “rebootstrap” interval), it usually means there are no new records to fetch. If you are expecting new data, double-check your `group_id`; if you’re actually hitting network hiccups, verify your `keepalive`, socket timeout and other retry or interval settings.

@@ -505,7 +505,11 @@ base: components: sources: kafka: configuration: {
}
}
group_id: {
description: "The consumer group name to be used to consume events from Kafka."
description: """
The consumer group name to be used to consume events from Kafka. Must be unique across Vector instances if they all are to consume the same events. If two instances (consumers) share the same group name, any event will be received by only one of them.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
The consumer group name to be used to consume events from Kafka. Must be unique across Vector instances if they all are to consume the same events. If two instances (consumers) share the same group name, any event will be received by only one of them.
The consumer group name used to consume events from Kafka. Must be unique across Vector instances if each is expected to consume the same events. If two instances (consumers) share the same group name, each event is delivered to only one instance.

@thomasqueirozb thomasqueirozb added the meta: awaiting author Pull requests that are awaiting their author. label Jun 27, 2025
@pront pront force-pushed the master branch 4 times, most recently from 1720078 to ffe54be Compare July 10, 2025 15:43
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
domain: external docs Anything related to Vector's external, public documentation meta: awaiting author Pull requests that are awaiting their author. no-changelog Changes in this PR do not need user-facing explanations in the release changelog
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants