[CrowdStrike]: Processing of different events can lead to identical _id
s
#13720
Labels
Integration:crowdstrike
CrowdStrike
needs:triage
Team:Security-Service Integrations
Security Service Integrations team [elastic/security-service-integrations]
Integration Name
CrowdStrike [crowdstrike]
Dataset Name
crowdstrike.falcon
Integration Version
latest (1.63.0?)
Agent Version
8.15.3
Agent Output Type
elasticsearch
Elasticsearch Version
8.15.3
OS Version and Architecture
Ubuntu
Software/API Version
No response
Error Message
:response=>{"create"=>{"status"=>409, "error"=>{"type"=>"version_conflict_engine_exception", "reason"=>"[TAYxxxxxxxxxxxxxxxxxxxxxx5s=]: version conflict, document already exists (current version [1])"}}}
Event Original
No response
What did you do?
Installed current pipeline, pointed collected events at it
What did you see?
Error message above
What did you expect to see?
No error, all events loaded to elasticsearch successfully
Anything else?
The
crowdstrike/data_stream/falcon
ingest pipeline calculates a document id based on a subset of fields:We've found that often under heavy load these fields aren't specific enough to uniquely identify a record - we often see events arrive in the same second with no difference in the above fields. For example, here is a (heavily redacted) example of two records which generate the same
_id
.I propose including one (or more) additional fields in the fingerprint step to ensure IDs are unique:
offset
- Since this is the location within the log file the event starts, it is guaranteed to be unique for each entry. This is sufficient in our case, but may not be a generic fix as it assumes all incoming data is from a log file.RuleId
- As there were two different rules that matched, this would differentiate the above entries. There may however be occasions where the same rule matches twice within a second, so this may not be sufficient in all cases either.PID
- Not certain to be unique, but often will differ between events close in time. If no guaranteed surrogate key can be found, this may help reduce the chances of a collision.The text was updated successfully, but these errors were encountered: