Uploaded image for project: 'Network Observability'
  1. Network Observability
  2. NETOBSERV-25

R&D: eBPF-based network metrics as another data source

    • Icon: Story Story
    • Resolution: Done
    • Icon: Undefined Undefined
    • None
    • None
    • eBPF, Loki
    • None
    • False
    • False
    • Undefined
    • NetObserv - Sprint 207

      eBPF has been identified as a promising alternative data source for network metrics, in addition to our current work around netflows/IPFIX.

      It could provide more metrics than what we get from netflow/IPFIX, including DNS related ones (success rate, etc.) and HTTP ones (errors, latency...). We can also expect a much reduced overhead compared to flow logs.

      This R&D task is about doing more investigations around eBPF; A non exhaustive list:

      • look for potential reuse of existing eBPF metrics collectors (e.g. Flowmill [1] or Pixie's Stirling [2] or Cilium)
      • identify dependencies / pre-requisite (e.g. what is necessary to enable eBPF on nodes, running programs & exporting metrics)
      • figure out if the chosen infrastructure for netflows / ipfix is also relevant for that source of data (e.g. is Loki relevant?)
      • eventually initiate some PoC (or via new subtasks)

      [1] https://github.com/Flowmill/flowmill-collector ; Contribution proposal to CNCF: https://github.com/open-telemetry/community/issues/733
      [2] https://github.com/pixie-labs/pixie/tree/main/src/stirling

      Other resources:

              mmaciasl@redhat.com Mario Macias (Inactive)
              jtakvori Joel Takvorian
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

                Created:
                Updated:
                Resolved: