• Icon: Feature Feature
    • Resolution: Done
    • Icon: Normal Normal
    • ols-1.0
    • None
    • None
    • Strategic Product Work
    • False
    • Hide

      None

      Show
      None
    • False
    • OCPSTRAT-895Openshift LightSpeed GA
    • 0% To Do, 0% In Progress, 100% Done
    • 0
    • Program Call

      Instrumentation of OLS service and components to cover:
       

      • Export metrics for latency invoking internal components (e.g., invoking memory, invoking RAG, invoking agents)
      • Export metrics from LLM backend: tokens, request latency, rate of fail request
      • Export metrics of the number of concurrent requests, requests per second, histogram metric of latency per session (distributed)
      • Expose TLS-enabled metrics endpoint (e.g., for Prometheus)
      • Create an OLS monitoring dashboard
      • Collect metrics of statistics of documents retrieved (e.g., how many times a document is retrieved for answering customer questions)
        • The intent is to have an understanding of which documents are the most useful, to serve as indirect anonymized metrics of the type of topic areas the end-users are using OLS to get assistance with, etc

              gausingh@redhat.com Gaurav Singh
              wcabanba@redhat.com William Caban
              Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

                Created:
                Updated:
                Resolved: