Red Hat Enterprise Linux AI / RHELAI-4070

vLLM: Enabling LoRA not working with vLLM

    • Bug
    • Resolution: Done
    • Undefined
    • rhelai-1.5
    • rhelai-1.5
    • vLLM
    • None
    • False
    • None
    • False
    • Critical
    • Approved

      To Reproduce

      Steps to reproduce the behavior:

      ilab data generate

      Or, to get a traceback, start the vLLM server directly from the ilab shell on a RHEL AI 1.5 compose:

      /opt/app-root/bin/python3.11 -m vllm.entrypoints.openai.api_server \
        --host 127.0.0.1 --port 56489 \
        --model /var/home/azureuser/.cache/instructlab/models/mixtral-8x7b-instruct-v0-1 \
        --distributed-executor-backend mp \
        --served-model-name /var/home/azureuser/.cache/instructlab/models/mixtral-8x7b-instruct-v0-1 mixtral-8x7b-instruct-v0-1 models/granite-3-1-8b-lab-v2 models/granite-3-1-8b-starter-v2 models/mixtral-8x7b-instruct-v0-1 models/prometheus-8x7b-v2-0 \
        --max-num-seqs 512 \
        --enable-lora --enable-prefix-caching \
        --max-lora-rank 64 --dtype bfloat16 --lora-dtype bfloat16 \
        --fully-sharded-loras \
        --lora-modules skill-classifier-v3-clm=/var/home/azureuser/.cache/instructlab/models/skills-adapter-v3 text-classifier-knowledge-v3-clm=/var/home/azureuser/.cache/instructlab/models/knowledge-adapter-v3 \
        --tensor-parallel-size 1
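
      To attach the traceback to this ticket, the same launch can be driven from a short script that writes stdout and stderr to a file. This is only a sketch under assumptions: the paths and flags are copied from the command above, while the helper names `build_vllm_argv` and `run_and_capture`, the `enable_lora` toggle, and the log filename are hypothetical and not part of ilab or vLLM.

```python
import subprocess

MODELS_DIR = "/var/home/azureuser/.cache/instructlab/models"
BASE_MODEL = f"{MODELS_DIR}/mixtral-8x7b-instruct-v0-1"

# Served model names, in the order given in the report.
SERVED_NAMES = [
    BASE_MODEL,
    "mixtral-8x7b-instruct-v0-1",
    "models/granite-3-1-8b-lab-v2",
    "models/granite-3-1-8b-starter-v2",
    "models/mixtral-8x7b-instruct-v0-1",
    "models/prometheus-8x7b-v2-0",
]

# Adapter name -> adapter directory, as passed to --lora-modules in the report.
LORA_MODULES = {
    "skill-classifier-v3-clm": f"{MODELS_DIR}/skills-adapter-v3",
    "text-classifier-knowledge-v3-clm": f"{MODELS_DIR}/knowledge-adapter-v3",
}


def build_vllm_argv(python="/opt/app-root/bin/python3.11", enable_lora=True):
    """Assemble the server command from the report as an argv list.

    enable_lora=False (hypothetical toggle) drops every LoRA-related flag,
    which helps check whether the failure is specific to LoRA.
    """
    argv = [
        python, "-m", "vllm.entrypoints.openai.api_server",
        "--host", "127.0.0.1", "--port", "56489",
        "--model", BASE_MODEL,
        "--distributed-executor-backend", "mp",
        "--served-model-name", *SERVED_NAMES,
        "--max-num-seqs", "512",
        "--enable-prefix-caching",
        "--dtype", "bfloat16",
        "--tensor-parallel-size", "1",
    ]
    if enable_lora:
        argv += [
            "--enable-lora",
            "--max-lora-rank", "64",
            "--lora-dtype", "bfloat16",
            "--fully-sharded-loras",
            "--lora-modules",
            *[f"{name}={path}" for name, path in LORA_MODULES.items()],
        ]
    return argv


def run_and_capture(argv, log_path="vllm-startup.log"):
    """Run argv, write stdout and stderr to log_path, return the exit code.

    Note: if the server starts successfully, this call blocks until the
    server process is stopped.
    """
    with open(log_path, "w") as log:
        proc = subprocess.run(argv, stdout=log, stderr=subprocess.STDOUT)
    return proc.returncode
```

      Calling `run_and_capture(build_vllm_argv())` from the ilab shell should leave the startup output, including any traceback, in `vllm-startup.log`. Running it again with `build_vllm_argv(enable_lora=False)` gives a comparison run without the LoRA flags.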

       

      Expected behavior

      • vLLM starts and runs correctly when launched with the SDG (synthetic data generation) parameters shown above.
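
      One way to check the expected behavior once the server is up is to query the OpenAI-compatible model listing and confirm that both adapters appear. A minimal sketch, assuming the host and port from the reproduction command and assuming that vLLM lists LoRA adapters next to the base model under `/v1/models`; the helper names `missing_adapters` and `fetch_models` are hypothetical.

```python
import json
import urllib.request

# Adapter names taken from --lora-modules in the reproduction command.
EXPECTED_ADAPTERS = {
    "skill-classifier-v3-clm",
    "text-classifier-knowledge-v3-clm",
}


def missing_adapters(models_payload, expected=EXPECTED_ADAPTERS):
    """Return the expected adapter names absent from a /v1/models payload.

    The payload is assumed to follow the OpenAI list format:
    {"object": "list", "data": [{"id": "<model name>", ...}, ...]}
    """
    served = {entry["id"] for entry in models_payload.get("data", [])}
    return sorted(set(expected) - served)


def fetch_models(base_url="http://127.0.0.1:56489", timeout=5.0):
    """Fetch and decode the model listing from a running server."""
    with urllib.request.urlopen(f"{base_url}/v1/models", timeout=timeout) as resp:
        return json.load(resp)
```

      With the server running, `missing_adapters(fetch_models())` returning an empty list would indicate that both adapters were registered. A connection error from `fetch_models` means the server did not come up at all.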

      Screenshots

      • Attached Image

      Device Info (please complete the following information):

      • Hardware Specs: AMD Instinct MI300X (verification on other accelerators pending)
      • OS Version: RHEL AI 1.5
      • InstructLab Version: 0.26

      Bug impact

      Known workaround

      • Please add any known workarounds.

      Additional context

              rh-ee-jgroenen Joseph Groenenboom
              fzatlouk@redhat.com František Zatloukal