Type: Bug
Resolution: Unresolved
Priority: Normal
Fix Version/s: None
Affects Version/s: None
Component/s: Documentation
Labels: None
Blocked: False
Ready: False

Section "1.2.4.3. Benchmark evaluation" of RHEL AI 1.3 Release notes:
https://docs.redhat.com/en/documentation/red_hat_enterprise_linux_ai/1.3/html-single/release_notes/index#benchmark-evaluation

The second sentence creates confusion about what's being evaluated:

"Red Hat Enterprise Linux AI includes the ability to run benchmark evaluations on the newly trained models. On your trained model, you can evaluate how well the model knows the model you added with the MMLU_BRANCH benchmark. For more details on benchmark evaluation, see Evaluating your new model."

The problem is the phrase "On your trained model, you can evaluate how well the model knows the model you added". It reads as circular and implies that the model "knows" itself. It may also misrepresent what MMLU benchmarks actually test.

Something like this could bring more clarity:

"On your trained model, you can evaluate how well the model knows the knowledge/skills you added"

Assignee: Kelly Brown

Reporter: Pepa Zimek

Votes: 0

Watchers: 2

Created: 2024/12/10 1:09 PM

Updated: 2024/12/10 6:25 PM
