The term hallucination creates a dangerous false impression in enterprise settings. A simple expert-driven framework for evaluating LLM outputs on two axes: completeness and accuracy.
#evaluation
1 post
- The Operation Was Successful, But The Patient Died