Show HN: Evaluating LLMs on creative writing via reader usage, not benchmarks

(narrator.sh)

35 points | by Jetwu 3 days ago ago

12 comments