跳转至

05 | Preference Alignment

11 个字 预计阅读时间不到 1 分钟

  • Automated benchmarks
  • Human evaluation
  • Model-based evaluation
  • Feedback signal