Focus

This direction studies how language models can reason more reliably, interact in agentic systems, and operate in domains where correctness, calibration, and domain knowledge matter.

Typical Questions

  • How can prompting and agent protocols improve reasoning reliability?
  • How should multi-agent systems be evaluated when outputs are strategic or uncertain?
  • How can models be adapted to specialized technical domains without losing robustness?
  • What measurements reveal failure modes before deployment?
Selected papers

LLM Highlights

3 papers