WEBINAR

How can we improve AI alignment?

Meet the speakers

David Jurgens (1)

Dr David Jurgens
Assistant Professor, University of Michigan

Hua Shen (1)

Dr Hua Shen
Research Fellow, University of Washington, RAISE Center


Location pin


Wednesday 25th September 2024, 12:00 EDT /
17:00 BST


Video


Live webinar


Time icon


1 hour

LLMs give set responses - but whose responses are they giving?

Humans in the loop are informing these responses, but the way they inform the LLM is not standardized.

Dr David Jurgens and Dr Hua Shen discuss how we can get AI to align better with human practices, so that models can work in more complex situations, with nuance.


We'll discuss:

✅ What AI alignment is, and why it matters.

✅ Major gaps in alignment, and their implications for safety and future research.

✅ What good data collection for AI alignment looks like.

 

G2 Badges