6 - Debate and Imitative Generalization with Beth Barnes

AXRP - the AI X-risk Research Podcast

Контент предоставлен Daniel Filan. Весь контент подкастов, включая выпуски, графику и описания подкастов, загружается и предоставляется непосредственно Daniel Filan или его партнером по платформе подкастов. Если вы считаете, что кто-то использует вашу работу, защищенную авторским правом, без вашего разрешения, вы можете выполнить процедуру, описанную здесь https://ru.player.fm/legal.

3y ago 1:58:48

MP3•Главная эпизода

One proposal to train AIs that can be useful is to have ML models debate each other about the answer to a human-provided question, where the human judges which side has won. In this episode, I talk with Beth Barnes about her thoughts on the pros and cons of this strategy, what she learned from seeing how humans behaved in debate protocols, and how a technique called imitative generalization can augment debate. Those who are already quite familiar with the basic proposal might want to skip past the explanation of debate to 13:00, "what problems does it solve and does it not solve".

Link to Beth's posts on the Alignment Forum: alignmentforum.org/users/beth-barnes

Link to the transcript: axrp.net/episode/2021/04/08/episode-6-debate-beth-barnes.html

32 эпизодов

#Science #Tech #Daniel Filan #Xrisk