![Bretton Goods artwork](https://is5-ssl.mzstatic.com/image/thumb/Podcasts115/v4/2d/16/40/2d16400d-5539-df38-d92b-7ba5d9b115e9/mza_4183091861653125162.jpg/100x100bb.jpg)
Ep 48: An Introduction to AI Alignment with Trevor Chow
Bretton Goods
English - May 28, 2023 08:25 - 57 minutes - 30.2 MB - ★★★★★ - 1 ratingSocial Sciences Science Homepage Download Apple Podcasts Google Podcasts Overcast Castro Pocket Casts RSS feed
Previous Episode: Ep. 47: The World's Biggest Invisible Country: Indonesia
I spoke to Trevor Chow about existential risks from AI and techniques to align artificial intelligence with human goals. Specifically we talked about
An introduction to existential risk from Artificial Intelligence
Existing methods for alignment of AI models
Why RLHF might fail in large language models
Whether interpretability research might scale?
New methods being developed to make larger models safer
Regulatory frameworks for the future of AI
---
Send in a voice message: https://podcasters.spotify.com/pod/show/pradyumna-sp/message