Powerful large-scale AI models like GPT-4 are showing dramatic improvements in reasoning, problem-solving, and language capabilities. This marks a phase change for artificial intelligence—and a signal of accelerating progress to come.

In this Microsoft Research Podcast series, AI scientist and engineer Ashley Llorens hosts conversations with his collaborators and colleagues about what these models—and the models that will come next—mean for our approach to creating, understanding, and deploying AI, its applications in areas such as health care and education, and its potential to benefit humanity.

This episode features Principal Researcher Ida Momennejad. Momennejad is applying her expertise in cognitive neuroscience and computer science to better understand—and extend—AI capabilities, particularly when it comes to multistep reasoning and short- and long-term planning. Llorens and Momennejad discuss the notion of general intelligence in both humans and machines; how Momennejad and colleagues leveraged prior research into the cognition of people and rats to create prompts for evaluating large language models; and the case for the development of a “prefrontal cortex” for AI.

Learn more:

AI and Microsoft Research | Focus AreaEvaluating Cognitive Maps and Planning in Large Language Models with CogEval | Publication, October 2023Imitating Human Behaviour with Diffusion Models | Publication, May 2023Navigates Like Me: Understanding How People Evaluate Human-Like AI in Video Games | Publication, April 2023Navigation Turing Test (NTT): Learning to Evaluate Human-Like Navigation | Publication, July 2021Predictive Representations in Hippocampal and Prefrontal Hierarchies | Publication, January 2022The successor representation in human reinforcement learning | Publication, September 2017Encoding of Prospective Tasks in the Human Prefrontal Cortex under Varying Task Loads | Publication, October 2013