StrategyQA and Big Bench
Data Skeptic
English - November 18, 2022 03:39 - 41 minutes - 48.2 MB - ★★★★★ - 477 ratings
Science, Technology, machinelearning, datamining, datascience, science, skepticism, statistics
Did Aristotle Use a Laptop? That question comes from the StrategyQA benchmark, and it highlights the stretch goals for current artificial intelligence systems. Answering a question like that requires chaining together several reasoning steps. Constructing a dataset of similarly challenging questions is a major undertaking. On today's episode, Mor Geva returns to share details about the creation of StrategyQA and the larger Big Bench dataset that now includes it.