Spotify Podcast Dataset

The podcast dataset contains about 200k podcasts filtered to contain only documents which the creator tags as being in English or Portuguese, as well as by a language filter applied to the creator-provided title and description. We expect that there will be a small amount of multilingual content that may have slipped through these filters. Each of the episodes in the dataset includes an audio file, a text transcript, and some associated metadata. Access is guaranteed by completing a simple Google form that asks for an explanation, in a few words, of the reason for requesting the data.

Organização

Spotify

Cobertura temporal

Não informado

Dados
Guia de uso

® 2025 Base dos Dados

Termos de uso

Política de privacidade

Contato