Spotify Podcast Dataset
The podcast dataset contains about 200k podcasts filtered to contain only documents which the creator tags as being in English or Portuguese, as well as by a language filter applied to the creator-provided title and description. We expect that there will be a small amount of multilingual content that may have slipped through these filters. Each of the episodes in the dataset includes an audio file, a text transcript, and some associated metadata. Access is guaranteed by completing a simple Google form that asks for an explanation, in a few words, of the reason for requesting the data.
Organização
Spotify
Cobertura temporal
Não informado
® 2025 Base dos Dados