Skip to content

A simple script to generate an RSS feed for self-hosted audio/videos that can be used with Apple Podcast, Amazon Podcasts and more.

License

Notifications You must be signed in to change notification settings

vpetersson/podcast-rss-generator

Use this GitHub action with your project
Add this Action to an existing workflow or create a new one
View on Marketplace

Repository files navigation

Podcast RSS Generator

Python Unit Tests and Linting

Description

This an RSS Feed Generator is designed to generate an RSS feed for audio/video podcasts, reading metadata and episode data from a YAML file.

It assumes that you self-host your video episodes somewhere (e.g. S3/GCS/R2) as well as the output of this script. You can then point YouTube/Spotify/Apple Podcast to this path.

This tool was written for my podcast Nerding Out with Viktor to solve for the fact that Apple's Podcast Connect require you to self-host videos in order to publish.

I also wrote an article on how you can use this tool to automatically turn a video podcast into audio in this article.

Features

  • Generates RSS feed for audio/video podcasts
  • Reads podcast metadata and episode data from a YAML file
  • Converts ISO format dates to RFC 2822 format
  • Attempts to follow The Podcast RSS Standard

Known Issues

  • Videos uploaded to YouTube via RSS will be uploaded as audio.
  • Spotify can't handle videos via RSS yet. You will be able to see the episodes in Podcaster, but they will not be processed and sent to Spotify properly. This is apparently a known issue that they are working on resolving.

The workaround for the above issues is to manually upload the episodes.

Installation

Prerequisites

  • Python 3.8 or higher
  • pip (Python package installer)
  • ffmpeg

Setup

  1. Clone the Repository
$ git clone https://github.com/vpetersson/podcast-rss-generator.git
$ cd podcast-rss-generator
  1. Install Dependencies
$ pip install -r requirements.txt

Optional: Install yamllint, xq and flake8.

Usage

$ python rss_generator.py --help
usage: rss_generator.py [-h] [--input-file INPUT_FILE] [--output-file OUTPUT_FILE]
                        [--skip-asset-verification] [--dry-run]

Process some parameters.

options:
  -h, --help            show this help message and exit
  --input-file INPUT_FILE
                        Input YAML file
  --output-file OUTPUT_FILE
                        Output XML file
  --skip-asset-verification
                        Skip HTTP HEAD and ffprobe checks for asset URLs (use for testing/fake URLs)
  --dry-run             Validate configuration file only, do not generate RSS feed
  1. Prepare Your Data Files

Copy podcast_config.example.yaml to podcast_config.yaml and fill out your podcast metadata and eepisodes.

  1. Validate Your Configuration (Dry-run)

Before generating the RSS feed, you can validate your configuration file using the --dry-run flag:

$ python rss_generator.py --dry-run

This will:

  • Check if your YAML file has valid syntax
  • Validate all required fields are present
  • Check URL formats for links, asset URLs, and images
  • Verify email addresses and ISO date formats
  • Validate episode structure and optional fields
  • Exit with code 0 if valid, or code 1 if errors are found

Example output for a valid config:

✓ Config validation passed!
✓ Dry-run completed successfully.

Example output for an invalid config:

✗ Config validation failed:
  - Invalid email format: 'invalid-email'
  - Episode 1: Invalid publication_date format 'invalid-date'
  - Episode 1: Invalid asset_url format 'not-a-url'
  1. Generate the RSS Feed

The podcast_config.yaml file contains two main sections: metadata and episodes.

Metadata Section

This section contains general information about your podcast:

Key Description Notes
title The title of your podcast. Required
description A description of your podcast. Required. Markdown is supported.
link The URL of the main website for your podcast. Required. Also the default link for episodes if not provided per-episode.
rss_feed_url The public URL where your generated podcast_feed.xml will be hosted. Required
language The language of the podcast (e.g., en-us). Optional. Default: en-us.
email The contact email for the podcast owner. Required. Backward compatibility: itunes_email. Used for <podcast:locked>.
author The author name(s). Required. Backward compatibility: itunes_author.
category The primary category for iTunes. Optional. Backward compatibility: itunes_category.
image The URL for the main podcast cover art (JPEG or PNG, 1400x1400 to 3000x3000 pixels). Required. Also the default image for episodes if not provided per-episode.
explicit Indicates if the podcast contains explicit content. Optional (true/false). Default: false. Backward compatibility: itunes_explicit. Can be overridden per-episode.
use_asset_hash_as_guid Use a content hash or ETag from the asset file's headers as the episode's <guid>. Prioritizes x-amz-checksum-sha256, then x-goog-hash (MD5 part), then ETag. Warning: This can break GUID permanence if asset files change or are re-uploaded, potentially causing re-downloads for subscribers. Optional (true/false). Default: false (uses asset_url).
copyright A string containing the copyright notice for the podcast. Optional.
podcast_locked Tells platforms not to import the feed without confirming ownership via the email address. Optional (yes/no). Default: no. Based on Podcast Standards Project.
podcast_guid A globally unique, permanent identifier (UUID recommended) for the entire podcast show. Optional. If omitted, a stable UUIDv5 is generated from rss_feed_url. Strongly recommended to set explicitly. Based on Podcast Standards Project.

Episodes Section

This section is a list of your podcast episodes. Each episode is an object with the following fields:

Key Description Notes
title The title of the episode. Required.
description A description of the episode. Required. Markdown is supported.
publication_date The date and time the episode was published. Episodes with future dates will not be included. Required. ISO 8601 format (e.g., 2023-01-15T10:00:00Z).
asset_url The direct URL to the audio or video file for the episode. Required.
link The URL for a webpage specific to this episode. Optional. Defaults to the global link from metadata.
image The URL for artwork specific to this episode (same format requirements as the main podcast image). Optional. Defaults to the global image from metadata.
episode The episode number. Optional. Integer.
season The season number. Optional. Integer.
episode_type Defines the type of content for the episode. Optional. Can be full (default), trailer, or bonus.
explicit Indicates if this specific episode contains explicit content. Optional (true/false). Overrides the global explicit setting for this episode. Backward compatibility: itunes_explicit.
transcripts A list of transcript files associated with the episode. Each item is an object with url (required), type (required), language (opt), rel (opt). Optional. See example config for structure.
  1. Generate the RSS Feed

Make sure your YAML is valid:

$ yamllint podcast_config.yaml

Generate your podcast_feed.xml file:

$ python rss_generator.py

Now copy your podcast_feed.xml to S3/GCS/R2 using a tool like s3cmd, aws or mc (from Minio).

You can verify your RSS feed using a tool like Podbase.

Optional: Optimize video

If you're dealing with video podcasts, the file size matters for obvious reasons. What I do is to export the video as h624 from my video editor (which I upload to YouTube and Spotify).

I then re-encode the h264 video to h265 for other platforms using ffmpeg with the following command (on macOS):

$ ffmpeg -i input.mp4 \
    -tag:v hvc1 \
    -c:v hevc_videotoolbox \
    -crf 28 \
    -preset slowest \
    -c:a aac \
    -b:a 128k \
    -movflags faststart \
    output.mp4

Usage with GitHub Actions

To incorporate this action into your workflow, follow these steps:

  1. Create a Workflow File:

    • In your repository, create a new file under .github/workflows, for example, rss_workflow.yml.
  2. Set Up the Workflow:

    • Use the following configuration as a starting point:
name: Generate Podcast RSS Feed

on: [push, pull_request]

env:
  R2_BUCKET: 'foobar'
jobs:
  generate-rss:
    runs-on: ubuntu-latest

    steps:
      - name: Checkout repository
        uses: actions/checkout@v2

      - name: Install yamllint
        run: |
          sudo apt-get update
          sudo apt-get install yamllint

      - name: Lint YAML file
        run: yamllint podcast_config.yaml

      - name: Run Podcast RSS Generator
        uses: vpetersson/podcast-rss-generator@master
        with:
          input_file: 'podcast_config.yaml'
          output_file: 'podcast_feed.xml'

      - name: Validate output with xq
        run: |
          wget -q https://github.com/sibprogrammer/xq/releases/download/v1.2.3/xq_1.2.3_linux_amd64.tar.gz
          tar xfz xq_1.2.3_linux_amd64.tar.gz
          cat podcast_feed.xml | ./xq

      - uses: actions/upload-artifact@v2
        with:
          name: podcast_feed.xml
          path: podcast_feed.xml

  deploy:
    runs-on: ubuntu-latest
    needs: generate-rss
    if: github.ref == 'refs/heads/master'
    steps:
      - uses: actions/download-artifact@v2
        with:
          name: podcast_feed.xml

      - name: Install mc
        run: |
          wget -q https://dl.min.io/client/mc/release/linux-amd64/mc
          chmod +x mc

      - name: Set up mc
        env:
          R2_ENDPOINT: ${{ secrets.R2_ENDPOINT }}
          R2_KEY_ID: ${{ secrets.R2_KEY_ID }}
          R2_KEY_SECRET: ${{ secrets.R2_KEY_SECRET }}
        run: ./mc alias set r2-storage ${R2_ENDPOINT} ${R2_KEY_ID} ${R2_KEY_SECRET}

      - name: Copy file
        run: ./mc cp podcast_feed.xml r2-storage/${R2_BUCKET}/
  1. Customize Your Workflow:

    • Adjust paths to the YAML configuration and the output XML files as per your repository structure.
    • Ensure the uses field points to vpetersson/podcast-rss-generator@master (or specify a specific release tag/version instead of master).
  2. Commit and Push Your Workflow:

    • Once you commit this workflow file to your repository, the action will be triggered based on the defined events (e.g., on push or pull request).

Usage with Docker

Build the image with the following and tagging with latest:

docker build -t podcast-rss-generator:latest .

To spin up a container from the built image that uses a custom config file and writes out to myfeed.xml.

docker run --rm -v .:/opt podcast-rss-generator:latest --output-file /opt/myfeed.xml --input-file /opt/custom_podcast_config.yaml

N.B. The switches -v share files between host and container and --rm automatically removes the container when it exits.

Inputs

  • input_file: Path to the input YAML file. Default: podcast_config.yaml.
  • output_file: Path for the generated RSS feed XML file. Default: podcast_feed.xml.

Running Tests

To run unit tests, use:

$ python -m unittest discover tests

Contributing

Contributions to this project are welcome! Please follow these steps:

  1. Fork the repository.
  2. Create a new branch for your feature.
  3. Commit your changes.
  4. Push to the branch.
  5. Submit a pull request.

License

MIT License

About

A simple script to generate an RSS feed for self-hosted audio/videos that can be used with Apple Podcast, Amazon Podcasts and more.

Topics

Resources

License

Stars

Watchers

Forks

Contributors 3

  •  
  •  
  •