« Back

What is this?

This is a search engine for transcribed podcasts. The transcripts are automatically generated by WhisperX and Whisper.cpp. The site is updated whenever there's a new episode.

Search is powered by Tantivy and the website by Django. Video player is VideoJS, and CSS by Bootstrap
Labels are generated using Spacy and summaries by OpenHermes.
Share and Download icons by Icons8.

If you have any questions or suggestions, email me, or send me a message on Twitter / Reddit. I'm also on Discord.

Infowars
The source for the Infowars transcripts are videos from 2008 to present day, and audio before 2008 and where the video is missing. The transcripts are not perfect and do contain errors. They are also available for download.

Other shows
Other shows are collected from multiple sources, and are extracted from videos if a pure audio source is unavailable.

API

The search engine has an API that can be accessed at /search/api.

Getting shows, episodes and transcripts

    GET /search/api/shows/
    GET /search/api/shows/<id>/
    GET /search/api/shows/<id>/episodes/
    GET /search/api/shows/<id>/episodes/<id>/

    examples:
      curl -L https://fight.fudgie.org/search/api/shows/
      curl -L https://fight.fudgie.org/search/api/shows/aj/
      curl -L https://fight.fudgie.org/search/api/shows/aj/episodes/
      curl -L https://fight.fudgie.org/search/api/shows/aj/episodes/20040624_Thu_Alex/

Latest episodes

    GET /search/api/latest/

    example:
      curl -L https://fight.fudgie.org/search/api/latest/

Searching for keywords or phrases

    GET /search/api/search/<query>
    
    query required parmeters:
      q          terms to search for

    query optional parameters:
      s          comma separated list of show paths to search in (aj,sr,nn), default all sources
      exact      on or off, exact/verbatim query, skips stemming of terms
      offset     offset in the results
      limit      number of results to return
      order      episode, recent, or score
      invert     on or off, inverts the sources to search in
      start_date YYYY-MM-DD
      end_date   YYYY-MM-DD

    exmaple:
      curl -L "https://fight.fudgie.org/search/api/search/?s=kf&keywords=knowledge%20fight&exact=on&invert=on&order=recent&limit=10"

The URLs returns a JSON object. There is a max limit of 10000 episodes in each query, to get the next set of episodes, append ?offset=10000.

Installing jq and piping the results to it will make the output more readable:

    curl https://fight.fudgie.org/search/api/shows/ | jq
        

Downloads

I provide downloads of the archives I've collected here as torrents.

Radio: The Alex Jones Show (1998-2024)
Archived from InfoWars, GCN, PrisonPlanet, Internet Archive, and other sources.

6,515 episodes, about 20,500 hours of audio, last episode 2024-06-18. Coverage is spotty before 2003-05, and early audio converted from RealMedia.

264 GB, MP3 format

torrent magnet
Radio: The Alex Jones Show (2024-08 - End)
Archived from InfoWars.

99 episodes, missing from previous collection. Updated 2024-11-14 after final broadcast.

8 GB, MP3 format

torrent magnet"
Radio: Other hosts (2009-2024)
American Journal, Barnes, Bermas, FirePower, Knight, Mike Adams, War Room
Archived from InfoWars

3,538 episodes, first episode 2009-03, last episode 2024-06-19.

137 GB, MP3 format

torrent magnet
Video: The Alex Jones Show (2008-2024)
Archived from InfoWars, PrisonPlanet, Internet Archive, and other sources

5,013 episodes, from 2008-04 to 2024-06-18. Most were downloaded as MP4, early videos converted from FLV and WMV.
Resolutions go from 320x240, to 480x270, 720x480, 700x394, 768x432 and finally 1280x720 from 2019-09-06.

9.11 TB, MP4 format

torrent magnet