No selection means all shows.

reinforcement learning

17 episodes

Filters
Date Range
31 Jan
Sat
Lex Fridman Podc...
State of AI in 2026: LLMs, Coding, Scaling Laws, China, Agents, GPUs, AGI | Lex Fridman Podcast #490
#text diffusiontech industry trendsrobotics developmentai safety ethicsanthropicartificial intelligenceclaude 45deepseekgeminigpt ossjensen huangmixture of expertsnvidiaopenaiopen source modelsreinforcement learningrlhfrlvr
03 Feb
Mon
Lex Fridman Podc...
DeepSeek, China, OpenAI, NVIDIA, xAI, TSMC, Stargate, and AI Megaclusters | Lex Fridman Podcast #459
#tsmcstargate projectsemiconductor industryagiagi developmentartificial intelligencechain of thoughtdeepseekeconomic impactexport controlsgeopoliticsh100h800kv cachenvidiaopenaireinforcement learning
05:06:05
16 Feb
Fri
Lex Fridman Podc...
Marc Raibert: Boston Dynamics and the Future of Robotics | Lex Fridman Podcast #412
#tesla optimustad mcgeerspotagiathletic intelligenceatlasbig dogboston dynamicscognitive intelligencehumanoid robotshyundailegged roboticsmarco huttermark rybertmodel predictive controlpassive dynamicsreinforcement learningrobot learning
01:43:33
06 Dec
Tue
Lex Fridman Podc...
Noam Brown: AI vs Humans in Poker and Games of Strategic Negotiation | Lex Fridman Podcast #344
#texas holdemreinforcement learningpluribusalgorithmic searchartificial intelligencecicerodiplomacyexploitative playfairgame theorygame theory optimalhuman ai interactionlibratusmeta aimonte carlo tree searchnash equilibriumnoam brown
02:29:07
22 Jan
Sat
Lex Fridman Podc...
Yann LeCun: Dark Matter of Intelligence and Self-Supervised Learning | Lex Fridman Podcast #258
#yann lecunworld modelsvicregartificial intelligencebarlow twinschinese room argumentcomplexity measurementfairmachine learning paradigmsmetanyuopen catalystpytorchreinforcement learningrobot rights ethicsself supervised learningsimclrsupervised learning
02:44:57
23 Nov
Tue
Lex Fridman Podc...
Kevin Systrom: Instagram | Lex Fridman Podcast #243
#venture capitaltwittertiktokbridgewaterelon muskfacebookfounder psychologyfoursquarefrances haugengowallainstagramjack dorseykevin systrommark zuckerbergproduct market fitray dalioreinforcement learningsocial media algorithmssteve jobs
02:44:25
13 Dec
Sun
Lex Fridman Podc...
Michael Littman: Reinforcement Learning and the Future of AI | Lex Fridman Podcast #144
#waymotom landauerteslaalphagoalphazeroartificial intelligenceautonomous vehiclesbrian christiandouglas rushkoffelon muskmachine learning historymichael littmannick bostromq learningreinforcement learningrobot and franksam harrissocial media algorithmstd gammon
01:56:19
14 Jul
Tue
Lex Fridman Podc...
Sergey Levine: Robotics and Machine Learning | Lex Fridman Podcast #108
#supervised learningsergey levinerobotics intelligenceai safety alignmentbatch rlberkeleycausal inferencecommon sense reasoningdeep reinforcement learningend to end learninggaze heuristicjacob andreaslex fridmanmoravec paradoxoffline rloff policy learningreinforcement learning
01:37:17
03 Jul
Fri
Lex Fridman Podc...
Matt Botvinick: Neuroscience, Psychology, and AI at DeepMind | Lex Fridman Podcast #106
#vs ramachandranvirtual realitysusan fiskagiartificial intelligenceconsciousnessdeepminddopaminehuman ai interactionmatt botvinickmeta learningneuroscienceparallel distributed processingprefrontal cortexreinforcement learningreward prediction error
02:00:18
03 Apr
Fri
Lex Fridman Podc...
David Silver: AlphaGo, AlphaZero, and Deep Reinforcement Learning | Lex Fridman Podcast #86
#reinforcement learningmuzeromonte carlo tree searchalphagoalphazeroartificial intelligencedavid silverdeep learningdeepmindfan huigame theorygarry kasparovlee sedollex fridmanmachine intelligencemax tegmarkmit
01:47:47
26 Feb
Wed
Lex Fridman Podc...
Marcus Hutter: Universal Artificial Intelligence, AIXI, and AGI | Lex Fridman Podcast #75
#turing testsolomonov inductionreinforcement learningaixi modelartificial general intelligencebayesian frameworksconsciousnessconways game of lifegödel machinesgoogle deepmindhutter prizeicsi modelintegrated cognitive systemjürgen schmidhuberkolmogorov complexitymarcus hutterniklas luhmannoptimal agents
01:39:41
31 Aug
Sat
Lex Fridman Podc...
Yann LeCun: Deep Learning, ConvNets, and Self-Supervised Learning | Lex Fridman Podcast #36
#yann lecunworld modelswaymo2001 a space odysseyartificial general intelligenceasimov three laws of roboticsconvolutional neural networksdeep learningfacebookimagenetmnistmodel based reinforcement learningneural networksnew york universityreinforcement learningself supervised learningsophia the robottransformers
01:15:36
03 Apr
Wed
Lex Fridman Podc...
Greg Brockman: OpenAI and AGI | Lex Fridman Podcast #17
#value alignmenttechnological determinismreinforcement learningai regulation policyai safety and alignmentartificial general intelligencecaptchaconsciousnessdota aigpt 2greg brockmanlex fridmanopenaiopenai governanceopenai lp
01:24:45
12 Mar
Tue
Lex Fridman Podc...
Leslie Kaelbling: Reinforcement Learning, Planning, and Robotics | Lex Fridman Podcast #15
#turing teststanfordshaky robotacademic publishingai safetyarchivebacheschergödelimdbjmlrjournal of machine learning researchlego mindstormsleslie kaelblingmachine learning researchmarkov decision processesmitpartially observable markov decision processesreinforcement learningrobotics engineering
01:01:01
19 Jan
Sat
Lex Fridman Podc...
Tomaso Poggio: Brains, Minds, and Machines | Lex Fridman Podcast #13
#tommaso poggiotime travelstochastic gradient descentand machinesartificial intelligencecenter for brainsconsciousnessdeep learningeinsteinethicsflowers for algernongenerative adversarial networksgoogle xmachine learningmindsmitneurosciencerebecca sachsreinforcement learningsteve jobs
01:19:59
16 Dec
Sun
Lex Fridman Podc...
Pieter Abbeel: Deep Reinforcement Learning | Lex Fridman Podcast #10
#uc berkeleyspot miniroger federerai safetyautonomous drivingberkeley robotics learning labboston dynamicse=mc²generalizationimitation learningjeff bezospepperpeter abbeelreinforcement learningrl squaredrobotics
00:42:23
24 Nov
Thu
Making Sense
Sam Harris
#53 — The Dawn of Artificial Intelligence
#weak aivalue alignment problemuc berkeleyai safetyalan turingartificial intelligenceblack box problemdavid deutschdeep learningdqn systemgoogle deepmindmachine consciousnessnorbert wienerreinforcement learningsam harrisstrong aistuart russell
00:36:42