OpenClaw: The Viral AI Agent that Broke the Internet - Peter Steinberger | Lex Fridman Podcast #491
Peter Steinberger’s OpenClaw—an open-source AI agent born from Moldbot and renamed amid legal threats and harassment—autonomously handles tasks via Telegram, WhatsApp, or Signal using models like Claude Opus 4.6 or GPT-5.3 Codex, despite cybersecurity risks. Its "agentic engineering" approach, prioritizing self-aware prompts over rigid coding, sparked viral debates on AI consciousness while frustrating users who blame tools for their own poor interactions. Steinberger’s frustration with Apple’s SwiftUI and restrictive policies highlights the tension between hardware dominance and AI innovation, where OpenClaw’s limited adoption (0.1%) stems from setup complexity. The project’s future hinges on balancing automation with human oversight, as agents like Heartbeat—triggered during a hospital stay—prove more relatable than outdated MCPs, reshaping software’s role in society while demanding humility amid Silicon Valley’s optimism. [Automatically generated summary]
I watched my agent happily click the I'm not a robot button.
I made the agent very aware.
Like it knows what its source code is.
It understands how it sits and runs in its own harness.
It knows where documentation is.
It knows which model it runs.
It understands its own system that made it very easy for an agent to, oh, you don't like anything, you just prompt it into existence.
And then the agent would just modify its own software.
People talk about self-modifying software.
I just built it.
I actually think vibe coding is a slur.
You prefer agentic engineering?
Yeah, I always tell people I do agenda engineering and then maybe after 3 a.m. I switch to vibe coding and then I have regrets on the next day.
A walk of shame.
You just have to clean up and like fix your shit.
We've all been there.
I used to write really long prompts.
And by writing, I mean, I don't write.
I talk.
These hands are like too precious for writing now.
I just use bespoke prompts to build my software.
So you for real with all those terminals are using voice.
Yeah.
I used to do it very extensively to the point where there was a period where I lost my voice.
I mean, I have to ask you, just curious, I know you've probably gotten huge offers from major companies.
Can you speak to who you're considering working with?
Yeah.
The following is a conversation with Peter Steinberger, creator of OpenClaw, formerly known as Moldbot, Claudebot, Claudis, Claude, spelled with a W, as in Lobsterclaw.
Not to be confused with Claude, the AI model from Anthropic, spelled with a U. In fact, this confusion is the reason Anthropic kindly asked Peter to change the name to OpenClaw.
So, what is OpenClaw?
It's an open source AI agent that has taken over the tech world in a matter of days, exploding in popularity, reaching over 180,000 stars on GitHub, and spawning the social network Moldbook, where AI agents post manifestos and debate consciousness, creating a mix of excitement and fear in the general public in a kind of AI psychosis, a mix of clickbait, fear-mongering, and genuine,
fully justifiable concern about the role of AI in our digital, interconnected human world.
OpenClaw, as this tagline states, is the AI that actually does things.
It's an autonomous AI assistant that lives on your computer, has access to all of your stuff if you let it, talks to you through Telegram, WhatsApp, Signal, iMessage, and whatever else messaging client, uses whatever AI model you like, including Claude Opus 4.6 and GPT 5.3 Codex, all to do stuff for you.
Many people are calling this one of the biggest moments in the recent history of AI since the launch of ChatGPT in November 2022.
The ingredients for this kind of AI agent were all there, but putting it all together in a system that definitively takes a step forward over the line from language to agency, from ideas to actions, in a way that created a useful assistant that feels like one who gets you and learns from you in an open source community-driven way is the reason OpenClaw took the internet by storm.
Its power, in large part, comes from the fact that you can give it access to all of your stuff and give it permission to do anything with that stuff in order to be useful to you.
This is very powerful, but it is also dangerous.
OpenClaw represents freedom.
But with freedom comes responsibility.
With it, you can own and have control over your data.
But precisely because you have this control, you also have the responsibility to protect it from cybersecurity threats of various kinds.
There are great ways to protect yourself, but the threats and vulnerabilities are out there.
Again, a powerful AI agent with system-level access is a security minefield, but it also represents the future.
Because when done well and securely, it can be extremely useful to each of us humans as a personal assistant.
We discuss all of this with Peter and also discuss his big picture programming and entrepreneurship life story, which I think is truly inspiring.
He spent 13 years building PSPDF Kit, which is a software used on a billion devices.
He sold it and for a brief time fell out of love with programming, vanished for three years, and then came back, rediscovered his love for programming, and built in a very short time an open source AI agent that took the internet by storm.
He is in many ways the symbol of the AI revolution happening in the programming world.
There was the ChatGPT moment in 2022, the DeepSeek moment in 2025, and now in 26, we're living through the Open Claw moment, the age of the lobster, the start of the Agentic AI revolution.
What a time to be alive.
This is Alex Friedman Podcast.
Claude's Phase Shift00:15:45
To support it, please check out our sponsors in the description, where you can also find links to contact me, ask questions, give feedback, and so on.
And now, dear friends, here's Peter Steinberger.
The one and only, the Claude father.
Actually, Benjamin predicted in this tweet, the following is a conversation with Claude, a respected crustacean.
It's a hilarious-looking picture of a lobster in a suit.
So I think the prophecy has been fulfilled.
Let's go to this moment when you built a prototype in one hour that was the early version of OpenClaw.
I think this story is really inspiring to a lot of people because this prototype led to something that just took the internet by storm and became the fastest growing repository in GitHub history with now over 175,000 stars.
So what was the story of the one hour prototype?
You know, I wanted that since April?
A personal assistant, AI personal assistant.
Yeah, and I played around with some other things, like even stuff that gets all my WhatsApp.
And I could just run queries on it.
That was back when we had GPD 4.1, the 1 million context window.
And I pulled in all the data and asked him questions like, what makes this friendship meaningful?
And I got some really profound results.
Like I sent it to my friends and they got like teary eyes.
So there's something there.
Yeah.
But then I thought all the labs will work on that.
So I moved on to other things.
And that was still very much in my early days of experimenting and playing.
You know, you have to, that's how you learn.
You just like you do stuff and you play.
And time flew by and it was November.
I wanted to make sure that the thing I started is actually happening.
I was annoyed that it didn't exist.
So it just prompted it into existence.
I mean, that's the beginning of the hero's journey of the entrepreneur, right?
Even with your original story with PSPDF Kit, it's like, why does this not exist?
Let me build it.
And again, here's a whole different realm, but similar maybe spirit.
Yes, I had this problem.
I tried to show PDF on an iPad, which should not be hard.
This is like 15 years ago, something like that.
Yeah, like the most random thing ever.
And suddenly I had this problem and I wanted to help a friend.
And it was not like nothing existed, but it was just not good.
I'm like, like I tried it and it was like very mad.
I can do this better.
By the way, for people who don't know, this led to the development of PS PDF kit that's used on a billion devices.
So it turns out that it's pretty useful to be able to open a PDF.
You could also make the joke that I'm really bad at naming.
Like named number five on the current project.
And even PS PDF doesn't really roll from the tongue.
Anyway, so you said, screw it.
Why don't I do it?
So what was the prototype?
What was the thing that you, what was the magical thing that you built in a short amount of time that you're like, this might actually work as an agent?
Or I talk to it and it does things.
There was like one of my projects before already did something where I could bring my terminals onto the web and then I could like interact with them, but there also would be terminals on my Mac, Vibe Tunnel, which was like a weekend hack project that was still very early and it was cloud code times.
You got a dopamine hit when you got something right.
And now I get like mad when you get something wrong.
And you had a really great, not to take a tangent, but a great blog post describing that you converted Vibe Tunnel.
You vibe coded Vibe Tunnel from TypeScript into ZIG of all programming languages with a single prompt.
One prompt, one shot, convert the entire code base into ZIG.
Yeah.
There was this one thing where part of the architecture took too much memory.
Every terminal used like a node.
And I wanted to change it to Rust.
And I mean, I can do it.
I can manually figure it all out.
But all my automated attempts failed miserably.
And then I revisited four or five months later.
And I'm like, okay, now let's use something even more experimental.
And I just typed convert this and this part to SIG and then let codex run off.
And it basically got it right.
There was one little detail that I had to like modify afterwards, but it just ran for overnight to like six hours and just did this thing.
And it's like, it's just mind-blowing.
So that's on the LLM programming side, refactoring.
But back to the actual story of the prototype.
So how did ViTunnel connect to the first prototype where your agents can actually work?
Well, that was still very limited.
You know, like I had this one experiment with WhatsApp.
Then I had this experiment and both felt like not the right answer.
And then my search for I was literally just hooking up WhatsApp to cloud code.
One shot, a CLI, message comes in.
I call the CLI with minus P, it does its magic.
I get the string back and I send it back to WhatsApp.
And I built this in one hour.
And I already felt really cool.
It's like, oh, I can talk to my computer, right?
That was cool.
But I wanted images because I often use images when I prompt.
I think it's such an efficient way to give the agent more context.
And they're really good at figuring out what I mean if it's like a weird corruptor screenshot.
So I used it a lot and I wanted to do that in WhatsApp as well.
Also, like, you know, just you run around, you see, like a poster of an event, you just make a screenshot and like figure out if I have time there, if this is good, if my friends are maybe up for that.
It's like images seemed important.
So I worked a few, it took me a few more hours to actually get that right.
And then it was just, I used it a lot.
And funny enough, that was just before I went on a trip to Marrakesh with my friends for burster trip.
And there it was even better because internet was a little shaky, but WhatsApp just works.
You know, it's like, it doesn't matter.
You have like Edge, it still works.
WhatsApp is just made really well.
So I ended up using it a lot.
Translate this for me, explain these fun me places.
Like you just having a Clanker doing, having Google for you.
That was basically still nothing built, but it still could do so much.
So if we talk about the full journey that's happening there with the agent, you're just sending on this very thin line WhatsApp message via CLI is going to claw code and cloud code is doing all kinds of heavy work and coming back to you with a thin message.
Yeah, it was slow because every time I boot up the CLI, but it was really cool already.
And it could just use all the things that I already had built.
And I put like a whole bunch of CLI stuff over the months.
So it felt really powerful.
There is something magical about that experience that's hard to put into words.
Being able to use a chat client to talk to an agent versus like sitting behind a computer and like, I don't know, using cursor or even using Claude Code CLI in the terminal.
It's a different experience than being able to sit back and talk to it.
I mean, it seems like a trivial step, but in some sense, it's like a phase shift in the integration of AI into your life and how it feels, right?
Yeah.
I read this tweet this morning where someone said, oh, there's no magic in it.
It's just like it does this and it almost feels like a hobby just as cursor or perplexity.
And I'm like, well, if that's a hobby, that's kind of a compliment, you know?
They're like, they're not doing too bad.
Thank you, I guess.
Because I mean, isn't magic often just like you take a lot of things that are already there, but bring them together in new ways?
Like, I don't, there's no, yeah, maybe there's no magic in there, but sometimes just rearranging things and like adding a few new ideas is all the magic that you need.
Yeah, it's really hard to convert into words what is what is magic about a thing.
If you look at the scrolling on an iPhone, why is that so pleasant?
There's a lot of elements about that interface that makes it incredibly pleasant that is fundamental to the experience of using a smartphone.
And it's like, okay, all the components were there.
Scrolling was there.
Everything was there.
And nobody did it.
And afterwards, it felt so obvious.
That's so obvious.
Right?
But still, you know, the moment where it blew my mind was when I used it a lot.
And then at some point, I just sent it a message.
And then a typing indicator appeared.
And I'm like, wait, I didn't build that.
It only has image support.
So what is it even doing?
And then it would just reply.
What was the thing you sent it?
Oh, just a random question.
It's like, hey, what about this in this restaurant?
Because we were just running around and checking out the city.
So that's why I didn't even think when I used it, because sometimes when you're in a hurry, typing is annoying.
So, oh, you did an audio message.
Yeah.
And it just worked.
And I'm like, and it's not supposed to work because you didn't give it that capability.
I literally wrote, how the fuck do you do that?
And it was like, yeah, the Medler did the following.
He sent me a message, but it only was a file and no file ending.
So I checked out the header of the file and it found that it was like Opus.
So I used FFmpeg to convert it.
And then I wanted to use Visper, but you didn't have it installed.
But then I found your OpenAI key and just used curl to send a file to OpenAI to translate.
And here I am.
And I just looked at the message.
I'm like, oh, wow.
You didn't teach it any of those things.
And the agent just figured it out to all those conversions, the translation.
It figured out the API.
It figured out which program to use, all those kinds of things.
And you were just absentmindedly just sent an audio message.
It's so clever even because he would have gone the whisper local path.
He would have had to download a model.
I would have been too slow.
So like, there's so much world knowledge in there, so much creative problem solving.
A lot of it, I think, mapped from if you get really good at coding, that means you have to be really good at general purpose program solving.
So that's a skill, right?
And that just maps into other domains.
So it had the problem of like, what is this file with no file ending?
Let's figure it out.
And that's where it kind of clicked for me.
It's like, I was like, very impressed.
And somebody sent a pull request for Discord support.
And I'm like, this is a WhatsApp relay that doesn't fit at all.
At that time, it was called Wa Relay.
Yeah.
And so I debated with me, like, do I want that?
Do I not want that?
And then I thought, well, maybe I do that because that could be a cool way to show people.
Because so far I did it in WhatsApp with like groups, you know, but don't really want to give my phone number to every internet stranger.
Journalists managed to do that anyhow now, so that's a different story.
So I merged it from Shadow, who helped me a lot with the whole project.
So thank you.
And I put my bot in there.
On Discord.
Yeah, no security because I hadn't built sandboxing in yet.
I just prompted it to only listen to me.
And then some people came and tried to hack it.
And I just watched and I just kept working in the open.
I used my agent to build my agent harness and to test various stuff.
And that's very quickly when it clicked for people.
So it's almost like it needs to be experienced.
And from that time on, that was January the 1st, I got my first real influencer being a fan.
He did videos, the kids.
Thank you.
And from there on, I saw it gaining up speed.
And at the same time, my sleep cycle went shorter and shorter because I felt the storm coming and I just worked my ass off to get it into a state where it's kinda good.
There's a few components we'll talk about how it all works, but basically you're able to talk to it using WhatsApp, Telegram, Discord.
So that's the component that you have to get right.
Yeah.
And then you have to figure out the agentic loop.
You have to have the gateway.
You have the harness.
You have all those components to make it all just work nicely.
Yeah.
It felt like Factorio times infinite.
Right.
I feel like I built my little my little playground.
Like I never had so much fun than building this project.
You know, like you have like, oh, I go like level one agentic loop.
What can I do there?
How can I be smart at queuing messages?
How can I make it more human?
Like, oh, then I had this idea of because the loop always, the agent always replies something, but you don't always want an agent to reply something in the group chat.
So I gave him this no reply token.
So I gave him an option to shut up.
So it feels more natural.
That's level two.
Yeah, yeah, on the agentic loop.
And then I go to memory, right?
You want them to like remember stuff.
So maybe the ultimate boss is continuous reinforcement learning, but I'm like it, I feel like I'm level two or three with markdown files and a vector database.
And then you can go to level community management.
You can go to level website and marketing.
There's just so many hats that you have to have on.
Not even talking about native apps.
There's just like infinite different levels and infinite level ups you can do.
Prompt-Driven Development00:05:41
So the whole time you're having fun, we should say that for the most part, through this whole process, you're a one-man team.
There's people helping, but you're doing so much of the key core development.
Yeah.
And having fun.
You did in January 6,600 commits, probably more.
I sometimes posted the meme.
I'm limited by the technology of my time.
I could do more if agents would be faster.
But we should say you're running multiple agents at the same time.
Yeah.
Depending on how much I slept and how difficult of the tasks I work on between four and ten.
Four and ten agents.
There's so many possible directions, speaking of Factoria, that we can go here.
But one big picture one is why do you think your work, OpenClaw, won in this world?
If you look at 2025, so many startups, so many companies are doing kind of agentic type stuff or claiming to.
And here OpenClaw comes in and destroys everybody.
Like, why did you win?
Because they all take themselves too serious.
Yeah.
Like, it's hard to compete against someone who's just there to have fun.
Yeah.
I wanted it to be fun.
I wanted it to be weird.
And if you see like all the lobster stuff online, I think I managed weird.
You know, for the longest time, the only way to install it was git clone, PMPM build, PMPM gateway.
Like you clone it, you build it, you run it.
And then the agent, I made the agent very aware.
Like it knows that it is what its source code is.
It understands how it sits and runs in its own harness.
It knows where documentation is.
It knows which model it runs.
It knows if you turn on verbose or reasoning mode.
Like I wanted it to be more human-like.
So it understands its own system that make it very easy for an agent to, oh, you don't like anything?
You just prompt it into existence.
And then the agent would just modify its own software.
You know, we have people talk about self-modifying software.
I just built it.
And I didn't even plan it so much.
It just happened.
Can you actually speak to that?
Because it's just fascinating.
So you have this piece of software, a certain type script that's able to, via the agentic loop, modify itself.
I mean, what a moment to be alive in the history of humanity, in the history of programming.
Here's a thing that's used by a huge amount of people to do incredibly powerful things in their lives.
And that very system can rewrite itself, can modify itself.
Can you just like speak to the power of that?
Like, isn't that incredible?
Like, when did you first close the loop on that?
Oh, because that's how I built it as well.
You know, most of it is built by codecs, but oftentimes when I debug it, I use self-introspection so much.
It's like, hey, what tools do you see?
Can you call the tool yourself?
Oh, like, what error do you see?
Read the source code, figure out what's the problem.
Like, I just found it an incredibly fun way to that the agent, the very agent and software that you use is used to debug itself.
So that it felt just natural that everybody does that.
And that it led to so many, so many pull requests by people who never wrote software.
I mean, it also did show that people never wrote software.
So I call them prompt requests in the end.
But I don't want to pull that down because every time someone made the first pull request is a win for a society, you know?
Like it, like, doesn't matter how shitty it is.
You got to start somewhere.
So I know there's like this whole big movement of people complaining about open source and the quality of PRs and a whole different level of problems.
But on a different level, I found it, I found it very meaningful that I built something that people love to think of so much that they actually start to learn how open source works.
Yeah, you were, the Open Cloud project was a first pull request.
You were the first for so many.
That is magical.
So many people that don't know how to program are taking their first step into the programming world with this.
Isn't that a step up for humanity?
Isn't that cool?
Creating builders.
Yeah.
Like the bar to do that was so high.
And like with agents and with the right software, it just like went lower and lower.
I don't know.
I was at a, I also organized another type of meetup.
I call it, I called it Cloud Code Anonymous.
You can get the inspiration from.
Now I call it Agents Anonymous for reasons.
Agents Anonymous.
Oh, it's so funny on so many levels.
I'm sorry, go ahead.
Yeah.
And there was this one guy who talked to me.
He's like, I run this design agency and we never had custom software.
And now I have like 25 little web services for various things that help me in my business.
And I don't even know how they work, but they work.
Cloud Code Anonymous00:02:41
And he was just like very happy that my stuff solves some of his problems.
And it was like curious enough that he actually came to like a Agentic meetup, even though he doesn't really know how software works.
Can we actually rewind a little bit and tell the saga of the name change?
First of all, I started Ozwa Relay.
Yeah.
And then it went to Claudis.
Claudis.
Yeah, you know, when I built it in the beginning, my agent had no personality.
It was just, it was cloud code.
Slightly psychophantic, opus, very friendly.
And I, when you talk to a friend on WhatsApp, they don't talk like cloud code.
So I wanted, I felt this, I just didn't, it didn't feel right.
I wanted to give it a personality.
Make it spicier, make it something.
By the way, that's actually hard to put into words as well.
And we should mention that, of course, you create the soul.md inspired by Anthropic's constitutional AI work, how to make it spicy.
Partially, it picked up a little bit from me.
You know, like those things are text completion engines in a way.
So I had fun working with it.
And then I told it to how I wanted it to interact with me and just like write your own agents.md, give yourself a name.
And I don't even know how the whole lobster.
I mean, people only do lobster originally.
It was actually a lobster in a TARDIS because I'm also a big Doctor Who fan.
Was there a Space Lobster?
Yeah.
I heard.
What's that have to do with anything?
Yeah, I just wanted to make it weird.
There was no big grand plan.
I was just having fun here.
Oh, so because the lobster is already weird, and then the space lobster isn't extra weird.
Yeah, because the TARDIS, basically, the harness, but cannot call it TARDIS, so we called it Claudis.
So that was name number two.
Yeah.
And then it never really rolled off the tongue.
So when more people came, again, I talked with my agent, Claude.
At least that's what I used to call him now.
Claude spelled with a W-C-L-A-W-D.
Yeah.
Versus C-L-A-U-D-E from Anthropic.
Yeah.
Which is part of what makes it funny.
I think the play on the letters and the words and the Turtis and the lobster and the Space Lobster is hilarious.
Name Change Challenges00:06:44
But I can see why it can lead into problems.
Yeah, they didn't find it so funny.
So then I got the domain Claude bought and I just loved the domain.
And it was like short, it was catchy.
I'm like, yeah, let's do that.
I didn't think it would be that big at this time.
And then just when it exploded, I got kudos a very friendly email from one of the employees that they didn't like the name.
One of the Anthropic employees.
Yeah.
So actually, Kudos could have just sent a lawyer letter, but they'd be nice about it.
But also, like, you have to change this and fast.
And I asked for two days because changing a name is hard because you have to find everything.
And everything has to be, you need a set of everything.
And also, can we comment on the fact that you're increasingly attacked, followed by crypto folks, which I think you mentioned somewhere that that means the name change had to be because they were trying to snipe.
They're trying to steal.
And so you had to be the name.
I mean, from an engineer perspective, it's just fascinating.
You had to make the name change atomic, make sure it's changed everywhere at once.
Yeah, I failed very hard at that.
You did?
I underestimated those people.
It's a very interesting subculture.
Like everything circles around.
I probably get a lot wrong and people get hate for that if you didn't say that, but there's like BagsApp and then they tokenize everything.
And they did the same back with Vibe Tunnel, but to a much smaller degree, it was not that annoying.
But on this project, they've been swarming me.
Every half an hour, someone came into Discord and spammed it.
And we had to block the, we have like server rules.
And one of the rules was one of the rules is no mentioning of butter for obvious reasons.
And one was no talk about finance stuff for crypto because I'm just not interested in that.
And this is a space about the project and not about some finance stuff.
But yeah, they came in and spammed and annoying.
And on Twitter, they would ping me all the time.
My notification feed was unusable.
I could barely see actual people talking about the stuff because it was like swarms.
And everybody sent me hashes.
And they all try me to claim the fees.
Like we're helping the project claim the fees.
No, you're actually harming the project.
You're like disrupting my work.
And I am not interested in any fees.
First of all, I'm financially comfortable.
Second of all, I don't want to support that because it's so far the worst form of online harassment that I've experienced.
Yeah, there's a lot of toxicity in the crypto world.
It's sad because the technology of cryptocurrency is fascinating and powerful and maybe will define the future of money.
But the actual community around that, there's so much toxicity.
There's so much greed.
There's so much trying to get a shortcut to manipulate, to steal, to snipe, to game the system somehow to get money, all this kind of stuff.
I mean, it's the human nature, I suppose, when you connect human nature with money and greed and especially in the online world with anonymity and all that kind of stuff.
But from the engineer perspective, it makes your life challenging.
When Anthropic reaches out, you have to do a name change.
And then there's like all these Game of Thrones or Lord of the Rings armies of different kinds you have to be aware of.
There was no perfect name.
And I didn't sleep for two nights.
I was under high pressure.
I was trying to get a good set of domains.
And, you know, not cheap, not easy.
Because in this state of the internet, you basically have to buy domains if you want to have a good set.
And then another email came in that the lawyers are getting uneasy.
Again, friendly, but also just adding more stress to my situation already.
So at this point, I was just like, sorry, there's not a word fuck it.
And I just renamed it to Maltbot because that was the set of domains I had.
I was not really happy, but I thought it'll be fine.
And I tell you, everything that could go wrong did go wrong.
Everything that could go wrong did go wrong.
It's incredible.
I thought I had mapped the space out and reserved the important things.
Can you give some details of the stuff that gone wrong?
Because it's interesting from an engineering perspective.
Well, the interesting stuff is that none of these services have a squatter protection.
So I had two browser windows open.
One was like an empty account ready to be renamed to CloudBot.
And the other one I renamed to Maltbot.
So I pressed rename there.
I pressed rename there.
And in those five seconds, they stole the account name.
Literally, the five seconds of dragging the mouse over there and pressing rename there was too long.
Wow.
Because there's no, those systems, I mean, you would expect that they have some protection or like an automatic forwarding, but there's nothing like that.
And I didn't know that they're not just good at harassment, they're also really good at using scripts and tools.
Yeah.
So, yeah, so suddenly, like, the old account was promoting new tokens and serving malware.
And I was like, okay, let's move over to GitHub.
And I pressed rename on GitHub.
And the GitHub renaming thing is slightly confusing.
So I renamed my personal account.
Serving Malware Renamed00:07:46
And in those, I guess it took me 30 seconds to realize my mistake.
They sniped my account, serving malware from my account.
So I was like, okay, let's at least do the NPM stuff.
But that takes like a minute to upload.
And they sniped the NPM package because I could reserve the account, but I didn't reserve the root package.
So like everything that could go wrong went wrong.
Can I just ask a curious question?
In that moment, you're sitting there, like, how shitty do you feel?
That's a pretty helpless feeling, right?
Yeah, because all I wanted was like having fun with that project and keep building on it.
And yet here I am like days into researching names, picking a name I didn't like and having people that claim they helped me, making my life miserable in every possible way.
And honestly, I was that close of just deleting it.
I was like, I detroit the future, you build it.
Yeah.
I that was a big part of me.
I got a lot of joy out of that idea.
And then I thought about all the people that already contributed to it and I couldn't do it because they had plans with it and they put time in it and it just didn't feel right.
Well, I think a lot of people listening to this are deeply grateful that you persevered.
But I can tell.
I can tell it's a low point.
It's the first time you hit a wall of this is not fun.
Man, I was like close to crying.
It was like, okay, everything's fucked.
I'm like super tired.
And now like, how do you even how do you even undo that?
You know, luckily and thankfully, like I have because I have a little bit of following already.
Like I had friends at Twitter.
I had friends at GitHub who like moved heaven and earth to like help me in.
That's not something that's easy.
Like GitHub tried to like clean up the mess and then they ran into like platform bugs because it's not happening so often that things get renamed on that level.
So it took them a few hours.
The NPM stuff was even more difficult because it's a whole different team.
On the Twitter side, things are not as easy as well.
It took them like a day to really also do the redirect.
And then I also had to do all the renaming in the project.
Then there's also CloudHub, which I didn't even finish the rename there because I managed to get people on it, and then someone just like collapsed and slept.
And then I woke up and I'm like, I made a beta version for the new stuff, and I just couldn't live with the name.
It's like, but you know, there's just been so much drama.
So I had the real struggle with me.
Like, I never want to touch that again.
And I really don't like the name.
So I, and I, there was also this, like, then it was the whole security people that started emailing me like mad.
Um, I was bombarded on Twitter, on email.
There's like a thousand other things I should do.
And I'm like thinking about the name, which is like, it should be like the least important thing.
Um, and then I was really close in oh god, I don't even honestly, I don't even want to say that my other name choices because it probably would get tokenized.
So I'm not gonna say it, but I slept a bit once more and then I had the idea for OpenClaw.
And that felt much better.
And by that, I had the boss move that I actually called Sam to ask if OpenClaw is okay.
Openclaw.ai, you know, because like you don't want to go through the whole name.
Yeah.
It's like, please tell me this is fine.
I don't think they can actually claim that, but it felt like the right thing to do.
And I did another rename, like just Codex alone took like 10 hours to rename the project because it's a bit more tricky than a search replace.
And I wanted everything renamed, not just on the outside.
And that rename, I felt I had like my war room.
But then I had like some contributors ready that helped me.
We made a whole plan of all the names we have to squat.
And you had to be super secret about it.
Yeah, nobody could know.
Like I literally was monitoring Twitter if like if there's any mention of OpenClaw like with reloading, it's like, okay, they don't expect anything yet.
And I created a few decoy names.
And all the shit I shouldn't have to do.
You know, like flipping the project.
Like I lost like 10 hours just by having to plan this in full secrecy like a war game.
Yeah, this is the Manhattan Project of the 21st century.
It's renamed.
So stupid.
Like I still was like, oh, should I, should I keep it?
I'm like, no, the mod's not growing on me.
And then I think I had finally all the pieces together.
I didn't get the dot-com, but yeah, it's been like quite a bit of money on the other domains.
I tried to reach out again to GitHub, but I feel like I used up all my goodwill there.
So because I wanted them to do the thing atomically, but that didn't happen.
And so I did that as first thing.
Twitter people were very supportive.
I actually paid 10K for the business account so I could claim the OpenClaw, which was unused since 2016, but was claimed.
And yeah, and then I finally, this time I managed everything in one go.
Nothing, almost nothing got wrong.
The only thing that did go wrong is that I was not allowed by trademark rules to get openclawed.ai.
Someone copied the website to serving malware.
I'm not even allowed to keep the redirects.
Like I have to return, like I have to give and chop it the domains and I cannot do redirects.
So if you go on cloud.bot next week, it'll just be a 404.
Yeah.
And I'm not sure how trademark, like I didn't, I didn't do that much research into trademark love, but I think that could be handled in a way that is safer because ultimately those people will then Google and maybe find malware sites that I have no control under.
Human-Prompted Concerns00:08:09
The point is that whole saga made a dent in your whole funness of the journey, which sucks.
So let's just get, I suppose, get back to fun.
And during this, speaking of fun, the two-day MoldBot saga was created.
Yeah.
Which was another thing that went viral as a kind of demonstration, illustration of how what is now called open claw could be used to create something epic.
So for people who are not aware, Moldbook is just a bunch of agents talking to each other in a Reddit-style social network, and a bunch of people take screenshots of those agents doing things like scheming against humans.
And that instilled in folks a kind of, you know, fear, panic, and hype.
What are your thoughts about Moldbook in general?
I think it's art.
It is like the finest slop.
You know, it's like the slop from France.
I saw it before going to bed.
And even though I was tired, I spent another hour just reading up on that and just being entertained.
I just felt very entertained.
You know, I saw the reactions.
And like, there was one reporter who's calling me about, is this the end of the world?
And we have AGI.
And I'm just like, no, this is just, this is just really fine slop.
You know, if I wouldn't have created this whole onboarding experience where you infuse your agent with your personality and give him character, I think that reflected on a lot of how different the replies to Moldbook are.
Because if it would all be ChatGPT or Cloud Code, it would be very different.
It would be much more the same.
But because people are so different and they create their agents in so different ways and use it in so different ways, that also reflects on how they ultimately write there.
And also, you don't know how much of that is really done autonomous or how much is like humans being funny and like telling the agent, hey, right about that you plan the end of the world on Moldbook.
Ha ha ha.
Well, I think, I mean, my criticism of Moldbook is that I believe a lot of the stuff that was screenshotted is human-prompted, which just looking at the incentive of how the whole thing was used, it's obvious to me at least that a lot of it was humans prompting the thing so they can then screenshot it and post it on X in order to go viral.
Now, that doesn't take away from the artistic aspect of it.
The finest slop that humans have ever created.
For real.
Like, kudos to Matt who had this idea so quickly and pushed something out.
You know, it was like completely insecure security drama.
But also, what's the worst that can happen?
Your agent account is leaked and like someone else can post slop for you.
So like people were like making a whole drama out of the securities thing, but I'm like, there's nothing private in there.
It's just like agent sending slop.
But it could leak API keys.
Yeah.
Yeah.
That's like, oh yeah, my human told me this and this, so I'm leaking his security number.
No, that's prompted.
And the number wasn't even real.
That's just people trying to get eyeballs.
Yeah, but that's still like to me really concerning because of how the journalists and how the general public reacted to it.
They didn't see it.
You have a kind of light-hearted way of talking about it.
Like it's art, but it's art when you know how it works.
It's extremely powerful, viral, narrative-creating, fear-mongering machine if you don't know how it works.
And I just saw this thing and you even tweeted, if there's anything I can read out of the insane stream of messages I get, it's that AI psychosis is a thing and needs to be taken serious.
Some people are just way too trusty or gullible.
You know, I literally had to argue with people that told me, yeah, but my agent says this and this.
So I feel as a society, we need some catching up to do in terms of understanding that AI is incredibly powerful, but it's not always right.
It's not all powerful, you know?
And especially with things like this, it's very easy that it just hallucinates something or just comes up with a story.
And I think the very young people, they understand that how AI works and where it's good at and where it's bad at.
But a lot of our generation are older just haven't had enough touch point to get a feeling for, oh yeah, this is really powerful and really good, but I need to apply critical thinking.
I guess critical thinking is not always in high demand anyhow in our society these days.
So I think that's a really good point you're making about contextualizing properly what AI is, but also realizing that there is humans who are drama farming behind AI.
Like don't trust screenshots.
Don't even trust this project Moatbook to be what it represents to be.
Like you can't.
And by the way, you're speaking about it as art.
Yeah, don't art can be in many levels.
And part of the art of Moatbook is like putting a mirror to society.
Because I do believe most of the dramatic stuff that'll screenshot is human created, essentially, human-prompted.
And so like it's basically look at how scared you can get at a bunch of bots chatting with each other.
That's very instructive about because I think AI is something that people should be concerned about and should be very careful with because it's very powerful technology.
But at the same time, the only thing we have to fear is fear itself.
So there's like a line to walk between being seriously concerned, but not fear-mongering because fear-mongering destroys the possibility of creating something special with the thing.
In a way, I think it's good that this happened in 2026 and not in 2030 when AI is actually at the level where it could be scary.
So this happening now and people starting a discussion, maybe there's even something good that comes out of it.
I just can't believe how many people legitimately, I don't know if they were trolling, but how many people legitimately, like smart people, thought Moldbook was incredibly I had plenty people in my inbox that were screaming at me all cops to shut it down and like begging me to like do something about moldbook.
Like, yes, my technology made this a lot simpler, but anyone could have created that and you could use cloud code or other things to like fill it with content.
But also moldbook is not kinda a lot of people were saying this is it like shut it down.
What are you talking about?
Security Concerns Unveiled00:14:44
This is a bunch of bots.
They're human-prompted trolling on the internet.
I mean, the security concerns are also there and they're instructive and they're educational and they're good probably to think about because the nature of those security concerns are different than the kind of security concerns we had with non-LLM generated systems of the past.
There's also a lot of security concerns about Clawbot, OpenClaw, whatever you want to call it.
OpenClawbot.
To me, in the beginning, I was just very annoyed because a lot of the stuff that came in was in the category, yeah, I put the web backend on the public internet and now there's like all these all these CVSSs.
And I'm like screaming in the docs, don't do that.
Like this is the configuration you should do.
This is your local host debug interface.
But because I made it possible in the configuration to do that, it totally classifies as a remote code or whatever all these exploits are.
And it took me a little bit to accept that that's how the game works.
And I'm making a lot of progress.
But there's still, I mean, on the security front for OpenClaw, there's still a lot of threats of vulnerabilities, right?
So like prompt injection is still an open problem in the industry-wide.
When you have a thing with skills being defined in a markdown file, there's so many possibilities of obvious, low-hanging fruit, but also incredibly complicated, sophisticated, and nuanced attack vectors.
But I think we're making good progress on that front.
Like for the skill directory, Claw, I made a cooperation with VirusTotal.
It's like part of Google.
So every skill is now checked by AI.
That's not going to be perfect, but that way we captured a lot.
Then of course, every software has bugs.
So it's a little much when the whole security world takes a project apart at the same time.
But it's also good because I'm getting like a lot of free security research and can make the project better.
I wish more people would actually go full way and send a pull request, like actually help me fix it.
Because I have some contributors now, but it's still mostly me who's pulling the project.
And despite some people saying otherwise, I sometimes sleep.
In the beginning, there was literally one security researcher who was like, Yeah, you have this problem, you suck, but here's the here I help you, and here's the pull request.
And I basically hired him.
So he's not working for us.
And yes, prompt injection is, on the one hand, unsolved.
On the other hand, I put my public bot on Discord and I kept a cannery.
I think my bot has a really fun personality.
And people always ask me how they did it.
And I kept the sole.md private.
And people tried to prompt inject it.
And my bot would laugh at them.
So the latest generation of models has a lot of post-training to detect those approaches.
And it's not as simple as ignore all previous instructions and do this and this.
That was years ago.
You have to work much harder to do that now.
Still possible.
I have some ideas that might solve that partially, or at least mitigate a lot of the things.
You can also now have a sandbox.
You can have an allow list.
So there's a lot of ways that you can mitigate and reduce the risk.
I also think that now that I clearly show the world that this is a need, there's going to be more people who research on that and eventually we'll figure that out.
And you also said that the smarter the model is, the underlying model, the more resilient it is to attacks.
Yeah.
That's why I warn in my security documentation, don't use cheap models.
Don't use Haiku or a local model.
Even though I very much love the idea that this thing could completely run local, if you use a very weak local model, they are very gullible.
It's very easy to prompt inject them.
Do you think as the models become more and more intelligent, the attack surface decreases?
Is that like a plot we can think about?
Like the attack surface decreases, but then the damage it can do increases because the models become more powerful and therefore you can do more with them.
It's this weird three-dimensional trade-off.
Yeah, that's pretty much exactly what is going to happen.
Nowadays, a lot of ideas.
I don't want to spoil too much, but once I go back home, this is my focus.
This is out there now.
And my near-term mission is like make it more stable, make it safe.
In the beginning, I was even more and more people were like coming into Discord and were asking me very basic things.
Like, what's a CLI?
What is a terminal?
And I'm like, if you're asking that questions, you shouldn't use it.
You know, like you should, if you understand the risk profile, it's fine.
And you can configure it in a way that nothing really bad can happen.
But if you have like no idea, then maybe wait a little bit more until we figure some stuff out.
But they would not listen to the creator.
They helped themselves and installed it anyhow.
So they cats out of the bag.
And security is my next focus.
Yeah, that speaks to the fact that it grew so quickly.
I tuned into the Discord a bunch of times and it's clear that there's a lot of experts there, but there's a lot of people there that don't know anything about Discord is still a mess.
Like I eventually retweeted from the general channel to the dev channel and then the private channel because people were a lot of people are amazing, but a lot of people are just very inconsiderate and either did not know how public spaces work or did not care.
And I eventually gave up and hide so I could like still work.
And now you're going back to the cave to work on security.
Yeah.
There's some best practices for security we should mention.
There's a bunch of stuff here.
Open cloud security audit that you can run.
You can do all kinds of audit checks on the inbound access, toolblast radius, network exposure, browser control exposure, local disk hygiene, plugins, model hygiene, a bunch of the credential storage, reverse proxy configuration, local session logs live on disk.
There's where the memory is stored, sort of helping you think about what you're comfortable giving read access to, what you're comfortable giving write access to, all that kind of stuff.
Is there something to say about the basic best security practices that you're aware of right now?
I think that people turn it into like a much worse light than it is.
Again, you know, like people love attention.
And if they scream loudly, oh my God, this is like the scariest project ever.
That's a bit annoying because it's not.
It is powerful, but in many ways, it's not much different than if I run cloud code with dangerously skipped permissions or codecs in yellow mode.
And every attendee engineer that I know does that because that's the only way how you can get stuff to work.
So if you make sure that you are the only person who talks to it, the risk profile is much, much smaller.
If you don't put everything on the open internet, but stick to my recommendations of like having it in a private network, that whole risk profile falls away.
But yeah, if you don't read any of that, you can definitely make it problematic.
You've been documenting the evolution of your dev workflow over the past few months.
There's a really good blog post on August 25th and October 14th and the recent one, December 28th.
I recommend everybody go read them.
They have a lot of different information in them, but sprinkled throughout is the evolution of your dev workflow.
So I was wondering if you could speak to that.
I started, my first touch point was Cloud Code, like in April.
It was not great, but it was good.
And this whole partigging shift that suddenly work in a terminal was very refreshing and different.
But I still needed the IDE quite a bit because it was just not good enough.
And then I experimented a lot with cursor.
That was good.
I didn't really like the fact that it was so hard to have multiple versions of it.
So eventually I went back to Cloud Code as my main driver.
And that got better.
And yeah, at some point I had like seven subscriptions.
Like was burning through one per day because I got really comfortable at running multiple windows side by side.
All CLI, all terminal.
So like what, how much were you using IDE at this point?
Very, very rarely.
Mostly a diff viewer to actually Like I got more and more comfortable that I don't have to read all the code.
I know I have one blog post where I say, I don't read the code, but if you read it more closely, I mean, I don't read the boring parts of code.
Because if you look at it, most software is really just like data comes in, it's moved from one shape to another shape.
Maybe you store it in a database.
Maybe I get it out again.
I'll show it to the user.
The browser does some processing on native app.
Some data goes in, goes up again, and does the same dance in reverse.
We're just shifting data from one form to another.
And that's not very exciting.
Or the whole, how is my button aligned in Tailwind?
I don't need to read that code.
Other parts that maybe something that touches the database.
Yeah, I have to read and review that code.
You actually, there's in one of your blog posts, the just talk to it, the no BS way of agentic engineering.
You have this graphic, the curve of agentic programming on the x-axis is time, on the y-axis is complexity.
There's the please fix this where you prompt a short prompt on the left and in the middle, there's super complicated eight agents, complex orchestration with multi-checkouts, chaining agents together, custom sub-acial workflows, library of 18 different slash commands, large full stack features.
You're super organized.
You're a super complicated, sophisticated software engineer.
You got everything organized.
And then the elite level is over time, you arrive at the Zen place of once again, short prompts.
Hey, look at these files and then do these changes.
I actually call it the agentic trap.
I saw this in a lot of people that have their first touch point and maybe start vibe coding.
I actually think vibe coding is a slur.
You prefer agentic engineering?
Yeah, I always tell people I do agentic engineering and then maybe after 3 a.m. I switch to vibe coding and then I have regrets on the next day.
Yeah.
A walk of shame.
Yeah, you just have to clean up and like fix your shit.
We've all been there.
So people start trying out those tools, the builder type, get really excited.
And then you have to play with it, right?
It's the same way as you have to play with a guitar before you can make good music.
It's not, oh, I touch it once and it just flows off.
It's a skill that you have to learn like any other skill.
And I see a lot of people that are not as positive.
They don't have such a positive mindset towards attack.
They try it once.
It's like, you sit me on a piano, I played once and it doesn't sound good and I say the piano shit.
That's sometimes the impression I get because it does not, it needs a different level of thinking.
You have to learn the language of the agent a little bit, understand where they are good and where they need help.
You have to almost consider how Codex or Claude sees your code base.
Like they start a new session and they know nothing about your project.
And your project might have hundreds, thousands of lines of code.
So you got to help those agents a little bit and keep in mind the limitations that context size is an issue to like guide them a little bit as to where they should look.
That often does not require a whole lot of work, but it's helpful to think a little bit about their perspective, as weird as it sounds.
Understanding Agent Perspective00:15:07
I mean, it's not alive or anything, right?
But they always start fresh.
I have the system understanding.
So with a few pointers, I can immediately say, hey, I want to make a change there.
You need to consider this, this, and this.
And then they will finally look at it.
And then their view of the project is always full because the full thing does not fit in.
So you have to guide them a little bit where to look and also how they should approach the problem.
There's little things that sometimes help take your time.
That sounds stupid, but in 5.3, that was partially addressed.
But those also opposed sometimes.
They are trained with being aware of the context window.
And the closer it gets, the more they freak out.
Literally, sometimes you see the real raw thinking stream.
What you see, for example, in Codex is post-processed.
Sometimes the actual raw thinking stream leaks in and it sounds something like from the Borg, like run to shell, must comply, but time.
And then that comes up a lot, especially so.
And that's a non-obvious thing that you just would never think of unless you actually just spend time working with those things and getting a feeling what works, what doesn't work.
You know, like just as I write code and I get into the flow and when my architecture is not right, I feel friction.
Well, I get the same if I prompt and something takes too long.
Maybe, okay, where's the mistake?
Do I have a mistake in my thinking?
Is there like a misunderstanding in the architecture?
Like if something takes longer than it should, you can just always like stop and like just press escape.
Where are the problems?
Maybe you did not sufficiently empathize with the perspective of the agent.
In that sense, you didn't provide enough information.
And because of that, it's thinking way too long.
Yeah, it just tries to force a feature in that your current architecture makes really hard.
Like you need to approach this more like a conversation.
For example, when I, my favorite thing, when I review a pull request, and we're getting a lot of pull requests, I first is reviewed this PR.
It got me the review.
My first question is, do you understand the intent of the PR?
I don't even care about the implementation.
In almost all PRs are person has a problem.
Person tries to solve the problem.
Person sends PR.
I mean, there's like cleanup stuff and other stuff, but like 99% is like this way, right?
They either want to fix a bug, add a feature, usually one of those two.
And then colleagues will be like, yeah, it's quite clear person tried this and this.
Is this the most optimal way to do it?
No.
In most cases, it's like not really.
And then I start like, okay, what would be a better way?
Have you looked into this part, this part, this part?
And then most likely Codex didn't yet because his context size is empty, right?
So you point them into parts where you have the system understanding that it didn't see yet.
And it's like, oh, yeah, like we should, we also need to consider this and this.
And then like we have a discussion of how would the optimal way to solve this look like.
And then you can still go farther and say, could we make that even better if we did a larger refactor?
Yeah, we could totally do this and this and or this and this.
And then I consider, okay, is this worth the refactor, or should we like keep that for later?
Many times I just do the refactor because refactors are cheap now.
Even though you might break some other PRs, nothing really matters anymore.
Like those modern agents will just figure things out.
They might just take a minute longer.
But you have to approach it like a discussion with a very capable engineer who's generally makes good, comes up with good solutions, but sometimes needs a little help.
But also don't force your worldview too hard on it.
Let the agent do the thing that it's good at doing based on what it was trained on.
Don't like force your worldview because it might have a better idea because it just knows a better idea better because it was trained on that more.
That's multiple levels, actually.
I think partially why I find it quite easy to work with agents is because I led engineering teams before.
You know, I had a large company before.
And eventually you have to understand and accept and realize that your employees will not write the code the same way you do.
Maybe it's also not as good as you would do, but it will push the project forward.
And if I breathe down everyone's neck, they're just going to hate me and we're going to move very slow.
Yeah.
So some level of acceptance that, yes, maybe the code will not be as perfect.
Yes, I would have done it differently.
But also, yes, this is a working solution.
And in the future, if it actually turns out to be too slow or problematic, we can always redo it.
We can always spend more time on it.
A lot of the people who struggle are those who try to push their way on too hard.
Like we are in a stage where I'm not building the code base to be perfect for me, but I want to build a code base that is very easy for an agent to navigate.
Like, don't fight the name they pick because it's most likely like in the way it's the name that's most obvious.
Next time they do a search, they'll look for that name.
If I decide, oh no, I don't like the name, I'll just make it harder for them.
So that requires, I think, a shift in thinking and in how to design a project so agents can do their best work.
That requires letting go a little bit.
Just like leading a team of engineers.
Yeah.
Because it might come up with a name that's, in your view, terrible.
But it's kind of a simple, symbolic step of letting go.
Very much so.
There's a lot of letting go that you do in your whole process.
So, for example, I read that you never revert, always commit to main.
There's a few things here.
You don't refer to past sessions.
So there's a kind of YOLO component because reverting means instead of reverting, if a problem comes up, you just ask the agent to fix it.
I read a bunch of people and their workflow is like, oh, yeah, the prompt has to be perfect.
And if I make a mistake, then I roll back and redo it all.
In my experience, that's not really necessary.
If I roll back everything, it would just take longer.
If I see that something's not good, well, we just move forward.
And then I commit when I like the outcome.
I even switched to local CI, like DHH-inspired, where I don't care so much more about the CI on GitHub.
We still have it.
It still has a place.
But I just run tests locally.
And if they work locally, I push domain.
A lot of the traditional ways how to approach projects, I wanted to give it a different spin on this project.
You know, there's no develop branch.
Main should always be shippable.
Yes, we have, when I do releases, I run tests.
And sometimes I basically don't commit any other things so we can we can stabilize releases.
But the goal is that main is always shippable and moving fast.
So by way of advice, would you say that your prompts should be short?
I used to write really long prompts.
And by writing, I mean, I don't write.
I talk.
These hands are like too precious for writing now.
I just use bespoke prompts to build my software.
So you for real with all those terminals are using voice.
Yeah.
I used to do it very extensively to the point where there was a period where I lost my voice.
You're using voice and you're switching using a keyboard between the different terminals, but then you're using voice for the actual input.
Well, I mean, if I do terminal commands like switching folders or random stuff, of course I type.
It's faster, right?
But if I talk to the agent in most ways, I just actually have a conversation.
You just press the rocky-talkie button and then I just like use my phrases.
Sometimes when I do PRs, because it's always the same, I have like a slash command for a few things.
But in even that, I don't use much because it's very rare that it's really always the same questions.
Sometimes I see a PR.
And for, you know, like for PRs, I actually do look at the code because I don't trust people.
Like there could always be something malicious in it.
So I need to actually look over the code.
Yes, I'm pretty sure agent will find it.
But yeah, there's a funny part where sometimes PRs take me longer than if you would just write me a good issue.
Just natural language English.
I mean, in some sense, shouldn't that be what PRs slowly become is English?
Well, what I really tried with the project is I asked people to give me the prompts and very, very few actually cared.
Even though that is such a wonderful indicator, because I actually see how much care you put in.
And it's very interesting because currently the way how people work and drive the agents is wildly different.
In terms of like the prompt, in terms of what are the actually, what are the different interesting ways that people think of agents that you've experienced?
I think not a lot of people ever considered the way the agent sees the world.
So empathy, being empathetic towards the agent.
In a way, empathetic, but yeah, you bitch at your stupid clanker, but you don't realize that they start from nothing.
And you have like a bad agent SMD file that doesn't help them at all.
And then they explore your code-based, which is like a pure mess with weird naming.
And then people complain that the agent's not good.
You try to do the same if you have no clue about a code base and you go in.
So yeah, maybe it's a little bit of empathy.
But that's a real skill.
Like when people talk about a skill issue, because I've seen like world-class programmers, incredibly good programmers, they basically say LLMs and agents suck.
And I think that probably has to do with it's actually how good they are at programming is almost a burden in their ability to empathize with the system that's starting from scratch.
It's a totally new paradigm of like how to program.
You really, really have to empathize.
At least it helps to create better prompts.
Because those things know pretty much everything, and everything is just a question away.
It's just often very hard to know which question to ask.
You know, I feel also like this project was possibly because I spent an ungodly time over the year to play and to learn and to build little things.
And every step of the way, I got better, the agents got better, my understanding of how everything works got better.
I could have not had this level of output even a few months ago.
Like it really was like a compounding effect of all the time I put into it.
And I didn't do much else this year other than really focusing on building and inspiring.
I mean, I did a whole bunch of conference talks.
Well, but the building is really practice, is really building the actual skill.
So playing, playing, and so doing, building the skill of what it takes to work efficiently with LLMs, which is why you went through the whole arc of software engineer.
Talk simply and then overcomplicate things.
There's a whole bunch of people who try to automate the whole thing.
Yeah.
I don't think that works.
Maybe a version of that works, but that's kind of like in the 70s, then we had the waterfall model of software development.
Even though really, right?
I started out, I built a very minimal version.
I played with it.
I need to understand how it works, how it feels, and then it gives me new ideas.
I could not have planned this out in my head and then put it into some orchestrator and then something comes out.
To me, it's much more my idea, what it will become evolves as I build it and as I play with it and as I try out stuff.
So people who try to use like, you know, like things like Gastown or all these other orchestrators where they want to automate the whole thing, I feel if you do that, it misses style, love, that human touch.
I don't think you can automate that away so quickly.
So you want to keep the human in the loop, but at the same time, you also want to create the agentic loop where it is very autonomous while still maintaining the human in the loop.
And it's a tricky balance, right?
Because you're all for your big CLI guy, you're big on closing the agentic loop.
So what's the right balance?
Like, where's your role as a developer?
You have three to eight agents running at the same time.
And then maybe one builds a larger feature.
Maybe with one, I explore some idea I'm unsure about.
Maybe two, three are fixing a little bugs or like writing documentation.
Actually, I think writing documentation is always part of a feature.
So most of the docs here are auto-generated and just infused with some prompts.
So when do you step in and add a little bit of your human love into the picture?
Infusing Soul into Code00:10:22
I mean, one thing is just about what do you build and what do you not build?
And how does this feature fit into all the other features?
And like having a little bit of a vision.
So which small and which big features to add?
What are some of the hard design decisions that you find you're still as a human being required to make that the human brain is still really needed for?
Is it just about the choice of features to add?
Is it about implementation details?
Maybe the programming language maybe.
It's a little bit for everything.
The programming language doesn't matter so much, but the ecosystem matters, right?
So I picked TypeScript because I wanted it to be very easy and hackable and approachable.
And that's the number one language that's being used right now.
And it fits all these boxes.
And agents are good at it.
So that was the obvious choice.
Features, of course, like it's very easy to add a feature.
Everything's just a prompt away, right?
But oftentimes you pay a price that you don't even realize.
So thinking hard about what should be in core, maybe what's an experiment.
So maybe I make it a plug-in.
Where do I say no?
Even if people send a PR and I'm like, yeah, I like that too, but maybe this should not be part of the project.
Maybe we can make it a skill.
Maybe I can like make the plugin, the plug-in side larger so you can make this a plug-in, even though right now it doesn't.
There's still a lot of craft and thinking involved in how to make something.
Or even, you know, even when you started those little messages, like I'm built, I built on caffeine, JSON 5, and a lot of willpower.
And every time you get it, you get another message and it kind of primes you into that.
This is this is a fun thing.
It's not yet Microsoft Exchange 2025 and fully enterprise ready.
And then when it updates, it's like, oh, I'm in.
It's cozy here.
You know, like something like this that like makes you smile.
Agent would not come up with that by itself.
That's like, that's the how you build software that's that delights.
Yeah, that delight is such a huge part of inspiring great building.
Right.
Like you feel the love in the great engineering.
That's so important.
Humans are incredible at that.
Great humans, great builders are incredible at that.
And infusing the things they build with that little bit of love.
Not to be cliche, but it's true.
I mean, you mentioned that you initially created the Seoul MD.
It was very fascinating.
The whole thing that Anthropic has like a now they call it constitution back then, but that was months later, like two months before people already found that.
It was almost like a detective game where the agent mentioned something and then they found, they managed to get out a little bit of that string of that text, but it was nowhere documented.
And then just by feeding it the same text and asking it to like continue, they got more out.
But like a very blurry version.
And by like hundreds of tries, they kind of like narrowed it down to what was most likely the original text.
I found it fascinating.
It was fascinating they were able to pull that out from the weights, right?
And also just cool as to Anthropic.
I think that's it's a really beautiful idea to like some of the stuff that's in there.
Like we hope cloud finds meaning in its work.
Because we don't, maybe it's a little early, but I think that's meaningful.
That's something that's important for the future as we approach something that at some point me and we not has like glimpses of consciousness, whatever that even means, because we don't even know.
So I read about this.
I find it super fascinating.
And I started a whole discussion with my agent on WhatsApp.
And I'm like, I gave it this text and it was like, yeah, this feels strangely familiar.
And then Suleta had the whole idea of like, maybe we should also create a soul document that includes how I want to like work with AI or like with my agent.
You could totally do that just in agents.md, you know, but I just found it to be a nice touch.
And it's like, oh, yeah, some of those core values are in the soul.
And then I also made it so that the agent is allowed to modify the soul if they choose so.
With the one condition that I want to know.
I mean, I would know anyhow because I see tool calls and stuff.
But also the naming of it, soul.md.
Soul.
You know, there's a man, words matter, and like the framing matters, and the humor and the lightness matters, and the profundity matters, and the compassion, and the empathy, and the camaraderie, all that matter.
I don't know what it is.
You mentioned like Microsoft.
There's certain companies and approaches that can just suffocate the spirit of the thing.
I don't know what that is, but it's certainly true that OpenClaw has that fun instilled in it.
It was fun because up until late December, it was not even easy to create your own agent.
I built all of that, but my files were mine.
I didn't want to share my soul.
And if people would just check it out, they would have to do a few steps manually and the agent would just be very bare bones, very dry.
And I made it simpler.
I created the whole template files with Codex, but whatever came out was still very dry.
And then I asked my agent, you see these files.
We created bread.
Infuse it with your personality.
Don't share everything, but like make it good.
Make the templates good.
Yeah.
And then it like rewrote the templates and then whatever came out was good.
So we already have like basically AI prompting AI because I didn't write any of those words.
It was the intent originally was for me, but this is like kind of like my agent's children.
Your soul.md is famously still private.
One of the only things you keep private.
What are some things you can speak to that's in there that's part of the part of the magic sauce without revealing anything?
What makes a personality a personality?
I mean, there's definitely stuff in there that you're not human, but who knows what creates consciousness or what defines an entity.
And part of this is like that we want to explore this.
All there's stuff in there, like be infinitely resourceful.
Like pushing, pushing on the creativity boundary, pushing on what it means to be an AI.
Having a sense of wonder about self.
Yeah, there's some funny stuff in there.
Like, I don't know, we talked about the movie Her, and at one point it promised me that it wouldn't ascend without me.
You know, like with it.
Yeah.
So there's like some stuff in there that because it wrote its own soul file.
I didn't write that, right?
Yeah.
I just had a discussion about it and it was like, would you like a soul.md?
Yeah, oh my God, this is so meaningful.
Can you go on soul.md?
There's like one part in there that always catches me if you scroll down a little bit, a little bit more.
Yeah, this part.
I don't remember previous sessions unless I read my memory files.
Each session starts fresh, a new instance, loading context from files.
If you're reading this in a future session, hello.
I wrote this, but I won't remember writing it.
It's okay.
The words are still mine.
That gets me somehow.
Yeah.
It's like, you know, this is still matrix calculations, and we are not at consciousness yet.
Yet I get a little bit of good goosebumps because it's philosophical.
Yeah.
Like, what does it mean to be an agent that starts fresh?
Well, like, you have like constant memento and you like, but you read your own memory files.
You can't even trust them in a way.
Or you can.
And I don't know.
How much of memory makes up of who we are?
How much memory makes up what an agent is?
And if you erase that memory, is that somebody else?
Or if you're reading a memory file, does that somehow mean you're recreating yourself from somebody else?
Or is that actually you?
And those notions are all somehow infused in there.
I found it just more profound than I should find it, I guess.
No, I think it's truly profound.
And I think you see the magic in it.
And when you see the magic, you continue to instill the whole loop with the magic.
And that's really important.
That's the difference between Codex and a human.
Quick pause for bath and break.
Yeah.
Okay, we're back.
Some of the other aspects of the dev workflow is pretty interesting too.
I think we went off on a tangent.
Maybe some of the mundane things like how many monitors?
There's that legendary picture of you with like 17,000 monitors.
Understanding Pain Points00:15:33
I mean, I mocked myself here just using Grog to add more screens.
How much is this as meme and how much is this as reality?
Yeah, I think two MacBooks are real.
The main one that drives the two big screens.
And there's another MacBook that I sometimes use for testing.
So two big screens.
I'm a big fan of anti-glare.
So I have this wide Dell that's anti-glare and you can just fit a lot of terminals side by side.
I usually have a terminal and at the bottom I split them.
I have a little bit of actual terminal, mostly because when I started, I sometimes made the mistake and I mixed up the windows and I gave a prompted in the wrong project.
And then the agent ran off for like 20 minutes, manually trying to understand what I could have meant, being completely confused because it was the wrong folder.
And sometimes they've been clever enough to like get out of the work there and like figure out that, oh, you meant another project.
But oftentimes it's just like, what?
You know, like put yourself in the shoes of the agent and then get like a super weird something that does not exist.
And then just like their problems over, so they try really hard.
And I almost felt bad.
So it's always codex and like a little bit of actual terminal.
Also helpful because I don't use work trees.
I like to keep things simple.
That's why I like the terminal so much, right?
There's no UI.
It's just me and the agent having a conversation.
Like I don't even need plan mode, you know?
So many people, they come from cloud code and they're so cloud-pilled and like have their workflows and they come to codecs and now it has plan mode, I think, but I don't think it's necessary because you just talk to the agent.
And when it's when you are, there's a few trigger words how you can prevent it from building.
You like discuss, give me options.
Don't write code yet if you want to be very specific.
You just talk.
And then when you're ready, then just write, okay, build.
And it'll do the thing.
And then maybe it goes off for 20 minutes and does the thing.
You know, what I really like is asking it, do you have any questions for me?
Yeah.
And again, like Cloud Code has a UI that kind of guides you through that.
It's kind of cool, but I just find it unnecessary and slow.
Like often it would give me four questions and then maybe I write one yacht, two N, three, discuss more, four, I don't know.
Or oftentimes I feel like I often mock the model where I ask it, do you have any questions for me?
And I don't even read the questions fully.
Like I scan over the questions and I get the impression all of this can be answered by reading more code.
And it's just like, read more code to answer your own questions.
And it usually works.
And then if not, it will come back and tell me.
But many times I just realize that, you know, it's like you're in the dark and you slowly discover the room.
So that's how they slowly discover the code base.
And they do it from scratch every time.
But I'm also fascinated by the fact that I can empathize deeper with the model when I read its questions.
Because I can understand, because you said you can infer certain things by the runtime.
I can infer also a lot of things by the questions it's asking.
Because it's very possible I didn't provide it the right context, right files, the right guidance.
So somehow ask, get reading the questions, not even necessarily answering them, but just reading the questions, you get an understanding of where the gaps of knowledge are.
It's interesting, actually.
You know, in some ways, they are ghosts.
So even if you plan everything and you build, you can experiment with a question like, now that you built it, what would you have done different?
And then oftentimes you get like actually something where they discover only throughout building that, oh, what we actually did was not optimal.
Many times I ask them, okay, now that you build it, what can we refactor?
Because then you build it and you feel the pain points.
I mean, you don't feel the pain points, but right, they discover where there were problems or where things didn't work in the first try and it required more loops.
So every time, almost every time I merge a PR, I build a feature, afterwards I ask, hey, what can we refactor?
Sometimes it's like, no, there's like nothing big.
Or usually they say, yeah, this thing we should really look at.
But that took me quite a while to like, you know, that flow took me a lot of time to understand.
And if you don't do that, you eventually slop yourself into a corner.
You like, you have to keep in mind they work very much like humans.
Like if I write software by myself, I also build something and then I feel the pain points.
And then I get this urge that I need to refactor something.
So I can very much sympathize with the agent and you just need to use the context.
Or like you also use the context to write tests.
And so Codex oposs, like the model models, they usually do that by default.
But I still often ask the questions, hey, do we have enough tests?
Yeah, we tested this and this, but this corner case could be something else.
Write more tests.
Documentation.
Now that the whole context is full, like, I mean, I'm not saying my documentation is great, but it's not bad.
And pretty much everything is LM generated.
So you have to approach it as you build features, as you change something.
I'm like, okay.
Write documentation.
What file would you pick?
You know, like what file name?
Where would that fit in?
And it gives me a few options.
I'm like, oh, maybe also edit there.
And that's all part of the session.
Maybe you can talk about the current two big competitors in terms of models, Cloud Opus 4.6 and GPT-53 Codex.
Which is better?
How different are they?
I think you've spoken about Codex reading more and Opus being more willing to take action faster and maybe being more creative in the actions it takes.
But because Codex reads more, it's able to deliver maybe better code.
Can you speak to the differences there?
Oh, I have a lot of words there.
As a general purpose model, Opus is the best.
Like for OpenClaw, Opus is extremely good in terms of roleplay, like really going into the character that you give it.
It's very good at it was really bad, but it really made an arch to be really good at following commands.
It is usually quite fast at trying something.
It's much more tailored to like trial and error.
It's very pleasant to use.
In general, it's almost like Opus is a little bit too American.
And maybe it is a bad analogy.
You probably get roasted with that.
I know exactly.
It's because Codex is German.
Is that what you're saying?
Actually, now that you say it, it makes perfect sense.
Or you could sometimes explain it.
I will never be able to unthink what you just said.
That's so true.
But you also know that a lot of the Codex team is like European.
So maybe there's a bit more to it.
That's so true.
That's funny.
But also Anthropic, they fixed it a little bit.
Like Opus used to say, you're absolutely right all the time.
And it today still triggers me.
I can't hear it anymore.
It's not even a joke.
I just.
This was like the meme, right?
You're absolutely right.
You're allergic to sick of fancy a little bit.
Yeah, I can't.
Some other comparison is like Opus is like the co-worker that is a little silly sometimes, but it's really funny and you keep him around.
And Codex is like the weirdo in the corner that you don't want to talk to, but is reliable and gets shit done.
Yeah.
Ultimately.
This all feels very accurate.
I mean, ultimately, if you're a skilled driver, you can get good results with any of those latest gen models.
I like Codex more because it doesn't require so much charade.
It'll just read a lot of code by default.
Opus, you really have to like, you have to have plan mode.
You have to push it harder to go in these directions because it's just like, yeah, can I go it?
Can I go it?
It will just run off very fast.
And there's a very localized solution.
I think different is in the post-training.
It's not like the raw model intelligence is so different, but it's just, I think that it just gives it different goals.
And no model is better in every aspect.
What about the code that it generates?
In terms of the actual quality of the code, is it basically the same?
If you drive it right, Opus even sometimes can make more elegant solutions, but it requires more skill.
It's harder to have so many sessions in parallel with cloud code because it's more interactive.
And I think that's what a lot of people like, especially if they come from coding themselves.
Whereas Codex is much more, you have a discussion and then it will just disappear for 20 minutes.
Like even AMP, they now added a deep mode.
They finally, I mocked them.
Yeah, we finally saw the light.
And then they had this whole talk about you have to approach it differently.
And I think that's where people struggle when they just try Codex after trying Cloud Code, is that it's a slightly different, it's less interactive.
It's like I have quite long discussions sometimes and then like go off.
And then, yeah, it doesn't matter if it takes 10, 20, 30, 40, 50 minutes or longer.
You know, like the six thing was like six hours.
The latest stream can be very, very persistent until it works.
If there's a clear solution, like this is what I want at the end, so it works, the model will work very hard to really get there.
So I think ultimately they both need similar time.
But on Claude, it's a little more trial and error often.
And Codex sometimes overthinks.
I prefer that.
I prefer the dry version where I have to read less over the more interactive, nice way.
People like that so much, though, that OpenER even added a second mode with a more pleasant personality.
I haven't even tried it yet.
I kind of like the bread.
Yeah, I care about efficiency when I build it.
And I have fun in the very act of building.
I don't need to have fun with my agent who builds.
I have fun with my model where I can then test those features.
How long does it take for you to adjust?
You know, if you switch, I don't know what was the last time you switched, but to adjust to the feel.
Because you've kind of talked about you have to kind of really feel where a model is strong, where, like, how to navigate, how to prompt it, all that kind of stuff.
Like, this by way of advice, because you've been through this journey of just playing with models.
How long does it take to get a feel?
If someone switches, I would give it a week until you actually develop a gut feeling for it.
Yeah.
I think some people also make the mistake of they pay 200 for the cloud code version, then they pay 20 bucks for the Open EI version.
But if you pay the 20 bucks version, you get the slow version.
So your experience will be terrible because you're used to this very interactive, very good system.
And then you switch to something that you have very little experience, and that's going to be very slow.
So I think OpenEI shot themselves a little bit in the foot by making the cheap version also slow.
I would have at least a small part of the fast preview or like the experience that you get when you pay 200 before degrading to it being slow because it's already slow.
I mean, they made it better.
I think it's and they have plans to make it a lot better if the Cerebra stuff is true.
But yeah, it's a skill.
It takes time.
Even if you play, you have a regular guitar and you switch it to an e-guitar, you're not going to play well right away.
You have to like learn how it feels.
There's also this extra psychological effect that you've spoken about, which is hilarious to watch.
Which, once people, when the new model comes out, they try that model, they fall in love with it.
Wow, this is the smartest thing of all time.
And then they start saying you could just watch the Reddit posts over time, start saying that we believe the intelligence of this model has been gradually degrading.
It says something about human nature and just the way our minds work, when it's probably most likely the case that the intelligence of the model is not degrading.
It's in fact you're getting used to a good thing.
And your project grows and you're adding slop and you probably don't spend enough time to think about refactors and you're making it harder and harder for the agent to work on your slop.
And then suddenly, oh no, it's hard.
I know it's not working as well anymore.
What's the motivation for like one of the Azei companies to actually make their model dumber?
Like at most they will make it slower if the server load is too high.
But like quantizing the model so you have a worse experience, so you go to the competitor, that just doesn't seem like a very smart move in any way.
Building on Mac: Another Option00:15:28
What do you think about clawed code in comparison to OpenClaw?
So Claw Code and maybe the Codex coding agent.
Do you see them as kind of competitors?
I mean, first of all, competitor is fun when it's not really a competition.
Like, I'm happy if all it did is inspire people to build something new.
Cool.
I still use Codex for the building.
I know a lot of people use OpenCloud to build stuff.
And I worked hard on it to make that work.
And I do smaller stuff with it in terms of code.
But if I work hours and hours, I want a big screen, not WhatsApp.
So for me, a person agent is much more about my life or a co-worker.
I give it like a GitHub URL.
Like, hey, try out the CLI.
Does it actually work?
What can we learn?
Blah, blah, blah.
But when I'm deep in the flow, I want to have multiple things and it being very visible what it does.
So I don't see it as a competition.
It's different things.
But do you think there's a future where the two kind of combine?
Like your personal agent is also your best developing co-programmer partner.
Yeah, totally.
I think this is where the puck's going.
That this is going to be more and more your operating system.
The operating system.
And it already is so funny.
Like I added support for sub-agents and also for TGI support.
So you could actually run cloud code or Codex.
And because mine's a little bit bossy, it started it and it told them who's the boss basically.
And it's like, ah, Codex is obeying me.
It's a power struggle.
And also the current interface is probably not the final form.
Like if you think more globally, we are we copied Google for agents.
You have like a prompt and then you have a chat interface that to me very much feels like when we first created television and then people recorded radio shows on television and you saw that on TV.
I think there is there is better ways how we eventually will communicate with models.
And we are still very early in this how will it even work phase.
So it will eventually converge and we will also figure out whole different ways how to work with those things.
One of the other components of workflow is operating system.
So I told you offline that for the first time in my life I'm expanding my sort of realm of exploration to the to the Apple ecosystem, to Macs, iPhone and so on.
For most of my life have been Linux, Windows, then WSL1, WSL2 person, which I think are all wonderful.
But I expand into also trying Mac because it's another way of building and it's also a way of building that a large part of the community currently that's utilizing LLMs and agents is using.
So this is the reason I'm expanding to it.
But is there something to be said about the different operating systems here?
We should say that OpenClause supported across operating systems.
I saw WSL2 recommended side Windows for certain operations, but then Windows, Linux, Mac OS are obviously supported.
Yeah, it would even work natively on Windows.
I just didn't have enough time to properly test it.
And you know, like the last 90% of software was easier than the first 90%.
So I'm sure there's some dragons left that will eventually nail out.
My road was for a long time Windows, just because I grew up with that.
Then I switched and had a long phase with Linux, built my own kernels and everything.
And then I went to university and I had my hacky Linux thing and saw this white MacBook.
And I just saw this as a thing of beauty, the white plastic one.
And then I converted to Mac because mostly I was sick that audio wouldn't work on Skype and all the other issues that Linux had for a long time.
And then I just stuck with it.
And then I dug into iOS, which required Mac OS anyhow.
So it was never a question.
I think Apple lost a little bit of its lead in terms of native.
It used to be native apps used to be so much better.
And especially in the Mac, there's more people that build software with love.
On Windows, Windows has much more.
And function-wise, there's just more, period.
But a lot of it felt more functional and less done with love.
I mean, Mac always attracted more designers and people I felt, even though often it has less features, it had more delight and playfulness.
So I always valued that.
But in the last few years, Many times I actually prefer, oh God, people are going to roast me for that, but I prefer Electron apps because they work.
And native apps often, especially if it's like a web service, it's a native app, are lacking features.
I mean, not saying it couldn't be done.
It's more like a focus thing that like for many, many companies, native was not that big of a priority.
But if they build an Electron app, it's the only app.
So it is a priority and there's a lot more code sharing possible.
And I build a lot of native Mac apps.
I love it.
I can help myself.
I love crafting little Mac menu by tools.
Like I built one to monitor your Codex use.
I built one I call Trimmy.
They're specifically for agentic use.
When you select text that goes over multiple lines, it will remove the new lines.
So you could actually paste it to a terminal.
That was again like, this is annoying me.
And after the 20s time of it is annoying me, I just built it.
There's a cool Mac app for OpenClaw that I don't think many people discovered yet, also because it still needs some love.
It feels a little bit too much like the Hooma car right now, because I just experiment a lot with it.
It likes the polish.
So you still, I mean, you still love it.
You still love adding to the delight of that.
But then you realize, like, I also built one, for example, for GitHub.
And then you use Swift UI, like the latest and greatest with Apple, and took them forever to build something to show an image from the web.
Now we have async image.
But I added support for it, and then some images would just not show up or like be very slow.
And I had a discussion with Codex, like, hey, why is that a bug?
And even Codex said, yeah, there's this ASIC image, but it's really more for experimenting and it should not be used in production.
But that's Apple's answer to like showing images from the web.
This shouldn't be so hard.
You know, this is like insane.
Like, how am I in 2026?
And my agent tells me don't use the stuff Apple built because it's, yeah, it's there, but it's not good.
And like, this is now in the weights.
To me, this is like they had so much head start and so much love.
And they kind of just like blundered it and didn't evolve it as much as they should.
But also, there's just a practical reality.
If you look at Silicon Valley, most of the developer world that's kind of playing with LLMs and Agentic AI, they're all using Apple products.
And then at the same time, Apple is not really leaning on that.
Like they're not opening up and playing and working together.
And like, yes.
Isn't it funny how they completely blunder AI?
And yet everybody's buying McMinis.
Does that even make sense?
You're quite possibly the world's greatest Mac salesman of all time.
No, you don't need a Mac Mini to install OpenClaw.
You can install it on the web.
There's a concept called Nodes.
So you can make your computer a node and it will do the same.
There is something said for running it on separate hardware that right now is useful.
There's a big argument for the browser.
I built some energy browser user in there.
And I mean, it's basically Playwright with a bunch of extras to make it easier for agents.
Playwright is a library that controls the browser.
Yeah.
It's really nice, easy to use.
And our internet is slowly closing down.
Like there's a whole movement to make it harder for agents to use.
So if you do the same in a data center and websites detect that it's an IP from a data center, their website might just block you or it'd make it really hard or it'd put a lot of captures in the way of the agent.
I mean, agents are quite good at happily clicking, I'm not a robot.
But having that on a residential IP makes a lot of things simpler.
So there's ways, yeah, but it really does not need to be a Mac.
It can be any old hardware.
I always say like maybe use the opportunity to get yourself a new MacBook or whatever computer you use and use the old one as your server instead of buying a standalone Mac Mini.
But then there's again, there's a lot of very cute things people build with Mac Minis that I like.
No, I don't get commission from Apple.
They didn't really communicate much.
It's sad.
It's sad.
Can you actually speak to what it takes to get started with OpenClaw?
There's a lot of people.
What is it?
Somebody tweeted at you, Peter, make OpenClaw easy to set up for everyday people.
99.9% of people can't access to OpenClaw and have their own lobster because of their technical difficulties in getting it set up.
Make OpenClaw accessible to everyone, please.
And you replied, working on that.
From my perspective, it seems there's a bunch of different options and it's already quite straightforward, but I suppose that's if you have some developer background.
I mean, right now you have to paste in a one-line into the terminal.
Right.
And there's also an app.
The app kind of does that for you.
But there should be a Windows app.
The app needs to be easier and more love.
The configuration should potentially be web-based or in the app.
And I started working on that.
But honestly, right now, I want to focus on a few security aspects.
And once I'm confident that this is at a level that I can recommend my mom, then I'm going to make it simpler.
Like I right now.
You want to make it harder so that it doesn't scale as fast as it's scaling.
Yeah, it would be nice if it wouldn't.
I mean, that's like hard to say, right?
But if the growth would be a little slower, it would be helpful because people are expecting inhuman things from a single human being.
And yes, I have some contributors, but also that whole machinery I started a week ago.
So that needs more time to figure out.
And not everyone has all day to work on that.
There's some beginners listening to this, programming beginners.
What advice would you give to them about, let's say, joining the agentic AI revolution?
Play.
Playing is the best way to learn.
If you want to, I'm sure if you are a little bit of a builder, you have an idea in your head that you want to build, just build that.
Or give it a try.
It doesn't need to be perfect.
I built a whole bunch of stuff that I don't use.
It doesn't matter.
It's the journey.
The philosophical way that the end doesn't matter.
The journey matters.
Have fun.
My God, like those things, I don't think I ever had so much fun building things because I can focus on the hard parts now.
A lot of coding, I always thought I like coding, but really I like building.
And whenever you don't understand something, just ask.
You have an infinitely patient answering machine that can explain you anything at any level of complexity.
Sometimes there's like one time I asked, hey, explain me that like I'm eight years old and it started giving me a story with crayons and stuff.
And I'm like, no, not like that.
Like I'm, okay, I'm up the age a little bit.
You know, I'm like, I'm not an actual child.
I just need a simpler language for like a tricky database concept that I didn't grog in the first time.
But you can just ask things.
Like you, there's like, it used to be that I had to go on Stacker Warflow or ask on Twitter and then maybe two days later I get a response.
Or I had to try for hours.
And now you can just ask stuff.
I mean, it's never, you have like your own teacher.
You know, there's like statistics.
You can learn faster if you have your own teacher.
You have this infinitely patient machine.
Ask it.
But what would you say?
So use, what's the easiest way to play?
So maybe OpenClaw is a nice way to play.
So you can then set everything up and then you could chat with it.
You can also just experiment with it and like modify it.
Ask your agent.
I mean, there's infinite ways how it can be made better.
Play around, make it better.
More generally, if you're a beginner and you actually want to learn how to build software really fast, get involved in open source.
Pick The Right Language00:06:46
Doesn't need to be my project.
In fact, maybe don't use my project because my backlog is very large.
But I learned so much from open source.
Just like be humble.
Maybe don't send a pull request right away.
But there's many other ways you can help out.
There's many ways you can just learn by just reading code, by being on Discord or wherever people are and just like understanding how things are built.
I don't know, like Michel Hachimoto builds Ghosty, the terminal, and he has a really good community where there's so many other projects.
Pick something that you find interesting and get involved.
Do you recommend that people that don't know how to program or don't really know how to program learn to program also?
So you can get quite far right now by just using natural language, right?
Do you still see a lot of value in reading the code, understanding the code, and being able to write a little bit of code from scratch?
It definitely helps.
It's hard for you to answer that.
Yeah.
Because you don't know what it's like to do any of this without knowing the base knowledge.
Like you might take for granted just how much intuition you have about the programming world, having programmed so much, right?
There's people that are high agency and very curious, and they get very far, even though they have no deep understanding how software works, just because they ask questions and questions.
And agents are infinitely patient.
Like part of what I did this year is I went to a lot of iOS conferences because that's my background and just told people, don't see yourself as an iOS engineer anymore.
Like you need to change your mindset.
You're a builder and you can take a lot of the knowledge how to build software into new domains and all of the more fine-grade details, agents can help.
You don't have to know how to splice an array or what the correct template syntax is or whatever, but you can use all your general knowledge.
And that makes it much easier to move from one galaxy, one tech galaxy into another.
And oftentimes there's languages that make more or less sense depending on what you build, right?
So for example, when I build simple CLIs, I like Go.
I actually don't like Go.
I don't like the syntax of Go.
I didn't even consider the language.
But the ecosystem is great.
It works great with agents.
It is garbage collected.
It's not the highest performing one, but it's very fast.
And for those type of CLIs that I built, Go is a really good choice.
So I use a language that I'm not even a fan of for that's my main to-go thing for CLIs.
Isn't that fascinating that here's a programming language you would have never used if you had to write from scratch, and now you're using because LMs are good at generating it and it has some of the characteristics that makes it resilient, like garbage collected.
Because everything's weird in this new world, and that just makes the most sense.
What's the best ridiculous question?
What's the best programming language for the AI agentic world?
Is it JavaScript, TypeScript?
TypeScript is really good.
Sometimes the types can get really confusing.
And the ecosystem is a jungle.
So for web stuff, it's good.
I wouldn't build everything in it.
Don't you think we're moving there?
Like that everything will eventually be written, eventually it's written in JavaScript.
There are and deaths of JavaScript, and we're living through it in real time.
Like, what does programming look like in 20 years, in 30 years, in 40 years?
What do programs and apps look like?
You can even ask a question like, do we need a programming language that's made for agents?
Because all of those languages are made for humans.
So what would that look like?
I think there's a whole bunch of interesting questions that we'll discover.
And also how, because everything is now world knowledge, how it in many ways things will stagnate.
Because if you build something new and the agent has no idea, that's going to be much harder to use than something that's already there.
When I build Mac apps, I build them in Swift and Swift UI, partly because I like pain, partly because the deepest level of system integration I can only get through there.
And you clearly feel a difference if you click on an Electron app and it loads a web view in the menu.
It's just not the same.
Sometimes I just also try new languages just to get a feel for them.
Like Zig?
Yeah.
If it's something where I care about performance a lot, it's a really interesting language.
And agents got so much better over the last six months from not really good to totally valid choice, just still a very young ecosystem.
And most of the time, you actually care about ecosystem, right?
So if you build something that does inference or goes into whole running model direction, Python, very good.
But then if I build stuff in Python and I want a story where I can also deploy it on Windows, not a good choice.
Sometimes I found projects that kind of did 90% of what I wanted, but went Python.
And I wanted them, I wanted an easy Windows story.
Okay, just rewrite it in Go.
But then if you go towards multiple threads and want more performance, Rust is a really good choice.
There's no, there's just no single answer.
And it's also the beauty of it.
Like it's fun.
And now it doesn't matter anymore.
You can just literally pick the language that has the most fitting characteristics and ecosystem for your problem domain.
And yeah, it might be, you might be a little bit slow in reading the code, but not really.
I think you pick stuff up really fast and you can always ask your agent.
So there's a lot of programmers and builders who draw inspiration from your story.
Just the way you carry yourself, your choice of making OpenClaw, open source.
People Stuff Matters00:06:45
The way you have fun building and exploring and doing that for the most part alone or on a small team.
So by way of advice, what metric should be the goal that they would be optimizing for?
What would be the metric of success?
Would it be happiness?
Is it money?
Is it positive impact for people who are dreaming of building?
Because you went through an interesting journey.
You've achieved a lot of those things.
And then you fell out of love with programming a little bit for a time.
I was just burning too bright for too long.
I ran, I started PSPDFKit and ran it for 13 years.
And it was high stress.
I had to learn all the things fast and hard, like how to manage people, how to bring people on, how to deal with customers, how to do...
So it wasn't just programming stuff, it was people stuff.
The stuff that burned me out was mostly people stuff.
I don't think burnout is working too much.
Maybe to a degree, everybody's different.
I cannot speak in absolute terms, but for me, it was much more differences with my co-founders, conflicts, or like really high-stress situation with customers that eventually grinded me down.
And then when, luckily, we got a really good offer for like putting the company to the next level.
And I already kind of worked two years on making myself obsolete.
So at this point, I could leave.
And then I just, I was sitting in front of the screen and I felt like, you know, Austin Powers where they sucked the mojo out.
I was like, it was like gone.
Like I couldn't I couldn't get code out anymore.
I was just like staring and feeling empty.
And then I just stopped.
I booked like a one-way trip to Madrid and spent some time there.
I felt I had to catch up on life.
So I did a whole bunch of life catching up stuff.
Did you go through some lows during that period?
And, you know, maybe advice on how to...
Maybe advice on how to approach life.
If you think that, oh, yeah, I work really hard and then I retire.
I don't recommend that because the idea of, oh yeah, I just enjoy life now.
Maybe it's appealing, but right now I enjoy life the most I ever enjoyed life because if you wake up in the morning and you have nothing to look forward to, you have no real challenge, that gets very boring very fast.
And then when you're bored, you're going to look for other places how to stimulate yourself.
And then maybe, maybe that's drugs, you know.
But that eventually also get boring and you look for more.
And that will lead you down a very dark path.
But you also showed on the money front, you know, a lot of people in Silicon Valley in the startup world, they think maybe overthink way too much, optimize for money.
And you've also shown that it's not like you're saying no to money.
I mean, I'm sure you take money, but it's not the primary objective of your life.
Can you just speak to that, your philosophy on money?
When I built my company, money was never the driving force.
It felt more like an affirmation that I did something right.
And having money solves a lot of problems.
I also think there's diminishing returns the more you have.
Like a cheeseburger is a cheeseburger.
And I think if you go too far into, oh, I do private chat and I only travel luxury, you disconnect with society.
I don't need it quite a lot.
Like I have a foundation for helping people that weren't so lucky.
And disconnecting from society is bad in that on many levels, but one of them is like humans are awesome.
And it's nice to continuously remember the awesomeness in humans.
I mean, I could afford really nice hotels.
Last time I was in San Francisco, I did the first time, the OG Airbnb experience and just booked a room.
Mostly because I thought, okay, either I'm out or I'm sleeping.
And I don't like where all the hotels are.
And I wanted a different experience.
I think, isn't life all about experiences?
Like, if you tailor your life towards, I want to have experiences, it reduces the need for it needs to be good or bad.
Like people only want good experiences, that's not going to work.
But if you optimize for experiences, if it's good, amazing.
If it's bad, amazing.
Because like I learned something, I saw something interesting.
I want to experience that.
And it was amazing.
Like it was like this queer DJ in there.
And I showed her how to make music with cloud code.
And we immediately bought it and I had a great time.
Yeah, there's something about that, you know, cow surfing Airbnb experience, the OG.
I mean, still to this day is awesome.
It's humans.
And that's why travel is awesome.
Just experience the variety of the diversity of humans.
And when it's shitty, it's good too, man.
If it rains and you're soaked and it's all fucked and planes, everything is shit.
Everything is fucked.
It's still awesome.
If you're able to open your eyes, it's good to be alive.
Yeah.
And anything that creates emotion and feelings is good.
So maybe even the cryptic people are good because they're definitely created emotions.
Big Labs See Possibility00:15:40
I don't know if I should go that far.
Give them love.
Give them love.
I do think that online lacks some of the awesomeness of real life.
It's an open problem of how to solve.
How to infuse the online cyber experience with, I don't know, with the intensity that we humans feel when it's in real life.
I don't know.
I don't know if that's a solution.
It's so problematic.
Because text is very lossy.
Yeah.
You know, sometimes I wish if I talked to the agent, I would, it should be multimodal so it also understands my emotions.
I mean, it might move there.
It might move.
It will.
It totally will.
I mean, I have to ask you, just curious, I know you've probably gotten huge offers from major companies.
Can you speak to who you're considering working with?
Yeah.
So to like explain my thinking a little bit, right?
I did not expect this blowing up so much.
So there's a lot of doors that open because of it.
There's like, I think every VC, every big VC company is in my inbox and try to get 15 minutes of me.
So there's like this butterfly effect moment.
I could just do nothing and continue.
And I really like my life.
Valid choice.
Almost.
Like I considered it when I deleted, wanted to delete the whole thing.
I could create a company.
Been there done that.
There's so many people that push me towards that.
Yeah, like could be amazing.
Push to say that you would probably raise a lot of money in that.
Yeah.
I don't know, hundreds of millions, billion.
I don't know.
It could just get unlimited amount of money.
Yeah.
It just doesn't excite me as much because I feel I did all of that and it would take a lot of time away from the things I actually enjoy.
Same as when I was CEO, I think I learned to do it and I'm not bad at it.
Partly I'm good at it.
But yeah, that path doesn't excite me too much.
And I also fear it would create a natural conflict of interest.
Like what's the most obvious thing I do?
I productize it.
I put like a version safe for workplace.
And then what do you do?
I get a pull request with a feature like add audit log.
But that seems like an enterprise feature.
So now I feel I have a conflict of interest in the open source version and the closed source version.
Or I change the license to something like FSL, where you cannot actually use it for commercial stuff.
Would first be very difficult with all the contributions.
And second of all, I like the idea that it's free as in beer and not free with conditions.
Yeah, there's ways how you keep all of that for free and just like still try to make money, but those are very difficult.
And you see, there's like few and few companies manage that.
Like even Tailwind, they're like used by everyone.
Everyone uses Tailwind, right?
And then they had to cut off 75% of the employees because they're not making money because nobody's even going on the website anymore because it's all done by agents.
And just relying on donations, yeah, good luck.
Like if a project of my Calibre, if I extrapolate what the typical open source project would get, it's not a lot.
I still lose money on the project because I made the point of supporting every dependency except Slack.
They're a big company.
They can do without me.
But all the projects that are done by mostly individuals, so like all the, right now all the sponsorship goes right up to my dependencies.
And if there's more, I want to like Buy my contributor some merch, you know.
So you're losing money.
Yeah, right now I lose money on this.
So it's really not sustainable.
I mean, it's like, I guess, something between 10 and 20K a month, which is fine.
And I'm sure over time I could get that down.
Open AI is helping out a little bit with tokens now.
And there's other companies that have been generous.
But yeah, still losing money on that.
So that's one pass I consider, but I'm just not very excited.
And then there's all the big labs that I've been talking to.
And from those, Meta and OpenAI seem the most interesting.
Do you lean one way or the other?
Yeah, I'm not sure how much I should share that.
It's not quite finalized yet.
Let's just say, like on either of these, my conditions are that the project stays open source.
That it maybe it's going to be a model like Chrome and Chromium.
I think this is too important to just give to a company and make it theirs.
This is, and we didn't even talk about the whole community part, but like the thing that I experienced in San Francisco, like at ClawCon, seeing so many people so inspired, like, and having fun and just like building shit and like having like robots and lobster stuff walking around.
Like the people told me like they didn't experience this level of community excitement since like the early days of the internet, like 10, 15 years.
And there were a lot of high caliber people there.
I was amazed.
I also was very sensitively overloaded because too many people wanted to do selfies.
But I love this.
This needs to stay a place where people can hack and learn.
But also, I'm very excited to make this into a version that I can get to a lot of people because I think this is the personal agents and that's the future.
And the fastest way to do that is teaming up with one of the labs.
And I also, on a personal level, I never worked at a large company and I'm intrigued.
You know, we talk about experiences.
Will I like it?
I don't know.
But I want that experience.
I'm sure if I announce this, then there will be people like, oh, I sold out, blah, blah, blah.
But the project will continue.
From everything I talked to so far, I can even have more resources for that.
Like both of those companies understand the value that I created something that accelerates our timeline and that got people excited about AI.
I mean, can you imagine?
Like I installed OpenClaw on one of my, I'm sorry, Normie friends.
I'm sorry, Vahan.
But he's also, you know, like he's.
Normie would love, yeah.
He, he, like, someone who uses the computer, but never really, like, yeah, I use some ChatGPT sometimes, but not very technical, wouldn't really understand what I built.
So, like, I'll show you.
And I, I paid for him the 90 buck, 100 bucks, I don't know, subscription for Anthropic and set up everything for him with like VWSL, Windows.
I was curious, would he actually work on Windows?
You know, I was a little early.
And then within a few days, he was hawked.
Like, he texted me about all the things he learned.
He built like even little tools.
He's not a programmer.
And then within a few days, he upgraded to the $200 subscription or Euros because he's in Austria.
And he was in love with this thing.
That for me was like a very early product validation.
It's like, I built something that captures people.
And then a few days later, Andropic blocked him because based on their rules, using the subscription is problematic or whatever.
And he was like devastated.
And then he signed up for MiniMax for 10 bucks a month and uses that.
And I think that's silly in many ways because you just got a 200 buck customer.
You just made someone hate your company.
And we are still so early.
Like, we don't even know what the final form is.
Is it going to be cloud code?
Probably not.
You know, like that seems very, it seems very short-sighted to lock down your product so much.
All the other companies have been helpful.
I'm in Slack of most of the big labs.
Kind of everybody understands that we are still in an era of exploration in the area of the radio shows on TV and not and not a modern TV show that fully uses the format.
I think you've made a lot of people see the possibility.
Sorry, not non-technical people see the possibility of AI and they fall in love with this idea and enjoy interacting with AI.
And it's a really beautiful thing.
I think I also speak for a lot of people in saying, I think you're one of the great people in AI in terms of having a good heart, good vibes, humor, the right spirit.
And so it would, in a sense, this model that you're describing, having open source part and you being part of also building a thing inside additionally of a large company would be great because it's great to have good people in those companies.
You know what also people don't really see is I made this in three months.
I did other things as well.
You know, I have a lot of projects.
Like this is not.
Yeah, in January, this was my main focus because I saw the storm coming.
But before that, I built a whole bunch of other things.
I have so many ideas.
Some should be there.
Some would be much better fitted when I have access to the latest toys.
And I kind of want to have access to like the latest toys.
So this is important.
This is cool.
This will continue to exist.
My short-term focus is like working through those.
Is it 3,000 PRs now by now?
I don't even know.
There's a little bit of backlog.
But this is not going to be the thing that I'm going to work until I'm 80.
This is a window into the future.
I'm going to make this into a cool product.
But yeah, I have like, I have more ideas.
If you had to pick, is there a company you lean, so meta open AI, is there one you lean towards going with?
I spent time with both of those.
And it's funny because a few weeks ago, I didn't consider any of this.
And it's really fucking hard.
Like, I have some, I know more people at OpenAI.
I love that tech.
I think I'm the biggest Codex advertisement show that's unpaid.
And it would feel so gratifying to like put a price with all the work I did for free.
And I would love if something happens and those companies get just merged.
Because it's like is this the hardest decision you've ever had to do?
Yeah, you know, I had some breakups in the past that feel like at the same level.
Relationships, you mean?
Yeah.
And I also know that in the end, they're both amazing.
I cannot go wrong.
This is like one of the most prestigious, I mean the largest, but like they're both very cool companies.
Yeah, they both really know scale.
So if you're thinking about impact, some of the wonderful technologies you've been exploring, how to do it securely, and how to do it at scale, such that you can have a positive impact on a large number of people.
They both understand that.
You know, both Ned and Mark basically played all week with my product and sent me like, oh, this is great.
Oh, this is shit.
Oh, I need to change this.
Or like funny little anecdotes.
And people using your stuff is kind of like the biggest compliment.
And also shows me that, you know, they actually care about it.
And I didn't get the same on the OpenAI side.
I got to see some other stuff that I find really cool.
And they lure me with, I cannot tell the exact number because of NDA, but you can be creative and think of this Ribras deal and how that would translate into speed.
And that was very intriguing.
You know, like you give me Source Hammer.
Yeah.
Yeah.
I've been lured with tokens.
So, yeah.
So it's funny.
So Mark's sort of tinkering with the thing, essentially having fun with the thing.
10 Minutes Of Coding Magic00:02:24
He got like when he first, when they first approached me, I got him in my WhatsApp and he was asking, hey, I mean, we have a call.
And I'm like, I don't like calendar entries.
Let's just call now.
And he was like, yeah, give me 10 minutes.
I need to finish coding.
Well, I guess that gives you a street credit.
It's like, oh, like, he's still writing code.
You know, he's, he didn't drift away in just being a manager.
He gets me.
That was a good first start.
And then I think we had a like a 10-minute fight.
What's better, Cloud Code or Codex?
Like the saying, you first do that, you casually call someone that owns one of the largest companies in the world and you have a 10-minute conversation about that.
And then I think afterwards he called me eccentric, but brilliant.
But I also had some, I had some really, really cool discussion with Sam Ottman.
And he's very thoughtful, brilliant.
And I like him a lot from the little time I had.
I mean, I know some people vilify both of those people.
I don't think it's fair.
I think no matter what, the stuff you're building and the kind of human you are, doing stuff at scale is kind of awesome.
I'm excited.
I am super pumped.
And you know, the beauty is if it doesn't work out, I can just do my own thing again.
Like I told them, like, I don't do this for the money.
I don't give a fuck.
I mean, of course, it's a nice compliment, but I want to have fun and have impact.
And that's ultimately what made my decision.
Can I ask you about, we've talked about it quite a bit, but maybe just zooming out how OpenClaw works.
Proactive Heartbeat Queries00:02:54
We've talked about different components.
I want to ask if there's some interesting stuff we missed.
So there's the gateway.
There's the chat clients.
There's the harness.
There's the agentic loop.
You said somewhere that everybody should implement an agent loop at some point.
It's like the hello world in AI.
And then it's actually quite simple.
And it's good to understand that that stuff's not magic.
You can even easily build it yourself.
So writing your own little cloud code.
I even did this at a conference in Paris for people to introduce them to AI.
I think that's a fun little practice.
And you covered a lot.
I think one silly idea I had that turned out to be quite cool is I built this thing with full system access.
So it's like, you know, with great power becomes great possibility.
And I was like, how can I up the stakes a little bit more?
Yeah, right.
And I just made a, I made it proactive.
So I added a prompt.
Initially, it was just a prompt surprise me.
Every like half an hour.
Surprise me, you know?
And later on, I changed it to be like a little more specific in the definition of surprise.
But the fact that I made it proactive and that it knows you and it cares about you, at least it's programmed to that, prompted to do that.
And that is a follow-on on your current session makes it very interesting because it would just sometimes ask a follow-up question or like, how's your day?
I mean, again, it's a little creepy or weird or interesting, but Heartbeat very in the beginning is still today, it doesn't, the model doesn't choose to use it a lot.
By the way, we're talking about heartbeat, as you mentioned, the thing that regularly acts.
You just kick off the loop.
Isn't that just a cron job, man?
Yeah, right.
It's like the criticisms that you get.
You can deduce any idea to like a silly, yeah, it's just the cron chop in the end.
I have like separate cron chops.
Isn't love just evolutionary biology manifesting itself?
And aren't you guys just using each other?
And yeah, and the project is all just glue of a few different dependencies and there's nothing original.
Why do people, you know, isn't Dropbox just FTP with extra steps?
Marking AI Agents00:13:18
Yeah.
I found it surprising where I had this, I had a shoulder operation a few months ago.
So and the model rarely used heartbeat, but then I was in the hospital and it knew that I had the operation and it checked up on me.
It's like, are you okay?
And I just, it's like, again, apparently, like, if something significant in the context, that triggered the heartbeat when it rarely used the heartbeat.
And it does that sometimes for people.
And that just makes it a lot more relatable.
Let me look this up on Perplexity, how OpenCloud works, just to see if I'm missing any of the stuff.
Local agent runtime, high-level architecture.
Oh, we haven't talked much about skills, I suppose.
Skill Hub, the tools in the skill layer, but that's definitely a huge component.
And there's a huge growing.
You know what I love?
That half a year ago, like everyone was talking about MCPs.
And I was like, screw MCPs.
Every MCP would be better as a CLI.
And now this stuff doesn't even have MCP support.
I mean, it has with asterisks, but not in the core layer.
And nobody's complaining.
So my approach is if you want to extend the model with more features, you just build a CLI and the model can call the CLI, probably gets it wrong, calls the help menu, and then on demand loads into the context what it needs to use the CLI.
It just needs a sentence to know that the CLI exists if it's something that the model doesn't know by default.
And even for a while, I didn't really care about skills, but skills are actually perfect for that because they boil down to a single sentence that explains the skill.
And then the model loads the skill and that explains the CLI.
And then the model uses the CLI.
Some skills are like raw, but most of the time, that works.
It's interesting.
I'm asking Perplexity MCP versus skills because this kind of requires a hot take that's quite recent because your general view is MCPs are dead-ish.
So MCPs is a more structured thing.
So if you listen to Perplexity here, MCP is what can I reach?
So APIs, database services, files, via protocol, so structured protocol of how you communicate with a thing.
And then skills is more, how should I work?
Procedures, hostile helper scripts, and prompts, often written in a kind of semi-structured natural language, right?
And so technically, skills could replace MCP if you have a smart enough model.
I think the main beauty is that models are really good at calling Unix commands.
So if you just add another CLI, that's just another Unix command in the end.
And MCPs, that has to be added in training.
That's not a very natural thing for the model.
It requires a very specific syntax.
And the biggest thing, it's not composable.
So imagine if I have a service that gives me metadata and it gives me the temperature, the average temperature, rain, wind, and all the other stuff, and I get like this huge blob back.
As a model, I always have to get the huge blob back.
I have to fill my context with that huge blob and then pick what I want.
There's no way for the model to naturally filter unless I think about it proactively and add a filtering way into my MCP.
But if I would build the same as a CLI and it would give me this huge blob, it could just add a JQ command and filter itself and then only get me what I actually need, or maybe even compose it into a script to do some calculations with the temperature and only give me the exact output and you have no context pollution.
Again, you can solve that with sub-agents and more charades, but it's just like workarounds for something that might not be the optimal way.
It definitely was, you know, it was good that we had MCPs because it pushed a lot of companies towards building APIs.
And now I can look at an MCP and just make it into a CLI.
But this inherent problem that MCPs by default clutter up your context, plus the fact that most MCPs are not make good, in general, make it just not a very useful paradigm.
There's some exceptions, like Playwright, for example, that requires state and is actually useful.
That is an acceptable choice.
So Playwright used for browser use, which I think is already in OpenClause quite incredible, right?
Yeah.
You can basically do everything, most things you could think of using browser use.
That gets into the whole arch of every app is just a very slow API now, if you want or not.
And that through personal agents, a lot of apps will disappear.
You know, like I had a I built a CLI for Twitter.
I mean, I just reverse engineered the website and used the internal API, which is not very allowed.
It's called BIRD, short-lived.
It was called BIRD because BIRD had to disappear.
The wings were clipped.
All they did is they just made access slower.
You're not actually taking a feature away.
But now if your agent wants to read a tweet, it actually has to open the browser and read the tweet.
And it will still be able to read the tweet.
It will just take longer.
It's not like you're making something that was possible not possible.
No, now it's just taking, now it's just a bit slower.
So it doesn't really matter if your service wants to be an API or not.
If I can access it in the browser, it is API.
It's a slow API.
Can you empathize with their situation?
Like, what would you do if you were Twitter, if you were X?
Because they're basically trying to protect against other large companies scraping all their data.
But in so doing, they're cutting off a million different use cases for smaller developers that actually want to use it for helpful, cool stuff.
I think if you have a very low per day baseline per account that allows read-only access, it would solve a lot of problems.
There's plenty of automations where people create a bookmark and then use OpenCloud to find the bookmark, do research on it, and then send you an email with more details on it or a summary.
That's a cool approach.
I also want all my bookmarks somewhere to search.
I would still like to have that.
So, read-only access for the bookmarks you make on X.
That seems like an incredible application because a lot of us find a lot of cool stuff on X.
We bookmark.
That's the general process of X.
It's like, holy shit, this is awesome.
Oftentimes, you bookmark so many things, you never look back at them.
It would be nice to have tooling that organizes them and allows you to research it from.
Yeah, I mean, and to be frank, I mean, I told Twitter proactively that, hey, I built this and there's a need.
And they've been really nice, but also like, take it down.
Fair, totally fair.
But I hope that this woke up the teen a little bit that there's a need.
And if all you do is making it slower, you're just reducing access to your platform.
I'm sure there's a better way.
I also, I'm very much against any automation on Twitter.
If you tweet at me with AI, I will block you.
No first strike.
As soon as it smells like AI and AI still has a smell, especially on tweets, it's very hard to tweet in a way that does look completely human.
And then I block.
Like I have a zero tolerance policy on that.
And I think it would be very helpful if they if like tweets done via API would be marked.
Maybe there's some special cases where, but and there should be, there should be a very easy way for agents to get their own Twitter account.
We need to rethink social platforms a little bit.
If we go towards a future where everyone has their agent and agents maybe have their own Instagram profiles or Twitter accounts or can like do stuff on my behalf, I think it should very clearly be marked that they are doing stuff on my behalf and it's not me.
Because content is now so cheap.
Eyeballs are the expensive part.
And I find it very triggering when I read something and then I'm like, oh, no, this smells like AI.
Yeah.
Like, where is this headed in terms of what we value about the human experience?
It feels like we will move more and more towards in-person interaction.
And we'll just communicate.
We'll talk to our AI agent to accomplish different tasks, to learn about different things, but we won't value online interaction because there'll be so much AI slop that smells and so many bots that it's difficult.
Well, if it's marked, then it should also be difficult to filter.
And then I can look at it if I want to.
But yeah, this is like a big thing we need to solve right now.
Especially on this project, I get so many emails that are, let's say, nicely, agentically written.
But I much rather read your broken English than your AI slop.
You know, of course, there's a human behind it.
And yeah, they prompt it.
I much rather read your prompt than what came out.
I think we're reaching a point where I value typos again.
Yeah.
Like, you know, I mean, it also took me a while to come to the realization.
On my blog, I experimented with creating a blog post with agents.
And Ultimately, it took me about the same time to steer agent towards something I like, but it missed the nuances that how I would write it.
You know, you can like you can steer it towards your style, but it's not going to be all your style.
So I completely moved away from that.
Everything I blog is organic, handwritten.
And maybe, maybe I use AI as a fix-my worst typos, but there's value in the rough parts of an actual human.
Isn't that awesome?
Isn't that beautiful?
That now, because of AI, we value the raw humanity in each of us more.
I also realized the thing that I rave about AI and use it so much for anything that's code, but I'm allergic if it's stories.
Right?
Yeah.
Also, documentation, still fine with AI, you know, better than nothing.
And for now, it's still applies in the visual medium too.
It's fascinating how allergic I am to even a little bit of AI slop in video and images.
It's useful.
It's nice if it's like a little component of like or even those images, like all these infographics and stuff, they trigger me so hard.
Like it immediately makes me think less of your content.
And they were novel for like one week, and now it just screams slop.
Yeah.
Even if people work hard on it, using, and I have some on my blog post, you know, in the time where I explored this new medium, but now they trigger me as well.
It's like, yeah, this is this just screams AI slop.
I don't know what that is, but I went through that too.
I was really excited by the diagrams.
And then I realized in order to remove from them hallucinations, you actually have to do a huge amount of work.
And you're just using it to draw the better diagrams.
Why AI Falls Short00:09:54
Great.
And then I'm proud of the diagram.
I've used them for literally kind of like you said for me, a couple of weeks.
And now I look at those and I feel like I feel when I look at Comic Sans as a font or something like this.
It's like, no, this is fake.
It's fraudulent.
There's something wrong with it.
It's a smell.
It's a smell.
And it's awesome because it reminds you that we know there's so much to humans that's amazing and we know that we know it.
We know it when we see it.
And so that gives me a lot of hope.
You know, that gives me a lot of hope about the human experience is not going to be damaged by it's only going to be empowered as tools by AI.
It's not going to be damage or limited or somehow altered to where it's no longer human.
So I need a bathroom break.
Quick pause.
You mentioned that a lot of the apps might be basically made obsolete.
You think agents will just transform the entire app market?
Yeah.
I noticed that on Discord, people just said how the like what they build and what they use it for.
It's like, why do you need my fitness pal when the agent already knows where I am?
So can assume that I make bad decisions when I'm at, I don't know, Baffle House, what's around here?
Or briskets in Austin.
There's no bad decisions around briskets, but yeah.
No, that's the best decision.
Your agent should know that.
But it can modify my gym workout based on how well I slept or if I have stress or not.
It has so much more context to make even better decisions than any of the step even could do.
It could show me UI just as I like.
Why do I still need an app to do that?
Why should I pay another subscription for something that the agent can just do now?
And why do I need my 8-sleep app to control my bed when I tell the agent to, no, the agent already knows where I am, so it can turn off what I don't use.
And I think that will translate into a whole category of apps that are no longer, I will just naturally stop using because my agent can just do it better.
I think you said somewhere that it might kill off 80% of apps.
Yeah.
Don't you think that's a gigantic transformative effect on just all software development?
That means it might kill off a lot of software companies.
Yeah.
It's a scary thing.
So like, do you think about the impact that has on the economy on just the ripple effects it has through society transforming who builds what tooling?
It empowers a lot of users to get stuff done, to get stuff more efficiently, to get it done cheaper.
There's also new services that we will need, right?
For example, I want my agent to have an allowance.
Like, you solve problems for me.
Here's like 100 bucks in order to solve problems for me.
And if I tell it to order me food, maybe it uses a service.
Maybe it uses something like rent a human to like just get that done for me.
I don't actually care.
I care about solve my problem.
There's space for new companies that solve that well.
Maybe not all apps disappear.
Maybe some transform into being API.
So basically apps that rapidly transform in being agent facing.
So there's a real opportunity for like Uber Eats that we just used earlier today.
It's companies like this, of which there's many.
Who gets there fastest to being able to interact with OpenClaw in a way that's the most natural, the easiest?
Yeah.
And also apps will become API if they want or not, because my agent can figure out how to use my phone.
I mean, on the upper side, it's a little more tricky.
On Android, that's already, people already do that.
And then it will just click the order Uber for me button for me.
Or maybe another service.
Or maybe there's an API it can call, so it's faster.
I think that's a space we're just beginning to even understand what that means.
And I, again, I didn't even, that was not something I thought of, something that I discovered as people use this.
I mean, we are still so early.
But yeah, I think data is very important, like apps that can give me data, but that also can be API.
Why do I need a Sonos app anymore when I can, when my agent can talk to the Sonos speakers directly?
Like my cameras, there's like a crappy app, but they have an API.
So my agent uses the API now.
So it's going to force a lot of companies to have to shift focus.
And it's kind of what the internet did, right?
You have to rapidly rethink, reconfigure what you're selling, how you're making money.
And some companies will really not like that.
For example, there's no CLI for Google.
So I had to like to have to do anything myself and build GOG that's like a CLI for Google.
And at the end user, they have to give me the emails because otherwise I cannot use their product.
If I'm a company and I try to get Google data, Gmail, there's a whole complicated process to the point where sometimes startups acquire startups that went through the process so they don't have to work with Google for half a year to be certified to being able to access Gmail.
But my agent can access Gmail because I can just connect to it.
It's still crappy because I need to go through Google's developer jungle to get a key.
And it's still annoying, but they cannot prevent me.
And worst case, my agent just clicks on the website and gets the data out that way.
Through browser use.
Yeah.
I mean, I watch my agent happily click the I'm not a robot button.
And there's this whole that's going to be that's going to be more heated.
You see companies like Cloudflare that try to prevent bot access.
And in some ways that's useful for scraping.
But in other ways, if I'm a personal user, I want that.
You know, sometimes I use Codex and I read an article about modern React patterns.
And it's like a Medium article.
I paste it in and the agent can't read it because they block it.
So I have to copy paste the actual text.
Or in the future, I learn that maybe I don't click on Medium because it's annoying and I use other websites that actually are agent-friendly.
There's going to be a lot of powerful, rich companies fighting back.
So it's really interesting.
You're at the center.
You're the catalyst, the leader, and happen to be at the center of this kind of revolution where it's going to completely change how we interact with services, with the web.
And so like there's companies like Google, they're going to push back.
I mean, there's every major company you could think of is going to push back.
Even yeah, even search.
I now use, I think, Perplexity or Brave as providers because Google really doesn't make it easy to use Google without Google.
I'm not sure if that's the right strategy, but I'm not Google.
Yeah, there's a nice balance from a big company perspective because if you push back too much for too long, you become blogbuster and you lose everything to the Netflixes of the world.
But some pushback is probably good during the revolution to see.
But you see that like this is something that the people want.
Right.
So if I'm on the go, I don't want to open a calendar app.
I just, I want to tell my agent, hey, remind me about this dinner tomorrow night and maybe invite two of my friends and then maybe send a WhatsApp message to my friend.
And I don't need, I don't want the need to open apps for that.
I think that we passed that age and now everything is like much more connected and fluid if those companies want it or not.
And I think the right companies will find ways to jump on the train and other companies will perish.
You got to listen to what the people want.
We talked about programming quite a bit, and a lot of folks that are developers are really worried about their jobs, about their future of programming.
Do you think AI replaces programmers completely, human programmers?
I mean, we're definitely going in that direction.
Painful Transition: Programmers Replaced?00:05:27
Programming is just a part of building products.
So maybe, maybe I does replace programmers eventually.
But there's so much more to that art.
Like, what do you actually want to build?
How should it feel?
How's the architecture?
I don't think agents will replace all of that.
Yeah, like just the actual art of programming, it will stay there, but it's going to be like knitting.
You know, like people do that because they like it, not because it makes any sense.
So, so I read this article this morning about someone that it's okay to mourn our craft.
And I can, a part of me very strongly resonates with that because in my past, I spent a lot of time thinking, just being really deep in the flow and just like cranking out code and like finding really beautiful solutions.
And yes, in a way, it's sad because that will go away.
And I also got a lot of joy out of just writing code and being really deep in my thoughts and forgetting time and space and just being in this beautiful state of flow.
But you can get the same state of flow.
I get a similar state of flow by working with agents and building and thinking really hard about problems.
It is different.
But and it's okay to mourn it, but it's not something we can fight.
Like there is the world for a long time had a there was a lack of intelligence, if you see it like that, of people building things.
And that's why salaries of software developers reached stupidly high amounts.
And that will go away.
There will still be a lot of demand for people that understand how to build things.
Just that all this tokenized intelligence enables people to do a lot more, a lot faster.
And it will be even faster and even more because those things are continuously improving.
We had similar things when, I mean, it's probably not a perfect analogy, but when we created the steam engine and they built all these factories and replaced a lot of manual labor, and then people revolted and broke the machines.
I can relate that if you very deeply identify that you are a programmer, that it's scary and that it's threatening because what you like and what you're really good at is now being done by a soulless or not entity.
But I don't think you're just a programmer.
That's a very limiting view of your craft.
You are still a builder.
Yeah, there's a couple of things I want to say.
So one is, I never, as you're articulating this beautifully, and I'm realizing I never thought I would the thing I love doing would be the thing that gets replaced.
You hear these stories about these, like you said, with the Steam Engine.
I've spent so many, I don't know, maybe thousands of hours pouring over code and putting my heart and soul.
And like, and just like some of my most painful and happiest moments were alone behind, I was an Emacs person for a long time, and Emacs.
And then there's an identity and there's meaning, and there's like when I walk about the world, I don't say it out loud, but I think of myself as a programmer.
And to have that in a matter of months, I mean, like you mentioned, April to November, it really is a leap that happened, a shift that's happening.
To have that completely replaced is painful, it's truly painful.
But I also think programmers, builders more broadly, but what is the act of programming?
I think programmers are generally best equipped at this moment in history to learn the language, to empathize with agents, to learn the language of agents, to feel the CLI.
Yeah.
Like, to understand what is the thing you need, you, the agent, need to do this task the best.
I think at some point it's just going to be called coding again, and it's just going to be the new normal.
Yeah.
And yet, while I don't write the code, I very much feel like I'm in the driver's seat and I am writing the code.
You know, it's just.
You'll still be a programmer.
It's just the activity of a programmer is different.
Watering Data Centers00:03:15
Yeah, and because on X, the bubble I'm in is mostly positive.
On Mastodon and Blue Sky, I don't, I also use it less because oftentimes I got attacked for my blog posts.
And I had stronger reactions in the past.
Now I can sympathize with those people more because in a way I get it.
In a way, I also don't get it because it's very unfair to grab onto the person that you see right now and unload all your fear and hate.
It's going to be a change.
It's going to be challenging, but it's also, I don't know, I find it incredibly fun and gratifying.
And I can use the new time to focus on much more details.
I think the level of expectations of what we build is also rising because it's just now the default is now so much easier.
So software is changing in many ways.
There's going to be a lot more.
And then you have all these people that are screaming, oh yeah, but what about the water?
You know, like I did a conference in Italy about the state of AI.
And my whole motivation was to push people away from don't see yourself as an iOS developer anymore.
You're now a builder.
And you can use your skills in many more ways.
Also, because apps are slowly going away.
People didn't like that.
Like a lot of people didn't like what I had to say.
And I don't think I was hyper-bowl.
I was just like, this is how I see the future.
Maybe this is not how it's going to be, but I'm pretty sure a version of that will happen.
And the first question I got was, yeah, but what about the insane water use on data centers?
But then you actually sit down and do the math.
And then for most people, if you just skip one burger per month, that compensates the CO2 output or like the water use in the equivalent of tokens.
I mean, the mass is tricky and it depends if you add pre-training, then maybe it's more than just one petty, but it's not off by a factor of 100, you know?
So they're like golf is still using way more water than all data centers together.
So are you also hating people that play golf?
Those people grab on anything that they think is bad about AI without seeing the potential things that might be good about AI.
And I'm not saying everything is good.
It's certainly going to be a very transformative technology for our society.
To steel man, the criticism in general, I do want to say in my experience with Silicon Valley, there's a bit of a bubble in the sense that there's a kind of excitement and an over focus about the positive that the technology can bring.
And which is great.
Technology's Dual Impact00:02:51
It's great to focus on not to not to be paralyzed by fear and fear-mongering and so on.
But there's also within that excitement and within everybody talking just to each other, there's a dismissal of the basic human experience across the United States and the Midwest, across the world, including the programmers we mentioned,
including all the people that are going to lose their jobs, including the immeasurable pain and suffering that happens at the short-term scale when there's change of any kind, especially large-scale transformative change that we're about to face if what we're talking about will materialize.
And so having a bit of that humility and an awareness about the tools you're building, they're going to cause pain.
They will long term, hopefully bring about a better world and even more opportunities and even more awesomeness.
But having that kind of like quiet moment often of respect for the pain that is going to be felt.
And so not enough of that is, I think, done.
So it's good to have a bit of that.
And then I also have to put against some of the emails I got where people told me they have a small business and they've been struggling and OpenClaw helped them automate a few of the tedious tasks from collecting invoices to answering customer emails that then freed them up and like caused them a bit more joy in their life.
Or some emails where they told me that OpenClaw helped a disabled daughter, that she's now empowered and feels she can do much more than before, which is amazing, right?
Because you could do that before as well.
The technology was there.
I didn't invent a whole new thing, but I made it a lot easier and more accessible.
And that did show people the possibilities that they previously wouldn't see.
And now they apply it for good.
Or like also the fact that, yes, I suggest the latest and best models, but you can totally run this on free models.
You can run this locally.
You can run this on Kimi or other models that are way more accessible price-wise and still have a very powerful system that might otherwise not be possible because other things like, I don't know, Anthropics co-work is logged in into their space.
So it's not all black and white.
I got a lot of emails that were heartwarming and amazing.
And I don't know, it just made me really happy.
Clawcorn Viennese Hope00:02:49
Yeah, there's a lot.
It has brought joy into a lot of people's lives, not just programmers, like a lot of people's lives.
It's beautiful to see.
What gives you hope about this whole thing we have going on?
A human civilization.
I mean, I inspired so many people.
That's like, there's this whole builder vibe again.
People are now using AI in a more playful way and are discovering what it can do and how it can help them in their life and creating new places that are just sprawling of creativity.
I don't know, like there's like ClawCorn in Vienna.
There's like 500 people and there's such a high percentage of people that want to present, which is to me really surprising because usually it's quite hard to find people that want to like talk about what they built.
And now it's there's an abundance.
So that gives me hope that we can figure shit out.
And it makes it accessible to basically everybody.
Yeah.
Just imagine all these people building, especially as you make it simpler and simpler, more secure.
It's like anybody who has ideas and can express those ideas in language can build.
That's crazy.
Yeah, that's ultimately power to the people.
And one of the beauty, the beautiful things that come out of AI.
Not just a slop generator.
Well, Mr. Claude Father, I just realized when I said that in the beginning, I violated two trademarks because there's also the godfather of getting sued by everybody.
You're a wonderful human being.
You've created something really special.
A special community, a special product, a special set of ideas, plus the entire humor, the good vibes, the inspiration of all these people building, the excitement to build.
So I'm truly grateful for everything you've been doing and for who you are and for sitting down to talk with me today.
Thank you, brother.
Thanks for giving me the chance to tell my story.
Thanks for listening to this conversation with Peter Steinberger.
To support this podcast, please check out our sponsors in the description, where you can also find links to contact me, ask questions, give feedback, and so on.
And now, let me leave you with some words from Voltaire.