I must have been a little too ambitious with my first test with Claude Code.

I asked it to refactor a medium-sized Python project to remove duplicated code by introducing a dependency injection mechanism. That refactor is not really straightforward: it involves multiple files, and it should remain possible for different files to use different dependencies.
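(To give an idea of the target shape, with made-up names rather than the real project code: instead of every module building its own dependency, the dependency gets passed in, so different entry points can wire in different implementations.)

    # Hypothetical sketch, not the actual project: the duplicated setup moves
    # behind a small interface, and callers inject whichever implementation they need.

    class Storage:
        def save(self, key: str, data: bytes) -> None:
            raise NotImplementedError

    class LocalStorage(Storage):
        def save(self, key: str, data: bytes) -> None:
            with open(key, "wb") as f:
                f.write(data)

    class ReportExporter:
        def __init__(self, storage: Storage) -> None:
            # The dependency is injected instead of being constructed here,
            # so the setup logic is no longer duplicated in every module.
            self.storage = storage

        def export(self, name: str, payload: bytes) -> None:
            self.storage.save(name, payload)

    # One file can use ReportExporter(LocalStorage()), another can inject a
    # different Storage subclass, without touching ReportExporter itself.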

Anyway, I explain the problem in a few lines and ask for a plan of what to do.

At first I was extremely impressed: it automatically ran commands to read the files and gave me a plan of what to do. It seemed to understand the issue perfectly and even proposed some other changes that looked like great ideas.

So I just asked it to proceed and make the changes, and it started creating folders and new files, editing files, and even running some tests.

I was dumbfounded; it seemed incredible. I did not expect it to work on the first try, as I already had some experience with AI making mistakes, but it seemed like magic.

Then, once it was done, the tests (which covered 100% of the code) no longer passed.

No problem: I isolate a few failing tests and ask Claude Code to fix them, and it does.

I repeat this a few times, finding failing tests and asking it to fix them, slowly trying to clean up the mess, until I reach a test with a small problem: it succeeded (with pytest) but froze at the end of the test.
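(For anyone who hasn't hit that failure mode: a common cause is something the test leaves running, e.g. a non-daemon thread, so pytest reports a pass but the process never exits. A made-up illustration, not the actual test:)

    import threading
    import time

    def test_worker_reports_alive():
        # The assertion passes, so pytest marks the test green...
        worker = threading.Thread(target=time.sleep, args=(3600,))
        worker.start()
        assert worker.is_alive()
        # ...but the non-daemon thread keeps the interpreter alive,
        # so the run hangs after the results are printed.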

I ask Claude Code again to fix it and it tries to add code to solve the issue, but nothing works now. Each time it adds some bullshit code and each time it fails, adding more and more code to try to fix and understand the issue.

Finally, after $7.50 spent and 2000+ lines of code changed, it's still not working, and I don't know why, since I didn't make the changes myself.

As you know, it's easier to write code than to read code, so in the end I decided to scrap everything and do all the changes myself little by little, checking that the tests keep succeeding as I go along. I did follow some of the recommended changes it proposed, though.

Next time I'll start with something easier.



Really, you nearly got the correct approach there.

I generally follow the same approach these days: ask it to develop a plan, then execute, but importantly have it execute each step in increments as small as possible and do a proper code review of each step. Ask it for the changes you want it to make.

There are certainly times I need to do it myself, but this has definitely improved my productivity to some degree.

It's just pretty tedious, so I generally write a lot of the "fun" code myself, and almost always do the POC myself, then have the AI do the "boring" stuff that I know how to do but really don't want to do.

Same with docs: the modern reasoning models are very good at docs, and when guided toward a decent style they can really produce good copy. Honestly, R1/4o are the first AI I would actually consider pulling into my workflow, since they make fewer mistakes and actually help more than they harm. They still need to be babysat, though, as you noticed with Claude.


> ...do all the changes myself little by little, checking that the tests keep succeeding as I go along.

Or... you can do that with the robots instead?

I tried that with the last generation of Claude, only adding new functionality when the previously added functionality was complete, and it did a very good job. Well, Claude for writing the code and Deepseek-R1 for debugging.

Then I tried a more involved project with apparently too many moving parts for the stupid robots to keep track of, and they failed miserably. Mostly Claude failed, since that's where the code was being produced; I can't really say whether Deepseek would've fared any better, because the usage limits didn't let me experiment as much.

Now that I have an idea of their limitations and have had them successfully shave a couple of yaks, I feel pretty confident about setting them to work on a project I've been wanting to do for a while.


I'm curious about the follow-up post from Yegge, because this post is worthless without one. Great, Claude Code seems to be churning out bug fixes. Let's see if it actually passes tests, deploys, and works as expected in production for a few days, if not weeks, before we celebrate.


He posts a few times a year at https://sourcegraph.com/blog


I'm wondering if you can prompt it to work like this: make minimal changes, and run the tests at each step to make sure the code is still working.


This thing can "fix" tests, not code. It just adjusts the tests to match the incorrect code. So you need to keep an eye on the test code as well. That sounds crazy, of course. You have to constantly keep in mind that the LLM doesn't understand what it is doing.


git commit after each change it makes. It will eventually get itself into a mess. Revert to the last good state and tell it to try a different approach. Squash your commits at the end.
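In shell terms (standard git commands, adapt to your setup):

    # after each change from the assistant that still passes the tests
    git add -A && git commit -m "wip: describe the step"

    # when it digs itself into a hole, throw that attempt away
    git reset --hard <last-good-commit>

    # once everything works, squash the wip commits into one
    git rebase -i <commit-before-the-wip-series>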



