Have you tried using it? Not trying to be flippant or annoying, just curious whether you tried it and what the results were.
Why should he put effort into measuring a tool that the author hasn't? The point is that there are so many of these tools that an objective measure, one the creators could benchmark against each other, would be better.
So a better question to ask is: do you have any ideas for an objective way to measure the performance of agentic coding tools, so we can truly determine what improves performance and what doesn't?
I would hope that, internally, OpenAI and Anthropic use something similar to the harness/test cases they use for training their full models to determine whether changes to Claude Code result in better performance.
Well, if I were Microsoft and training Copilot, I would log all the <restore checkpoint> user actions and grade the agents on that. At scale, across all users, "resets per agent command" should be a useful metric. But then again, publishing the true numbers might be embarrassing.
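For what it's worth, a minimal sketch of how that ratio could be computed from an event log. Everything here is hypothetical: the event names `agent_command` and `restore_checkpoint` and the log shape are assumptions for illustration, not anything Copilot actually records.

```python
from collections import Counter

# Hypothetical event log: (user_id, event_type) pairs, where "agent_command"
# means the user asked the agent to do something and "restore_checkpoint"
# means the user rolled the workspace back afterwards.
events = [
    ("u1", "agent_command"),
    ("u1", "restore_checkpoint"),
    ("u1", "agent_command"),
    ("u2", "agent_command"),
    ("u2", "agent_command"),
    ("u2", "restore_checkpoint"),
]

def resets_per_agent_command(events):
    """Ratio of checkpoint restores to agent commands, aggregated over all users."""
    counts = Counter(event_type for _, event_type in events)
    commands = counts["agent_command"]
    restores = counts["restore_checkpoint"]
    return restores / commands if commands else 0.0

print(f"resets per agent command: {resets_per_agent_command(events):.2f}")
```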
I'm not sure it's a good signal.
I often restore a conversation checkpoint after successfully completing a side quest.
Who has time to try this when there's this huge backlog here: https://www.reddit.com/r/ClaudeAI/search/?q=memory
Have you tried any of those?
Yes, they haven't helped. Have you found one that works for you?
What are you both looking for? What is the problem you want solved?
Is a series of postings all in the form of questions an indication that somebody hooked ELIZA up as an input device?
Nah, just another one of those spam bots on all the small-business, finance and tradies sub-reddits: "Hey fellow users, have you ever suffered from <use case>? What is the problem you want solved? Tell me your honest opinions below!"
It does nothing but send a bunch of data to an "alpha, use at your own risk" third-party site that may or may not run some LLM on your data: https://ensue-network.ai/login