Currently, I’m exploring foundation models and cognitive architectures at Julep. Not just for what they can do, but for what they can teach us about our own mysterious minds.
https://julep.ai https://diwank.space
Feel free to write to me at hi@diwank.space
- 5 points
- 125 points
- 3 points
- 2 points
- 1 point
- 2 points
- 3 points
- 2 points
- 6 points
- 4 points
- 2 points
- 55 points
- 3 points
- Google's response:
"Read our statement on today’s decision in the case involving Google Search."
https://blog.google/outreach-initiatives/public-policy/doj-s...
- 3 points
- 2 points
- Agreed. The fact that it has any structure at all is fascinating (and super pretty). Could hint at interesting internal structures. I would love to see a version for Qwen-3 and Mistral too!
I wonder if being trained on significant amounts of synthetic data gave it any unique characteristics.
- also ettin is a new favorite and a solid alternative: https://huggingface.co/jhu-clsp/ettin-encoder-1b
I'd encourage you to give setfit a try, along with aggressively deduplicating your training set, finding the top ~2500 clusters per label, and using setfit to train a multilabel classifier on that.
Either way- would love to know what worked for you! :)
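To make the dedup-then-cluster step a bit more concrete, here is a rough, stdlib-only sketch. Everything in it is an assumption for illustration: the function names, the token-overlap (Jaccard) similarity, the greedy clustering, and the 0.6 threshold are stand-ins, not what setfit or any particular pipeline does. In practice you would probably cluster on sentence embeddings instead, and the selected examples would then be passed to setfit for training, which is left out here. For multilabel data, a text would simply appear once per label it carries.

```python
import re
from collections import defaultdict


def normalize(text):
    """Lowercase, drop punctuation, and collapse whitespace so trivial variants match."""
    return " ".join(re.sub(r"[^a-z0-9\s]", " ", text.lower()).split())


def jaccard(a, b):
    """Token-set overlap in [0, 1]."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def select_training_set(examples, threshold=0.6, max_clusters_per_label=2500):
    """Take (text, label) pairs and return one representative per cluster, per label.

    Hypothetical helper: the threshold and the cap are illustrative defaults.
    """
    # Step 1: exact deduplication on normalized text, per label.
    seen = set()
    by_label = defaultdict(list)
    for text, label in examples:
        key = (normalize(text), label)
        if key in seen:
            continue
        seen.add(key)
        by_label[label].append(text)

    selected = []
    for label, texts in by_label.items():
        # Step 2: greedy clustering. Each cluster is [representative, tokens, size].
        # This is quadratic in the worst case, so it only suits small datasets.
        clusters = []
        for text in texts:
            tokens = set(normalize(text).split())
            for cluster in clusters:
                if jaccard(tokens, cluster[1]) >= threshold:
                    cluster[2] += 1
                    break
            else:
                clusters.append([text, tokens, 1])
        # Step 3: keep the largest clusters first, up to the cap.
        clusters.sort(key=lambda c: c[2], reverse=True)
        selected.extend((c[0], label) for c in clusters[:max_clusters_per_label])
    return selected
```

Lowering `threshold` merges more near-duplicates into the same cluster; raising it keeps more variants as separate training examples.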
- It's coming soon! I think this experiment has really taught me a lot about the limits of agentic code assistants: the stuff they're good at, they're insanely good at, and the stuff they're horrible at, they cannot seem to overcome. I did write a little bit about how I use Claude Code [1] a while back, before I started this project, and I'm planning to finish a sequel pretty soon.
^[1]: https://diwank.space/field-notes-from-shipping-real-code-wit...
- yup. I started a fully autonomous, 100% vibe coded side project called steadytext, mostly expecting it to hit a wall, with LLMs eventually struggling to maintain it or fix any non-trivial bug in it. turns out I was wrong. not only has Claude Opus been able to write up a pretty complex 7k LoC project with a Python library, a CLI, _and_ a Postgres extension, it also actively maintains it and is able to fix filed issues and implement feature requests entirely on its own. It is completely vibe coded, I have never even looked at 90% of the code in that repo. it has full test coverage, passes CI, and we use it in production!
granted- it needs careful planning for CLAUDE.md, and all issues and feature requests need a lot of in-depth specifics, but it all works. so I am not 100% convinced by this piece. I'd say it's def not easy to get coding agents to manage and write software effectively, and especially hard to do so in existing projects, but my experience has been across that entire spectrum. I have been sorely disappointed in coding agents and even abandoned a bunch of projects and dozens of pull requests, but I have also seen them work.
you can check out that project here: https://github.com/julep-ai/steadytext/
- 1 point
- 2 points
- 2 points
- 2 points
https://github.blog/changelog/2025-12-18-github-copilot-now-...