Agents in production fail unpredictably, and you only know when customers complain. We find root causes and help you fix what matters, turning issues into reliable agent behaviour.
Feel free to reach out: robert@moyai.ai https://moyai.ai
- "Because there’s too much you need to feed into it" - what does the author mean by this? If it is the amount of data, then I would say sampling needs to be implemented. If that's the extent of the information required from the agent builder, I agree that an LLM-as-a-judge e2e eval setup is necessary.
In general, a more generic eval setup is needed, with minimal requirements from AI engineers, if we want to move on from vibes-based reliability engineering practices as a sector.
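A rough sketch of what sampling in front of an LLM-as-a-judge could look like. Everything here is hypothetical: the trace shape, the `rate` parameter, and the `judge` callable, which stands in for a real model call and is stubbed out so the example runs on its own.

```python
import random
from typing import Callable


def sample_traces(traces: list[dict], rate: float, seed: int = 0) -> list[dict]:
    """Pick a reproducible random subset of traces to send to the judge."""
    if not traces:
        return []
    rng = random.Random(seed)
    k = max(1, round(len(traces) * rate))  # always judge at least one trace
    return rng.sample(traces, k)


def pass_rate(traces: list[dict], judge: Callable[[dict], bool], rate: float = 0.1) -> float:
    """Fraction of sampled traces the judge accepts."""
    sampled = sample_traces(traces, rate)
    if not sampled:
        return 0.0
    return sum(judge(t) for t in sampled) / len(sampled)


# Stub judge (assumption): a real setup would call an LLM here.
def stub_judge(trace: dict) -> bool:
    return trace["ok"]


traces = [{"id": i, "ok": True} for i in range(100)]
score = pass_rate(traces, stub_judge, rate=0.1)
```

The fixed seed keeps the sample stable between runs, so a change in the score reflects a change in the agent rather than in the sample.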
- Just another example of using money to scale your way out of a problem. What you don't understand is hard to optimize. I like how you have solved this by acting as a smart router in between that first understands what to optimize and then actually implements that optimization.
- Moyai | Amsterdam, The Netherlands | FULL-TIME | ONSITE | Founding AI Engineer | 60k-80k + Equity
About the company:
• Technical founding team with relevant industry experience (part of Nvidia Inception Program)
• Backed by well-known European VCs (SpeedInvest + Galion.exe)
• Building infrastructure-less agentic evals and agent-as-a-judge monitoring
Ideal candidate:
• Demonstrated shipping ability with past projects & roles
• "Young and hungry" mindset, prioritising ability to learn with agency over experience
• Familiar with fine-tuning algorithms and frameworks, transformers/trl, ART, verl and unsloth
• Bonus points for experience in contributing to open-source projects, startups, AI agents, & similar technologies
Reach out to the founder directly: https://www.linkedin.com/in/rhommes/ or visit our website https://moyai.ai
- My personal opinion is that true engineering, which revolves around turning complex theory into working practice, has fallen from grace. Why spend a lot of time trying to master the art of engineering if you can ride the wave of engineering services and get away with it?
In true hacker spirit, I don't think trying to train a model on a wonky GPU is something that needs an ROI for the individual engineer. It's something they do because they yearn to acquire knowledge.
- Moyai | Amsterdam, The Netherlands | FULL-TIME | ONSITE | Founding AI Engineer | 60k-80k + Equity
About the company:
• Technical founding team with relevant industry experience (part of Nvidia Inception Program)
• Backed by well-known European VCs (SpeedInvest + Galion.exe)
• Building agentic advanced analytics and fine-tuning analytical reasoning models
Ideal candidate:
• Demonstrated shipping ability with past projects & roles
• "Young and hungry" mindset, prioritising ability to learn with agency over experience
• Familiar with fine-tuning algorithms and frameworks, transformers/trl, ART, verl and unsloth
• Bonus points for experience in contributing to open-source projects, startups, AI agents, & similar technologies
Reach out to the founder directly: https://www.linkedin.com/in/rhommes/
- Awesome to see the Cursor Airweave example!
- Sad to see the "change world GDP" mantra didn't trickle down to the people doing the actual plumbing.
- Nice, great learnings. Storybook FTW
- Moyai | Amsterdam, The Netherlands | Full Time | Onsite | Founding AI Engineer | 60k-80k + Equity
About the company:
• Technical founding team with relevant industry experience (part of Nvidia Inception Program)
• Backed by well-known European VCs (SpeedInvest + Galion.exe)
• Building agentic advanced analytics and fine-tuning analytical reasoning models
Ideal candidate:
• Demonstrated shipping ability with past projects & roles
• "Young and hungry" mindset, prioritising ability to learn with agency over experience
• Familiar with fine-tuning algorithms and frameworks, transformers/trl, ART, verl and unsloth
• Bonus points for experience in contributing to open-source projects, startups, AI agents, & similar technologies
Reach out to the founder directly: https://www.linkedin.com/in/rhommes/
- The only thing that is unclear to me is to what extent this setup depends on the package publisher. PyPI might be terrible, but at least it just works when you want to publish. That it leads to more complexity for those looking to use this piece of free software is not the maintainer's problem.
Maybe they are only targeting dev tooling companies as a way to simplify how they distribute. Especially in the accelerated compute era.
- Founding AI Engineer at AI Tech Startup (Amsterdam, The Netherlands)
About the company:
• Technical founding team with relevant industry experience
• Backed by well-known European VCs
• Building agentic advanced analytics and fine-tuning analytical reasoning models
Ideal candidate:
• Demonstrated shipping ability with past projects & roles
• “Young and hungry” mindset, prioritizing ability to learn with agency over experience
• Familiar with fine-tuning algorithms and frameworks, transformers/trl, verl and unsloth
• Bonus points for experience in contributing to open-source projects, startups, AI agents, & similar technologies
Help us push the boundaries of what's possible with analytical reasoning AI. Reach out to the founder directly: https://www.linkedin.com/in/rhommes/
- Typical dark pattern with your mystery boxes and player skins. Claude Code just makes it easier to tell yourself that it's OK to play.
- Here I feel that having the right tools is going to prevent this kind of drift from happening.
Double-entry bookkeeping makes sure the mutations are sound, and the floating-point error is solved by this simple trick: https://www.hackerneue.com/item?id=21687430.
All major fintechs use these two tricks to prevent them from fundamentally screwing up. Believe me, I worked at a really big one ;)
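A minimal sketch of both ideas together. It assumes the floating-point fix is to store amounts as integer minor units (cents), which is a common approach but may not be exactly what the linked comment describes. The `Entry` and `Ledger` names are made up for illustration.

```python
from collections import defaultdict
from dataclasses import dataclass, field


@dataclass(frozen=True)
class Entry:
    account: str
    amount_cents: int  # integer minor units; positive = debit, negative = credit


@dataclass
class Ledger:
    balances: dict = field(default_factory=lambda: defaultdict(int))

    def post(self, entries: list[Entry]) -> None:
        # Double-entry invariant: debits and credits must cancel out.
        if sum(e.amount_cents for e in entries) != 0:
            raise ValueError("unbalanced transaction")
        for e in entries:
            self.balances[e.account] += e.amount_cents


ledger = Ledger()
ledger.post([Entry("cash", 1010), Entry("revenue", -1010)])

# Why integers: binary floats cannot represent most decimal fractions exactly.
float_drift = (0.1 + 0.2) != 0.3   # True with IEEE 754 doubles
int_exact = (10 + 20) == 30        # integer cents add exactly
```

Because an unbalanced transaction is rejected before any balance changes, the sum over all accounts stays at zero, which is the property that catches drift early.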
- Sandboxing is all I have to say. It's good to build guardrails to ask the LLM politely to not screw up. It's better to put firewalls in place that simply prevent it from happening.
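To make the guardrail versus firewall distinction concrete, here is a small sketch where the check lives in code rather than in the prompt. The tool names and the `run_tool` helper are hypothetical, and an allowlist like this is not a full sandbox (there is no process or network isolation); it only shows enforcement the model cannot talk its way around.

```python
from typing import Callable


class ToolNotAllowed(Exception):
    """Raised when the model requests a tool outside the allowlist."""


# Only these tools can ever run, whatever the model asks for.
ALLOWED_TOOLS: dict[str, Callable[..., str]] = {
    "read_file": lambda path: f"(contents of {path})",  # stub implementation
}


def run_tool(name: str, **kwargs) -> str:
    # Hard check in code: no prompt wording can bypass this branch.
    if name not in ALLOWED_TOOLS:
        raise ToolNotAllowed(name)
    return ALLOWED_TOOLS[name](**kwargs)


result = run_tool("read_file", path="notes.txt")
```

A request such as `run_tool("delete_file", path="notes.txt")` raises `ToolNotAllowed` instead of depending on the model choosing to behave.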
- Underlying mechanism here might be preventing model collapse
- Love the old-school Microsoft interface. It feels like a familiar sight when my system is failing.
- Congrats!
About the company:
• Technical founding team with relevant industry experience
• Backed by well-known European VCs (SpeedInvest + Galion.exe)
• Cloud native & freedom to shape our tech stack (TypeScript + Python)
Ideal candidate:
• Previous experience at start-up building tech at scale
• Thinks in terms of product functionality and customer demands not just features
• Familiar with API first practices and frameworks
• Bonus points if you are an ex-founder or have been first hire before
Moyai is an AI-powered agent monitoring tool for AI engineers looking to catch agent failures in production. Reach out to the founder directly: https://www.linkedin.com/in/rhommes/ or visit our website https://moyai.ai
*No agencies or recruiters, and we are unable to provide visa sponsorship