150
points
suchintan
Joined 594 karma
Founder of Skyvern - YC S23
We help companies automate workflows in the web using our open source browser automation tool
Check us out here: https://github.com/Skyvern-AI/Skyvern
- suchintanThat makes a lot of sense. Sometimes it's easier to leave the baggage behind. It's too bad..selenium is a masterpiece. Thanks for sharing it with the world
- This makes sense. I guess I wanted to understand why starting from scratch was better than "fixing" selenium, but perhaps "fixing" selenium isn't an option?
- This is very cool. We were thinking about doing something very similar with Skyvern
What was the reason you went down this path instead of extending selenium with AI features?
- You're right, but this is where the LLMs are especially useful. Our customers all prompt it to terminate if it doesn't have the right information / the pre submission confirmation doesn't match
- What are some of the risks? This is a public web form available on the IRS website
- Send me an email suchintan@skyvern.com - we can get you started
- No, we don't have a lot of usage in that direction. People mainly use us to log into websites and either fill out forms or download files!
- Unrelated, but thoughtful gave us some very very helpful feedback early in our journey. We are big fans!
- That's the dream
- Definitely. What are your thoughts on the CloudFlare agent identity
- It's funny, one time we had a customer that wanted to use us to test their website for bugs..
Skyvern kept suggesting improvements unrelated to the issue they were testing for
- We do have them! We are HIPAA compliant, have soc-2 type 2 and offer self hosted deployments
- This is really cool. We might integrate this into Skyvern actually - we've been looking for a faster HTML extraction engine
Thanks for sharing!
- I have a 2yo and it's been surreal watching her learn the world. It deeply resembles how LLMs learn and think. Crazy
- Yeah, reverse engineering APIs is another fantastic approach. They aren't enough if you are dealing with wizards (eg typeform), but they can work really well
- IF you can use crawlers, definitely do.
They aren't enough for anything that's login-protected, or requires interacting with wizards (eg JS, downloading files, etc)
- I think they're complementary, and that's the direction we're headed.
We can ask the vision based models to output why they are doing what they are doing, and fallback to code-based approaches for subsequent runs
- We can do it remotely with Skyvern if you're interested