
ArcHound
Joined 492 karma
Hello, I write my blog at blog.miloslavhomer.cz

  1. No. The factory must grow.
  2. Can we please wait till at least Q2 with another historic event? I'm getting tired.
  3. Gotcha. Yes, with just a VPS you have to do a lot of busywork to get online - DNS, reverse proxy, docker, dev environment, DB setup and others.

    I'd still recommend starting with SQLite, seems that by skipping a DB service you can save quite a few bucks.

  4. Thanks for the reply, you are right, I missed the threshold on my first read. While I am still sad I can see the reasons for it. Guess I have some posting to do.
  5. Let me describe my setup, so that you can compare. I use a Contabo VPS for around 5 USD a month to host my Wagtail (Django-based) site. The DB also runs on the same infra and since it's SQLite I can back it up externally.

    I probably wouldn't be able to handle 0.5M requests, but I am nowhere near getting them. If I start approaching such numbers I'll consider an upgrade.

    Check out Wagtail if you'd like to have even more batteries included for your site, it was a delight building my site with it:

    https://blog.miloslavhomer.cz/hello-wagtail/

  6. Thank you for the reply, I'll go and make a PR.
  7. Honestly, it makes me a bit sad I am not anywhere on the list at all. Yes, I had only one front page mention ever, the rest of my entries are probably bad and useless, but still.

    I don't see how and why I wouldn't fall into the dataset, does anybody know please?

  8. Hello HN!

    Here's my writeup for the years 2025, 2020 and 2015 of the Advent of Code. I am also slowly building a toolset and libraries in Python to tackle these problems effectively.

    I also have all of the solutions and the library available on GitHub: https://github.com/ArcHound/advent_of_code

    Hope you'll have a great start to 2026!

  9. Thank you for the reference, it's a fascinating read.

    It would be good to highlight that this is fiction, though.

  10. Yes, but there's a hidden benefit taken for granted: machines do not make human errors.

    Sadly, machines not needing human treatment might be reason enough.

  11. There is, but it's hard to obtain: curate, identify and fix the biases in our current texts.

    I am fully aware it's ridiculously expensive to do so.

  12. I can agree with you. And in a discussion with adults working together to address our issues I will.

    The issue is that we don't have exact proof that AI is suitable for tasks and the people doing those are already laid off.

    The economy now is propped up only by the belief that AI will be so successful that it will eliminate most of the workforce. I just don't see how this ends well.

    Remember, regulations are written in blood. And I think we're about to write many brand new regulations.

  13. To me the key point was:

    > One way of looking at this is that we rediscovered that bureaucracy matters. Although some might chafe against procedures and checklists, they exist for a reason: providing a kind of institutional memory that helps employees avoid common screwups at work.

    That's why we want machines in our systems - to eliminate human errors. That's why we implement strict verifiable processes - to minimize the risk of human errors when we need humans in the loop.

    Having a machine making human errors is the exact opposite of what we want. How would we even fix this if the machines are trained on human input?

  14. Consider this situation: security review before a project go-live.

    I have never seen this team before and I'll "never" see this team after the fact. They might be contracted externally, they might leave before the second review.

    Let's say I can suss out people doing this. I don't have the option of giving them the benefit of the doubt and they have the motivation to trick me.

    I guess I've answered my own question a bit, such an environment isn't built to foster trust at all.

  15. > If you know in your heart of hearts that you didn’t put the work in, you’re undermining the social contract between you and your reader.

    There's been a lot of social contract undermining lately. Does anyone please know about something that can be done to try and revert back? Social contract of "F you. I got mine" isn't very appealing to me, but that seems to be the current approach.

  16. I'd say it's a good PoC.

    They want to have many users. So they are ok with using OCR for many users. And since they are sending the accessed content through their APIs, might as well send a copy of it to training.

    In conclusion, it seems that mass OCR usage is within the scope of the AI companies.

  17. AFAIK at least the Comet browser uses OCR, so I worry that the "OCR not feasible" argument is sadly wrong.
  18. Disagree on the method:

    I recall that bot farms use pre-paid SIM cards for their data connections so that their traffic comes from a good residential ASN.

    No client compromise required; it's a networking abuse that gives you a good reputation if you use mobile data.

    But yes, selling botnets made of compromised devices is also a thing.

  19. Seems like you're cooking up a solid bot detection solution. I'd recommend adding JA3/JA4+ into the mix, I had good results against dumb scrapers.

    Also, have you considered Captchas for first contact/rate-limit?

    If you have smart scrapers, then good luck. I recall that bot farms use pre-paid SIM cards for their data connections so that their traffic comes from a good residential ASN. They also have a lot of IPs and overall well-made headless browsers with JS support. Then it's a battle of JS quirks where the official implementation differs from the headless one.

  20. I agree. No use in breaking yourself over a "top business priority" that'll change next week.

    Maybe I'd dispute the last point - seems companies with such employees can do rather well.

  21. As with all issues of power abuse, the real question is: "what are you going to do about it?"

    If the answer from the workers is an overwhelming "nothing", then there's no reason to change.

    And I am not blaming workers. Bills need to be paid, mouths need to be fed. Staying low and taking it might be better than speaking up and risking homelessness.

    Please tell me how I am wrong, I struggle to see how the situation could improve.

  22. Thanks. I just saw on the BBC that it was "for a very good reason". I just thought that I was missing some context. I guess all that's left to say is to wish you a great day.
  23. Hello, I am from overseas. Can someone please explain to me why they would do that? What is the goal, what is the plan, what is the intent? Thanks for any comments, I am utterly confused.
  25. I can imagine that instead of a text selection, they take a screenshot.

    Round trip through recall and OCR, here's your "text" or image for pasting.

    Sounds dumb. I know.

  26. Advent is coming! Post your toolkits for AoC.

    I really enjoyed not having to write/debug 2D map features from scratch every other day.

    Blog: https://blog.miloslavhomer.cz

    Repo: https://github.com/ArcHound/advent_of_code

  27. And to think that a simple API key almost completely solves this problem.

    And yes, I've double-checked that my instance uses WireGuard.

    We'll never learn, will we?

  28. Hello, security here - I'd highly recommend setting a captcha for registrations. Unless the true intent of the site is to capture any and all hackers that are searching for a new plausibly deniable channel for their attacks.
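
Comment 5 above mentions backing up a SQLite database externally. One possible way to take a consistent copy of a database that is in use is the online backup API exposed by Python's standard `sqlite3` module. This is only a minimal sketch, not the setup described in the comment: the function name `backup_sqlite` and the file names are assumptions made up for illustration.

```python
import sqlite3


def backup_sqlite(src_path: str, dest_path: str) -> None:
    """Copy the SQLite database at src_path into dest_path.

    Uses sqlite3.Connection.backup (Python 3.7+), which copies the
    database page by page and can run while the source is in use.
    """
    src = sqlite3.connect(src_path)
    dest = sqlite3.connect(dest_path)
    try:
        src.backup(dest)
    finally:
        dest.close()
        src.close()


# Hypothetical usage; both file names are placeholders:
# backup_sqlite("db.sqlite3", "db-backup.sqlite3")
```

The resulting file could then be shipped off the VPS with whatever transfer tool is already in place.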

This user hasn’t submitted anything.
