Preferences

dalf
Joined 243 karma
alex.andre [at] al-f.net

  1. This blog post was really helpful to understand the ecosystem: https://alpopkes.com/posts/python/packaging_tools/

    It mentions uv at the end and rye at first (which use uv internally).

  2. I remember the kettle took forever at ~ 9000 ft (near Huanglong, Sichuan Province).
  3. You might not have a fork if you dine with philosophers

    ( https://en.wikipedia.org/wiki/Dining_philosophers_problem )

  4. Disclaimer: this is an *hypothesis* I made, all this might be wrong:

    * A social graph is built from various sources <-- this is where I wonder what the sources are, what is extracted?

    * If I'm connected to Bob, I will see content related to his interests : It appears that the system tries to pick video channels related to me, but the video selection is skewed towards Bob's topics. It seems that Bob's topic embeddings are mapped to the text embeddings of video thumbnails (same for the video titles but with less weight). Since the context is small, sometimes it's off.

    * If you refresh the page, it may return to your original recommendations, unless new data from external sources is fed into the algorithm. What's scare me: I wonder if the feed is in near real time. If this is the case, that's explain why my recommendation are as usual and sometimes completely off track for me but related to some people I know.

    * It might buffer some topics : if you connect again with Bob but there are no new topics, it gives back the old topic related to Bob.

    Once again, this is a pill of a lot of gueses.

    ---

    I really wish to go back to the previous algorithm which one of the reason I subscribed to YT premium, now it's just a minefield.

  5. similar post 4 months ago:

    Synology Lost the Plot with Hard Drive Locking Move (servethehome.com) https://www.hackerneue.com/item?id=43734706

  6. Same experience here. It used to give me spot-on recommendations based on my watch history-often out of nowhere, but still accurate. That was one of the reasons I subscribed to YouTube Premium.

    Now, I strongly feel it's based on a wrongly inferred social graph. It recommends videos according to what's happen in the group. It's really unsettling.

    I wish there were a setting to disable this "feature." and to stick on my history.

  7. This partly explains why the recommendations I receive don't feel like mine. Multiple times, it's been obvious that the suggestions were pulled from other profiles and I could even tell whose.

    My hypothesis

    * The algorithms have linked my account to some others.

    * They then serve me the embeddings extracted from those profiles. The near-real-time nature of this has crossed my mind more than once.

    It's really unsettling, and afterwards I feel uneasy about any recommendations (all Google services, Netflix seems problematic too, not Amazon).

    YouTube seems to have some hidden knobs for tuning this behaviour: after multiple negative feedbacks, the problematic content disappeared from my front page. However, the recommendations on the right-hand side of individual videos remain problematic, and the automatic playlists of YouTube Music are still strangely disturbing (even after multiple negative feedbacks).

  8. I used to love YouTube's algorithm: suggestions outside of my subscriptions but exactly on the spot. I used to took time to curate.

    However since few months, all recommendations became off track. Then I started to realize these recommendations were related to the life of people I know. At least most of them, some subjects are private and personal, so this is really unsettling and I can't say if the algorithm is halucinating or not (I want to know if people tells me, not like this...). I have the impression that this is a Plato's cave illuminate with the private lives of those around me.

    Technically, my understand is this:

    * YT suggests channels closed to my interrests

    * YT picks videos with thumbnails very closed to this "life of others" feed = the thumbnail leaks more than the video title.

    I ended up to send a feedback to YT, no response, but most of these recommendations disapeared, and then it went back here and there, so I don't use the recommendation page anymore. YT music has more or less the same issue with automatic playlist.

    The analytical part of me wonders why YT subscribes me to these "feeds", how these "feeds" are fed (maybe it goes beyond Alphabet / Google), but that's strange and disturbing enough.

    Note: perhaps the fact I use ublock, blokadda, Firefox, etc... most probably makes my profile focussed on technical stuff, which makes other recommendations more visible.

  9. I discovered that my institution blocks all traffic to Russia (v8.js-dos.com is hosted in Russia).
  10. A few years ago, I remember someone conducted a study on the quality of SearX(NG) results using different Internet providers: mobile, fiber, and VPN.

    I'm not sure if this person is still active on HN, but I'm really curious about the results.

  11. Disclaimer: I am one of the maintainers.

    The intent of SearXNG is to be stateless (with no sessions on the server) and to work without JavaScript.

    However, this approach limits certain features because of the restricted size of cookies (and other forms of browser storage require JavaScript).

  12. Podcast is one of two the reasons I've unsubscribed Spotify (as you describe, the UX is terrible).

    The other one, I was not able to "teach" the algorithm what I like even after 3 months.

  13. I've turned off web/app activities and don't use Google web search (directly). However, I keep YouTube and maps tracking (from time to time).

    YouTube because this is one of the algorithms that provide good suggestions with minor work on my side.

    Position because I can go back in time when I travel (I try to turn it off otherwise).

  14. Previous post from 2019: https://www.hackerneue.com/item?id=18895833

    (there are other posts with one or very few comments)

  15. This might be relevant here: there is a Rust implementation of this algorithm by BurntSushi [1], and discussion [2] about the performances and Hyperscan [3] in the case of Suricata [4]. HyperScan being regular expression matching library using AMD64 SIMD instructions by Intel. HyperScan is in low maintenance mode, but there is a maintained fork which is compatible with more architectures: Vectorscan [5]

    [1] https://github.com/BurntSushi/aho-corasick/

    [2] https://github.com/BurntSushi/aho-corasick/discussions/136

    [3] https://github.com/intel/hyperscan

    [4] https://github.com/OISF/suricata

    [5] https://github.com/VectorCamp/vectorscan

  16. I was avoiding EMM386 as much as I could on my slow 386SX25: there was a huge performance hit.

    There was the Unreal Mode aka Flat mode. With few lines of assembly code, an memory allocation using HIMEM.SYS, it was possible to access all the memory: the segment limit is 4GB instead of 64KB in this "mode". Of course it was not compatible with Windows, so the unreal mode was more for fun than anything else.

    https://en.wikipedia.org/wiki/Unreal_mode

  17. Seafile (a file sync storage) is inspired by git to store the files (internally there are repositories, branches and commits). However the file are not stored directly:

    > A file is further divided into blocks with variable lengths. We use Content Defined Chunking algorithm to divide file into blocks.

    > This mechanism makes it possible to deduplicate data between different versions of frequently updated files, improving storage efficiency. It also enables transferring data to/from multiple servers in parallel.

    I use it on old PC without issue. Drawback: since the files are not stored in clear, in case of data corruption of the Seafile repositories, I need backup (never happened to me).

    * https://manual.seafile.com/develop/data_model/

    * https://pdos.csail.mit.edu/papers/lbfs:sosp01/lbfs.pdf

  18. For now the suggested communication protocol is:

    * os.pipe and serialization (pickle or whatever): https://peps.python.org/pep-0554/#synchronize-using-an-os-pi...

    * immortal object, but I don't see a way to create immortal object from Python (only from C). https://engineering.fb.com/2023/08/15/developer-tools/immort...

    I guess it will more iteration to get a better way to communicate between the interpreters.

  19. There is also typing.ParamSpec when the purpose is to write a generic wrapper:

    https://docs.python.org/3/library/typing.html#typing.ParamSp...

  20. wikidata [1] [2] has "few" species with links to many other databases like

    * plazi.org list taxonomic treatment (~ species description) found in journals, papers using OCR when needed.

    * gbif.org list specimens (and other things) using normalized datasets provided by various institutions (including Plazi).

    one process among many others : some algorithms run by GBIF find potential matches between species and specimens, with some human curation, we can link between a specimen and the related papers.

    [1] https://www.wikidata.org/wiki/Q309337 [2] https://www.wikidata.org/wiki/Q106254624

  21. I'm used to website ( often old.reddit.com because it is more readable ).

    Last year, when I tried the app, it asked if I was happy about it very often. It becomes annoying to skip this question. I tried to answer "No": I felt my answer got lost in a confusing procedure. I never tried to choose "Yes", but I guess it would have led me to the notation on Google Play?

  22. mypyc does that: https://github.com/mypyc/mypyc

    > Mypyc compiles Python modules to C extensions. It uses standard Python type hints to generate fast code. Mypyc uses mypy to perform type checking and type inference.

    > Mypyc can compile anything from one module to an entire codebase. The mypy project has been using mypyc to compile mypy since 2019, giving it a 4x performance boost over regular Python.

    I have not experience a 4x boost, rather between 1.5x and 2x. I guess it depends on the code.

This user hasn’t submitted anything.

Keyboard Shortcuts

Story Lists

j
Next story
k
Previous story
Shift+j
Last story
Shift+k
First story
o Enter
Go to story URL
c
Go to comments
u
Go to author

Navigation

Shift+t
Go to top stories
Shift+n
Go to new stories
Shift+b
Go to best stories
Shift+a
Go to Ask HN
Shift+s
Go to Show HN

Miscellaneous

?
Show this modal