kipukun
Joined 42 karma
- kipukun parentXGBoost is just faster on the GPU, regardless of the file format. A sibling post also pointed out compression helping out on bandwidth.
- The cuDF interop in the roadmap [1] will be huge for my workloads. XGBoost has the fastest inference time on GPUs, so a fast path straight from these Vortex files to GPU memory seems promising.
- For your wimsey library, using “pipe” to validate the contracts would seem to me to drastically slow down the Polars query because the UDF pushes the query out of Rust into Python. I think a cool direction would be to have a “compiler” which takes in a contract and spits out native queries for a variety of dataframe libraries (pandas/polars/pyspark). It becomes harder to define how to error with a test contract but that can be the secret sauce.
- 1. I use it on all of my servers.
2. FreeBSD offers a coherent base system with sane tools. pf is a godsend, and the ports collection is a good trade-off of configuration and simplicity. ZFS used to be a killer feature but is being integrated into Linux systems.
3. FreeBSD is designed around "general purpose" use and has a lot of backing from corporations like Netflix.
4. FreeBSD is missing a true container solution that is compatible with Docker (jails don't count). There has been some work on this as of the most recent BSDcon, so I'm hopeful.
5. I never want to look at iptables again.
EDIT: fix spacing