whinvik
Since we are talking about an embedded solution, shouldn't the benchmark be something like sqlite with a vector extension, or lancedb?
My natural point of comparison would actually be DuckDB plus its vector search extension.
I mention sqlite + sqlite-vec at the end, noting it requires technical overhead and it's not as easy as read_parquet() and write_parquet().
I just became aware of lancedb and am looking into it. From glancing at the README, it has similar issues to faiss with regard to usability for casual use, though it is much better than faiss in that it can work with colocated metadata.