YC Badge: 0x9F755062601B9bBA3354c090e11A13a508d7CA0e
- that was me swyx
- i support this
- Good article - the most use cases i see of pg_vector are typically “chat over their technical docs” - small corpus - doesn’t change often / can rebuild the index - no multi-tenancy avoids much of the issues with post-filtering
Chroma implements SPANN and SPFresh (to avoid the limitations of HNSW), pre-filtering, hybrid search, and has a 100% usage-based tier (many bills are around $1 per month).
Chroma is also apache 2.0 - fully open source.
- congrats to factory on the amazing product and release!
- chroma stores both
- 66 points
- Hey there
Chroma is fully OSS - embedded, single-node and distributed (data and control plane). afaik lance distributed is not OSS.
We do have plans to release the crate (enabling embedded chroma in rust) - but haven't gotten around to it yet. Hopefully soon!
> Do you see all providers converging on similar alpha i.e cheap object storage, nvme drives,ssd cache to solve this?
It's not only a new pattern in search workloads, but it's happening in streaming, KV, OLTP, OLAP, etc. Yea - it's the future.
- > Supabase/pgVector needs lots of resources when adding new rows to the index -> wish the resources scale up/down automatically. Instead of having to monitor and switch to the next plan.
Many ways potentially - but one way is Chroma makes all this pain go away.
We're also working on some ingestion tooling that will make it so you don't have to scale, manage or run those pipelines.
- very fair!
cloud has been in private beta for a year now.
we chose to not release it to the public until we were extremely confident in the system and its characteristics.
databases are a serious business. developers trust us with their mission critical data.
- no worries!
Chroma and Weaviate are both popular OSS projects, 21k and 14k stars respectively.
Weaviate has some features that Chroma does not (yet) - notably vector quantization. Weaviate is a single-node DB that can be run in cluster mode.
Chroma focuses on reliability and operational simplicity. The fully distributed architecture means there are no knobs to tune or config to set.
There's no right tool for everyone. Try both!
- Qdrant team is also smart!
The main reason we have seen people switch from Qdrant to Chroma is for operational simplicity and reliability. With Qdrant you have to size nodes, manage sharding, be oncall, etc - with Chroma there is literally 0 config or operational pain. Search, that just works!
- thank you!
- your customers might have heard of RAG (retrieval augmented generation) before. chroma powers the “R” in RAG. It enables language models to dynamically pull in information to help them answer questions and solve tasks.
- can you tell me more about your use case?
- if you want or need to optimize for speed, cost, scalability or accuracy.
dedicated solutions have more advanced search features enable more accurate results. search indexing is resource intensive and can contend for resources with postgres/redis. the cost and speed benefits are naturally more pronounced as data volume scales.
for example - chroma has built in regex+trigram search and copy-on-write forking of indexes. this feature combo is killer for the code-search use case.
- 93 points
- 13 points
- There is so much signal in job posts - excited to see this launch.
- how could i forget!
i claimed swyx heard it through me - which he did