Don't use a vector database for code; embeddings are slow and a poor fit for code. Code search likes BM25 + trigrams, which gets better results while keeping responses snappy.
static embedding models I'm finding quite fast
lee101/gobed https://github.com/lee101/gobed is ~1ms on GPU :) It would need to be trained for code, though; the bigger code-LLM embeddings can be high quality too, so it's really a question of where the ideal point is on the Pareto frontier. You're right that in practice it often ends up being BM25 or rg even for code, but more complex setups are possible if search quality really matters.
lee101/gobed https://github.com/lee101/gobed uses static embedding models, so documents are embedded in milliseconds, and search runs on GPU with a CAGRA-style index. There are a few tricks for speed, like int8 quantization on the embeddings and fusing the embedding and search into the same kernel, since the embedding really is just a trained map of per-token embeddings plus averaging.
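For what it's worth, a minimal sketch of that combo using SQLite's FTS5, which ships both a built-in bm25() ranking function and a trigram tokenizer (SQLite 3.34+); the table and data here are just illustrative:

```python
import sqlite3

db = sqlite3.connect(":memory:")
# FTS5 with the trigram tokenizer handles substring-ish code queries;
# bm25() is FTS5's built-in relevance score (more negative = better match).
db.execute("CREATE VIRTUAL TABLE code USING fts5(path, content, tokenize='trigram')")
db.executemany(
    "INSERT INTO code (path, content) VALUES (?, ?)",
    [
        ("search.py", "def bm25_search(query): ..."),
        ("index.py", "class TrigramIndex: ..."),
    ],
)
rows = db.execute(
    "SELECT path, bm25(code) AS score FROM code WHERE code MATCH ? ORDER BY score",
    ("bm25_search",),
).fetchall()
print(rows)  # best matches first
```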
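Not gobed's actual API, just a toy numpy sketch of the static-embedding idea described above: a per-token embedding table plus averaging, int8-quantized stored vectors, and brute-force scoring standing in for the CAGRA-style GPU index.

```python
import numpy as np

# Hypothetical toy vocabulary and "trained" per-token embedding table
# (in a real static model this table is learned; here it's random).
vocab = {"def": 0, "search": 1, "index": 2, "query": 3, "the": 4}
table = np.random.randn(len(vocab), 64).astype(np.float32)

def embed(text: str) -> np.ndarray:
    """Static embedding: look up each token's vector and average them."""
    ids = [vocab[t] for t in text.lower().split() if t in vocab]
    if not ids:
        return np.zeros(64, dtype=np.float32)
    v = table[ids].mean(axis=0)
    return v / (np.linalg.norm(v) + 1e-9)  # normalize for cosine scoring

def quantize_int8(vecs: np.ndarray):
    """int8-quantize stored embeddings to shrink the index."""
    scale = np.abs(vecs).max(axis=1, keepdims=True) / 127.0
    return (vecs / scale).round().astype(np.int8), scale

docs = ["def search index", "query the index", "def query"]
doc_vecs = np.stack([embed(d) for d in docs])
q_vecs, scales = quantize_int8(doc_vecs)

def search(query: str, k: int = 2):
    q = embed(query)
    scores = (q_vecs.astype(np.float32) * scales) @ q  # dequantize + dot product
    top = np.argsort(-scores)[:k]
    return [(docs[i], float(scores[i])) for i in top]

print(search("search the index"))
```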
I built a lib for myself https://pypi.org/project/piragi/
With AI needing more access to documentation, WDYT about using RAG for documentation retrieval?
If your data aren't too large, you can use faiss-cpu and pickle
https://pypi.org/project/faiss-cpu/
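Roughly like this, assuming everything fits in RAM; serialize_index/deserialize_index are real FAISS helpers, but the embeddings here are random stand-ins for whatever model you actually use:

```python
import pickle
import numpy as np
import faiss

dim = 384
docs = ["chunk one ...", "chunk two ...", "chunk three ..."]
vecs = np.random.rand(len(docs), dim).astype("float32")  # stand-in embeddings
faiss.normalize_L2(vecs)              # so inner product == cosine similarity

index = faiss.IndexFlatIP(dim)        # exact, in-RAM index
index.add(vecs)

# Persist both the index bytes and the doc texts with pickle.
with open("rag_index.pkl", "wb") as f:
    pickle.dump({"index": faiss.serialize_index(index), "docs": docs}, f)

with open("rag_index.pkl", "rb") as f:
    state = pickle.load(f)
index = faiss.deserialize_index(state["index"])

query = np.random.rand(1, dim).astype("float32")
faiss.normalize_L2(query)
scores, ids = index.search(query, 2)
print([(state["docs"][i], float(s)) for i, s in zip(ids[0], scores[0])])
```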
For the uneducated, how large is too large? Curious.
FAISS runs in RAM. If your dataset can't fit into RAM, FAISS is not the right tool.
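Back-of-the-envelope for "how large is too large" with a flat float32 index (real usage also depends on the index type and your metadata; IVF/PQ variants use far less):

```python
# Flat float32 index: n_vectors * dim * 4 bytes, plus the original texts.
n, dim = 1_000_000, 768
print(f"{n * dim * 4 / 1e9:.1f} GB")  # ~3.1 GB; 10M vectors of this size is ~31 GB
```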
Sqlite-vec
Local LibreChat which bundles a vector db for docs.
LightRAG, with Archestra as a UI via the LightRAG MCP
sqlite's bm25
A little BM25 can get you quite a way with an LLM.
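E.g. a tiny retrieve-then-prompt loop with the rank-bm25 package; the docs and prompt assembly are just illustrative:

```python
from rank_bm25 import BM25Okapi

docs = [
    "To configure the API client, set API_KEY in your environment.",
    "The retry policy defaults to three attempts with exponential backoff.",
    "Deployment requires Python 3.10 or newer.",
]
bm25 = BM25Okapi([d.lower().split() for d in docs])  # tokenized corpus

question = "how do I set the api key?"
top_chunks = bm25.get_top_n(question.lower().split(), docs, n=2)

# Stuff the retrieved chunks into the LLM prompt however you like.
prompt = (
    "Answer using only this context:\n"
    + "\n".join(top_chunks)
    + f"\n\nQ: {question}\nA:"
)
print(prompt)
```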
try out Chroma, or better yet ask Opus to!
simple lil setup with qdrant
AnythingLLM is promising
undergrowth.io
SQLite with FTS5