
1. One of the reasons we created Khoj was to enable natural language search with embeddings generated offline using open-source models!

2. We don't use any vector datastores (yet). You can do a lot in memory; it's faster, and it gives exact matches (no approximate nearest-neighbor search).

Feel free to ask if you were looking for something more specific?
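The in-memory, exact-match approach described above can be sketched roughly like this (a minimal illustration, not Khoj's actual code; it assumes chunk embeddings were already computed offline with some open-source model and loaded as a NumPy array):

```python
import numpy as np

def top_k_exact(query_emb: np.ndarray, chunk_embs: np.ndarray, k: int = 3):
    """Exact cosine-similarity search: score every chunk, no ANN index."""
    q = query_emb / np.linalg.norm(query_emb)
    c = chunk_embs / np.linalg.norm(chunk_embs, axis=1, keepdims=True)
    scores = c @ q                     # cosine similarity of each chunk vs. query
    idx = np.argsort(-scores)[:k]      # best k chunks, exact (no approximation)
    return idx, scores[idx]

# Toy usage with random stand-in "embeddings"
rng = np.random.default_rng(0)
chunks = rng.normal(size=(1000, 384))             # 1000 chunks, 384-dim vectors
query = chunks[42] + 0.01 * rng.normal(size=384)  # query close to chunk 42
idx, scores = top_k_exact(query, chunks, k=3)
```

Because every chunk is scored, the results are exact; the trade-off is linear scan time, which stays cheap for corpora that fit in memory.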



Thank you! I’d love to hear more about your experiences with:

1. content / question vector mismatch

2. what types of embeddings you experimented with storing per chunk (text only? hypothetical questions? metadata?)

3. choice of embedding model (e.g. OpenAI vs. InstructorEmbeddings, or an alternative from the MTEB leaderboard)

It’s a great project; I’m going to take a deeper dig today.



