How to not weigh part of a vector? #148
Replies: 2 comments
I think the main issue is your distance function: text embeddings are generally trained with cosine similarity, so cosine or inner product should work as expected and show meaningful semantic similarities. I believe that also explains why your solution works well with exact document matches but produces false similarities for cases without them. When you use cosine or inner product, the easiest solution to your first issue is using a zero vector for the part of the query embedding that corresponds to the document.
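To make the suggestion concrete, here is a minimal numpy sketch (toy 4-d vectors standing in for real 1024-d embeddings, all values hypothetical). It shows that under the inner product a zeroed title slice contributes nothing to the score, whereas under squared L2 an "empty" slice is not neutral:

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy 4-d stand-ins for real 1024-d embeddings (hypothetical values).
title_emb = rng.normal(size=4)
content_emb = rng.normal(size=4)
doc_vec = np.concatenate([title_emb, content_emb])

# Query that mentions no document: zero out the title slice instead of
# embedding an empty string.
query_content = rng.normal(size=4)
query_vec = np.concatenate([np.zeros(4), query_content])

# Inner product: the zeroed slice contributes exactly nothing.
assert np.isclose(doc_vec @ query_vec, content_emb @ query_content)

# Squared L2: the title slice still adds ||title_emb||^2 to the distance,
# so the document's title embedding keeps influencing the ranking.
l2 = np.sum((doc_vec - query_vec) ** 2)
assert np.isclose(
    l2, np.sum(title_emb**2) + np.sum((content_emb - query_content) ** 2)
)
```

The same holds for cosine similarity, since it is an inner product of normalized vectors.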
That seemed to do the trick!
I got here through the recent livestream, where @svonava explained how you concatenate multiple embeddings to boost several criteria. I have a few questions.
For context, I'm building a RAG system that lets users ask questions about various relatively large documents (such as medical guidelines and manuals). For a proof of concept, I want to boost results when the user specifically mentions a document (e.g. "What does the dementia guideline say about..."). To achieve this, I ask an LLM to process the question and return the document name if one is mentioned (in this case "dementia guideline") or nothing. I have also created my vectors as a concatenation of the embedding of the document's original title (e.g. "Guideline on Dementia") and the embedding of the document's contents.
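The concatenation described above can be sketched as follows; `embed` is a hypothetical stand-in for a real embedder (e.g. Mistral's), used only to make the shapes concrete:

```python
import numpy as np

DIM = 1024  # per-property embedding size (Mistral embeddings are 1024-d)

def embed(text: str) -> np.ndarray:
    """Hypothetical stand-in for a real text embedder."""
    rng = np.random.default_rng(abs(hash(text)) % (2**32))
    v = rng.normal(size=DIM)
    return v / np.linalg.norm(v)

def document_vector(title: str, contents: str) -> np.ndarray:
    # One slice per property: title embedding, then contents embedding.
    return np.concatenate([embed(title), embed(contents)])

vec = document_vector("Guideline on Dementia", "contents of the guideline")
assert vec.shape == (2 * DIM,)
```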
Issue
The issue I'm running into is the following: the proof of concept works well when an actual document is referenced, in other words when my search query contains a vector for the title part. However, when that part is empty (the embedding of an empty string), I notice the retrieved documents are strongly biased toward certain sources. My suspicion is that the 'empty' query embedding is not actually neutral, but is in fact 'closer' to some results than to others. How do you go about 'disabling' a certain property in your queries?
Number of dimensions
Another, more general question: if you keep adding properties with this concatenation method, the number of dimensions keeps growing, right? My embeddings are 1024-d, so in my proof of concept with two properties I'm already at 2048-d. What does adding more properties do to performance, and is there a practical limit?
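Not an answer from the thread, but one common way to keep each property's influence under control as the concatenation grows is to L2-normalize every slice and scale it by a weight; the inner product of two such concatenations then decomposes into a weighted sum of per-property cosine similarities. A sketch under that assumption (the weighting scheme itself is not from this discussion):

```python
import numpy as np

def weighted_concat(parts, weights):
    """L2-normalize each property slice, then scale it by its weight.

    With this layout, the inner product of two concatenations equals a
    weighted sum of per-property cosine similarities, so adding more
    properties doesn't let any single slice dominate by magnitude.
    """
    out = []
    for p, w in zip(parts, weights):
        p = np.asarray(p, dtype=float)
        n = np.linalg.norm(p)
        out.append(w * (p / n) if n > 0 else p)  # zero slices stay zero
    return np.concatenate(out)
```

For example, with weights `[1.0, 2.0]` applied on both the query and document side, the title property contributes its cosine similarity once and the second property contributes it four times over.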
Practically: I'm using a default ChromaDB setup (squared L2 distance). For embedding I'm using the Mistral embedder, which produces 1024-dimensional vectors. (I'm not using Superlinked yet.)
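For reference, ChromaDB's default HNSW distance is squared L2; a collection can be switched to cosine (or inner product) at creation time via the `hnsw:space` metadata key. The collection name below is hypothetical:

```python
import chromadb

client = chromadb.Client()  # in-memory client for a quick test
# Default space is "l2" (squared L2); with "cosine" or "ip", a zeroed
# query slice contributes nothing to the score.
collection = client.create_collection(
    name="guidelines",  # hypothetical collection name
    metadata={"hnsw:space": "cosine"},
)
```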