[BUG]: In query mode, returned answer citations have match rates ranging from 0% to 3%. #3076
Comments
Turn off reranking when in query mode; that is the root cause of this (and also why it is not the default mode for a workspace).
Thanks for your answer @timothycarambat, but I couldn't find any instructions on where or how to change this setting. Can you help me?
Of course. In the UI we call reranking "Search Preference", since "re-ranking" is ambiguous to non-technical people. If you click a workspace's settings (gear icon) and navigate to "Vector Database", you will find a "Search Preference" field, which you can change from "Accuracy Optimized" back to "Default": https://docs.anythingllm.com/llm-not-using-my-docs#vector-database-settings--search-preference
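To make the distinction concrete, here is a minimal sketch of how a rerank pass can reorder the candidates that plain vector search returned. The functions, score fields, and data are purely illustrative, not AnythingLLM internals:

```python
# Hypothetical sketch: plain vector retrieval vs. an added rerank step.
# All names and scores are made up for illustration.

def vector_search(query, docs, top_k=3):
    # Pretend these similarity scores came from an embedding comparison.
    scored = sorted(docs, key=lambda d: d["vector_score"], reverse=True)
    return scored[:top_k]

def rerank(query, candidates):
    # A reranker re-scores candidates with a different model (often a
    # cross-encoder), which can surface different passages than the
    # raw vector order would.
    return sorted(candidates, key=lambda d: d["rerank_score"], reverse=True)

docs = [
    {"id": "A", "vector_score": 0.82, "rerank_score": 0.10},
    {"id": "B", "vector_score": 0.75, "rerank_score": 0.95},
    {"id": "C", "vector_score": 0.40, "rerank_score": 0.50},
]

default_order = [d["id"] for d in vector_search("egg bread recipe", docs)]
reranked_order = [d["id"] for d in rerank("egg bread recipe",
                                          vector_search("egg bread recipe", docs))]
print(default_order)   # ['A', 'B', 'C']
print(reranked_order)  # ['B', 'C', 'A']
```

The point is that the two modes can legitimately disagree on which chunks to cite, which is why switching "Search Preference" back to "Default" changes the citations you see.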
Thank you for your answer @timothycarambat. Reading your explanation, I had gotten the opposite impression :) I made the change, and this time it surfaced more relevant information. However, none of the citations it marked as 65% relevant to my question "egg bread recipe" have anything to do with that topic. How can I improve this?
Would changing the Embedding model here provide better results? |
@kaanvur The default embedder is pretty bad at multilingual embedding (see #658). However, we support using Ollama/LMStudio with any embedder you can find on HF, which goes a long way; you would indeed get better responses with a non-default embedder. The default we ship is super small, which is why we use it. Other models like Jina can be more comprehensive, but they are also around 300 MB.
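The match rates in question are essentially similarity scores between the query embedding and each chunk embedding. A toy cosine-similarity sketch (with made-up 3-dimensional vectors; real embedders use hundreds of dimensions) shows how a query that an embedder represents poorly ends up with near-zero scores:

```python
import math

def cosine_similarity(a, b):
    # Cosine similarity between two embedding vectors, in [-1, 1].
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm

# Toy "embeddings" for illustration only.
query_vec = [1.0, 0.2, 0.0]
good_match = [0.9, 0.3, 0.1]   # a doc the embedder represents well
poor_match = [0.0, 0.1, 1.0]   # a doc it places far from the query

print(round(cosine_similarity(query_vec, good_match), 2))  # 0.99
print(round(cosine_similarity(query_vec, poor_match), 2))  # 0.02
```

If a multilingual query lands like `poor_match` for every chunk, you get exactly the 0-3% citations reported here, which is why swapping in a stronger embedder helps.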
Thank you for all your answers and patience. I guess I need to try different combinations and see which gives the best results :) Thanks for your efforts!
How are you running AnythingLLM?
AnythingLLM desktop app
What happened?
I added two files.
I selected query mode in chat settings.
The document similarity threshold is set to "medium" in the vector database settings.
But the model still returns answers to irrelevant questions.
As citations, it shows a few examples with match rates ranging from 0% to 3%.
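For illustration, the behavior one would expect from a similarity threshold can be sketched as follows. The cutoff values, function, and data are hypothetical, not AnythingLLM's actual implementation:

```python
# Assumed behavior: a similarity threshold should drop chunks scoring
# below the cutoff, so 0-3% citations would be filtered out rather
# than shown. The cutoff mapping here is an assumption.

THRESHOLDS = {"none": 0.0, "low": 0.25, "medium": 0.50, "high": 0.75}

def filter_citations(chunks, setting="medium"):
    cutoff = THRESHOLDS[setting]
    return [c for c in chunks if c["score"] >= cutoff]

chunks = [
    {"id": "chunk-1", "score": 0.03},
    {"id": "chunk-2", "score": 0.62},
    {"id": "chunk-3", "score": 0.01},
]
kept = filter_citations(chunks, "medium")
print([c["id"] for c in kept])  # ['chunk-2']
```

Under that assumption, seeing 0-3% citations with a "medium" threshold is what makes the reported behavior look like a bug.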
Are there known steps to reproduce?