(DOCSP-50370): Create new LangChain self-query retrieval notebook #21

davidhou17 · 2025-05-28T20:56:40Z

dacharyc

Everything works, but I've got a couple of nits with re-declaring stuff we've already declared, and some of the filter results. Non-blocking comments below!

dacharyc · 2025-06-27T14:58:16Z

ai-integrations/langchain-self-query-retrieval.ipynb

+    "from langchain_core.runnables import RunnablePassthrough\n",
+    "from langchain_openai import ChatOpenAI\n",
+    "\n",
+    "llm = ChatOpenAI(model=\"gpt-4o\")\n",


Nit: in the context of this notebook, we're re-declaring an llm we already declared up above in ln 233. I'd probably omit this line, and omit the related import from langchain_openai import ChatOpenAI in ln 343 above.

I also don't love re-declaring the retriever with one additional param. It would be great if we could set enable_limit when we initially declare the retriever in ln 234, and then remove the re-initializing here.

It makes sense to have these things on a docs page if we want this to be a stand-alone code example, but here in the context of the notebook, it's not needed.

dacharyc · 2025-06-27T15:03:16Z

ai-integrations/langchain-self-query-retrieval.ipynb

+   "id": "833d90d9",
+   "metadata": {},
+   "source": [
+    "### Queries with filters"


I got some query results that seem unrelated to the filter. i.e. for "toys", I got this document:

Document(id='685eaec1edc703d86a4c7201', metadata={'_id': '685eaec1edc703d86a4c7201', 'year': 1979, 'rating': 9.9, 'genre': 'science fiction'}, page_content='Three men walk into the Zone, three men walk out of the Zone')

For thriller and action, I got this document:

Document(id='685eaec1edc703d86a4c7203', metadata={'_id': '685eaec1edc703d86a4c7203', 'year': 1995, 'genre': 'animated', 'rating': 9.3}, page_content='Toys come alive and have a blast doing so')

I'm sure this is related to the limited amount of sample data we're providing, but it doesn't show the feature great to have these seemingly unrelated results being returned. I wonder if we want to add more sample data to show only obviously related results being retrieved?

davidhou17 force-pushed the DOCSP-50370 branch from da02802 to bd8f2f5 Compare May 28, 2025 21:10

create new notebook

a2b3560

davidhou17 force-pushed the DOCSP-50370 branch from bd8f2f5 to a2b3560 Compare June 26, 2025 17:33

davidhou17 requested a review from dacharyc June 26, 2025 17:38

dacharyc approved these changes Jun 27, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

(DOCSP-50370): Create new LangChain self-query retrieval notebook #21

(DOCSP-50370): Create new LangChain self-query retrieval notebook #21

Uh oh!

davidhou17 commented May 28, 2025 •

edited

Loading

Uh oh!

dacharyc left a comment

Uh oh!

dacharyc Jun 27, 2025

Uh oh!

dacharyc Jun 27, 2025

Uh oh!

Uh oh!

(DOCSP-50370): Create new LangChain self-query retrieval notebook #21

Are you sure you want to change the base?

(DOCSP-50370): Create new LangChain self-query retrieval notebook #21

Uh oh!

Conversation

davidhou17 commented May 28, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

dacharyc left a comment

Choose a reason for hiding this comment

Uh oh!

dacharyc Jun 27, 2025

Choose a reason for hiding this comment

Uh oh!

dacharyc Jun 27, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

davidhou17 commented May 28, 2025 •

edited

Loading