Langchain Vector Search Tool uses MCP under the hood #295
nisha2003 wants to merge 11 commits into mcp-migration
Conversation
aravind-segu left a comment
Couple of comments, but looks good overall. Will stamp after E2E testing.
```python
) -> Dict[str, Any]:
    """Build input for MCP tool invocation."""
    mcp_input = self._build_mcp_params(filters, **kwargs)
    mcp_input["query"] = query
```
nit: why special-case this? Can we pass it into `_build_mcp_params`?
```python
query: str,
filters: Optional[Union[Dict[str, Any], List[FilterItem]]] = None,
**kwargs,
) -> List[Document]:
```
Was just curious here: before, `_run` returned a `str`, and now it returns a `List[Document]`. Did we change something, or was the previous return type wrong?
I think it may have been wrong? Previously we just returned `self._vector_store.similarity_search(**kwargs)`, for which the return type is `List[Document]`.
```python
        return filters
    return {item.model_dump()["key"]: item.model_dump()["value"] for item in filters}

def _build_mcp_meta(
```
nit: use _build_mcp_params directly?
```python
    """Build metadata dict for MCP tool invocation."""
    return self._build_mcp_params(filters, **kwargs)

def _parse_mcp_response(self, mcp_response: str) -> List[Dict]:
```
nit: same thing here — use `_parse_mcp_response_to_dicts` directly?
```python
self,
query: str,
filters: Optional[Union[Dict[str, Any], List[FilterItem]]] = None,
openai_client: OpenAI = None,
```
Can we use the DatabricksOpenAI here to automatically authenticate with the WorkspaceClient?
```python
def _validate_mcp_tools(self, tools: list) -> None:
    """Validate that MCP tools were returned."""
    if not tools:
        raise ValueError(f"No MCP tools found for index {self.index_name}")
```
Can we add the other validation that the tools list has exactly length 1? We do that in the OpenAI path right now.
```python
        return [{"page_content": mcp_response, "metadata": {}}]

    if not isinstance(parsed, list):
        if strict:
```
nit: It looks like strict is always True, do we need this parameter?
```python
results = []
for item in parsed:
    if isinstance(item, dict):
        page_content = item.get(text_col, str(item))
```
Here, if `text_col` is not in the item, we stringify the whole item. Maybe we should throw a good error here instead? Are there cases where the text column won't exist?
```python
params: Dict[str, Any] = {}

if query is not None:
    params["query"] = query
```
`query` is required in the MCP params. Let's not accept a `None` query.
```python
if query is not None:
    params["query"] = query

num_results = kwargs.pop("num_results", kwargs.pop("k", self.num_results))
```
Migrate the Langchain Vector Search Tool to use MCP adapters. We still preserve the direct-API path for self-managed embeddings, for which there is no MCP support.
Refactored duplicated MCP-path code shared between the OpenAI and Langchain tools into the base mixin class, and moved tests for that shared functionality to the mixin class as well.
Manual tests (https://eng-ml-inference.staging.cloud.databricks.com/editor/notebooks/1465545330011655?o=1653573648247579) are in the Langchain Migration section.
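The shared-mixin refactor described above might look roughly like this sketch; the class names are hypothetical, and only `_parse_mcp_response_to_dicts` appears in the diff:

```python
import json


class VectorSearchMcpMixin:
    """Shared MCP plumbing reused by the framework-specific tools."""

    def _parse_mcp_response_to_dicts(self, mcp_response: str) -> list:
        """Parse the MCP tool's JSON response into a list of result dicts."""
        parsed = json.loads(mcp_response)
        if not isinstance(parsed, list):
            raise ValueError("Expected a JSON list from the MCP tool")
        return parsed


class LangchainVectorSearchSketch(VectorSearchMcpMixin):
    """Each framework tool converts the shared dicts to its own document
    type (e.g. Langchain's Document); here we just return the dicts."""

    def run(self, mcp_response: str) -> list:
        return self._parse_mcp_response_to_dicts(mcp_response)
```

Keeping parsing and validation in the mixin means the OpenAI and Langchain tools only differ in how they wrap results, which is what lets the tests move to the mixin as well.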