Skip to content

Conversation

@VirajAgarwal1
Copy link
Contributor

No description provided.

- azure
- claudeAI
- cohere
- crewAI
- crewAi-short-term-memory
- huggingface
- jinaAI
- openrouter-deepseek
- Introduced new tutorials for Retrieval-Augmented Generation (RAG) using LlamaIndex with Couchbase's FTS and GSI.
- Updated frontmatter and descriptions to reflect the integration of FTS and GSI capabilities.
- Enhanced Jupyter Notebooks with detailed instructions and examples for both FTS and GSI implementations.
@VirajAgarwal1 VirajAgarwal1 self-assigned this Oct 31, 2025
@github-actions
Copy link

Caution

Notebooks or Frontmatter Files Have Been Modified

  • Please ensure that a frontmatter.md file is accompanying the notebook file, and that the frontmatter is up to date.
  • These changes will be published to the developer portal tutorials only if frontmatter.md is included.
  • Proofread all changes before merging, as changes to notebook and frontmatter content will update the developer tutorial.

21 Notebook Files Modified:

Notebook File Frontmatter Included?
awsbedrock/fts/RAG_with_Couchbase_and_Bedrock.ipynb
awsbedrock/gsi/RAG_with_Couchbase_and_Bedrock.ipynb
azure/fts/RAG_with_Couchbase_and_AzureOpenAI.ipynb
azure/gsi/RAG_with_Couchbase_and_AzureOpenAI.ipynb
claudeai/fts/RAG_with_Couchbase_and_Claude(by_Anthropic).ipynb
claudeai/gsi/RAG_with_Couchbase_and_Claude(by_Anthropic).ipynb
cohere/fts/RAG_with_Couchbase_and_Cohere.ipynb
cohere/gsi/RAG_with_Couchbase_and_Cohere.ipynb
crewai-short-term-memory/fts/CouchbaseStorage_Demo.ipynb
crewai-short-term-memory/gsi/CouchbaseStorage_Demo.ipynb
crewai/fts/RAG_with_Couchbase_and_CrewAI.ipynb
crewai/gsi/RAG_with_Couchbase_and_CrewAI.ipynb
haystack/fts/RAG_with_Couchbase_Capella_and_OpenAI.ipynb
haystack/gsi/RAG_with_Couchbase_Capella_and_OpenAI.ipynb
huggingface/fts/hugging_face.ipynb
jinaai/fts/RAG_with_Couchbase_and_Jina_AI.ipynb
jinaai/gsi/RAG_with_Couchbase_and_Jina_AI.ipynb
llamaindex/fts/RAG_with_Couchbase_Capella_and_OpenAI.ipynb
llamaindex/gsi/RAG_with_Couchbase_Capella_and_OpenAI.ipynb
openrouter-deepseek/fts/RAG_with_Couchbase_and_Openrouter_Deepseek.ipynb
openrouter-deepseek/gsi/RAG_with_Couchbase_and_Openrouter_Deepseek.ipynb

3 Frontmatter Files Modified:

Frontmatter File
haystack/fts/frontmatter.md
haystack/gsi/frontmatter.md
llamaindex/fts/frontmatter.md
Note: frontmatter will be checked and tested in the Test Frontmatter workflow.

@gemini-code-assist
Copy link

Summary of Changes

Hello @VirajAgarwal1, I'm Gemini Code Assist1! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request primarily focuses on enhancing the accuracy and consistency of documentation within the vector-search-cookbook repository. It involves a widespread update of internal links in Jupyter notebooks to ensure they correctly reference the FTS or GSI versions of tutorials. Additionally, a significant structural change includes renaming the lamaindex directory to llamaindex to align with standard naming conventions. These changes collectively improve the user experience by providing precise navigation and clearer distinctions between different search service implementations.

Highlights

  • Link Corrections in Jupyter Notebooks: Numerous Jupyter notebooks across various integration examples (AWS Bedrock, Azure OpenAI, Claude AI, Cohere, CrewAI, Hugging Face, Jina AI, OpenRouter Deepseek) have had their internal tutorial links updated to correctly point to their respective FTS (Full-Text Search) or GSI (Global Secondary Index) paths within the repository.
  • Metadata and Path Updates for Haystack Tutorials: The frontmatter.md files for Haystack FTS and GSI tutorials have been updated to explicitly reflect the use of FTS or GSI services in their titles, short titles, and descriptions, along with corresponding path adjustments.
  • LlamaIndex Directory Renaming: The directory lamaindex has been renamed to llamaindex, affecting all contained Jupyter notebooks and metadata files.
  • Consistency in Tutorial Links: Several GSI-based Jupyter notebooks now include explicit links to their FTS counterparts, and vice-versa, improving navigation and discoverability between related tutorials.
  • Minor Formatting Fixes: Trailing newlines have been removed from the end of several Jupyter notebook files, ensuring cleaner file formatting.
Using Gemini Code Assist

The full guide for Gemini Code Assist can be found on our documentation page, here are some quick tips.

Invoking Gemini

You can request assistance from Gemini at any point by creating a comment using either /gemini <command> or @gemini-code-assist <command>. Below is a summary of the supported commands on the current page.

Feature Command Description
Code Review /gemini review Performs a code review for the current pull request in its current state.
Pull Request Summary /gemini summary Provides a summary of the current pull request in its current state.
Comment @gemini-code-assist Responds in comments when explicitly tagged, both in pull request comments and review comments.
Help /gemini help Displays a list of available commands.

Customization

To customize Gemini Code Assist for GitHub experience, repository maintainers can create a configuration file and/or provide a custom code review style guide (such as PEP-8 for Python) by creating and adding files to a .gemini/ folder in the base of the repository. Detailed instructions can be found here.

Limitations & Feedback

Gemini Code Assist may make mistakes. Please leave feedback on any instances where its feedback is incorrect or counter productive. You can react with 👍 and 👎 on @gemini-code-assist comments. If you're interested in giving your feedback about your experience with Gemini Code Assist for Github and other Google products, sign up here.

You can also get AI-powered code generation, chat, as well as code reviews directly in the IDE at no cost with the Gemini Code Assist IDE Extension.

Footnotes

  1. Review the Privacy Notices, Generative AI Prohibited Use Policy, Terms of Service, and learn how to configure Gemini Code Assist in GitHub here. Gemini can make mistakes, so double check it and use code with caution.

Copy link

@gemini-code-assist gemini-code-assist bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Code Review

This pull request introduces a series of minor fixes across multiple Jupyter notebooks and markdown files. The changes primarily involve updating URLs to reflect new file paths, correcting a typo in the llamaindex directory name, and improving the clarity of metadata in frontmatter.md files. Additionally, newlines have been added to some files for better formatting. My review focuses on improving the consistency and readability of the user-facing content. I've suggested capitalizing 'OpenAI' for proper branding in some short titles and using more descriptive link text in a few notebooks.

"metadata": {},
"source": [
"This tutorial shows how to implement a custom memory backend for CrewAI agents using Couchbase's high-performance GSI (Global Secondary Index) vector search. CrewAI agents can retain and recall information across interactions, making them more contextually aware and effective. We'll demonstrate measurable performance improvements with GSI optimization.\n",
"This tutorial shows how to implement a custom memory backend for CrewAI agents using Couchbase's high-performance GSI (Global Secondary Index) vector search. CrewAI agents can retain and recall information across interactions, making them more contextually aware and effective. We'll demonstrate measurable performance improvements with GSI optimization. Alternatively if you want to perform semantic search using the FTS, please take a look at [this.](https://developer.couchbase.com/tutorial-crewai-short-term-memory-couchbase-with-fts)\n",

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

medium

The link text this. is not very descriptive. Consider using more descriptive text like this tutorial to improve readability and accessibility.

Suggested change
"This tutorial shows how to implement a custom memory backend for CrewAI agents using Couchbase's high-performance GSI (Global Secondary Index) vector search. CrewAI agents can retain and recall information across interactions, making them more contextually aware and effective. We'll demonstrate measurable performance improvements with GSI optimization. Alternatively if you want to perform semantic search using the FTS, please take a look at [this.](https://developer.couchbase.com/tutorial-crewai-short-term-memory-couchbase-with-fts)\n",
"This tutorial shows how to implement a custom memory backend for CrewAI agents using Couchbase's high-performance GSI (Global Secondary Index) vector search. CrewAI agents can retain and recall information across interactions, making them more contextually aware and effective. We'll demonstrate measurable performance improvements with GSI optimization. Alternatively if you want to perform semantic search using the FTS, please take a look at [this tutorial.](https://developer.couchbase.com/tutorial-crewai-short-term-memory-couchbase-with-fts)\n",

"metadata": {},
"source": [
"In this guide, we will walk you through building a powerful semantic search engine using [Couchbase](https://www.couchbase.com) as the backend database and [CrewAI](https://github.com/crewAIInc/crewAI) for agent-based RAG operations. CrewAI allows us to create specialized agents that can work together to handle different aspects of the RAG workflow, from document retrieval to response generation. This tutorial uses Couchbase's **Global Secondary Index (GSI)** vector search capabilities, which offer high-performance vector search optimized for large-scale applications. This tutorial is designed to be beginner-friendly, with clear, step-by-step instructions that will equip you with the knowledge to create a fully functional semantic search system from scratch. Alternatively if you want to perform semantic search using the FTS index, please take a look at [this.](https://developer.couchbase.com/tutorial-crewai-couchbase-rag-using-fts)"
"In this guide, we will walk you through building a powerful semantic search engine using [Couchbase](https://www.couchbase.com) as the backend database and [CrewAI](https://github.com/crewAIInc/crewAI) for agent-based RAG operations. CrewAI allows us to create specialized agents that can work together to handle different aspects of the RAG workflow, from document retrieval to response generation. This tutorial uses Couchbase's **Global Secondary Index (GSI)** vector search capabilities, which offer high-performance vector search optimized for large-scale applications. This tutorial is designed to be beginner-friendly, with clear, step-by-step instructions that will equip you with the knowledge to create a fully functional semantic search system from scratch. Alternatively if you want to perform semantic search using the FTS index, please take a look at [this.](https://developer.couchbase.com/tutorial-crewai-couchbase-rag-with-fts/)"

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

medium

The link text this. is not very descriptive. Consider using more descriptive text like this tutorial to improve readability and accessibility.

Suggested change
"In this guide, we will walk you through building a powerful semantic search engine using [Couchbase](https://www.couchbase.com) as the backend database and [CrewAI](https://github.com/crewAIInc/crewAI) for agent-based RAG operations. CrewAI allows us to create specialized agents that can work together to handle different aspects of the RAG workflow, from document retrieval to response generation. This tutorial uses Couchbase's **Global Secondary Index (GSI)** vector search capabilities, which offer high-performance vector search optimized for large-scale applications. This tutorial is designed to be beginner-friendly, with clear, step-by-step instructions that will equip you with the knowledge to create a fully functional semantic search system from scratch. Alternatively if you want to perform semantic search using the FTS index, please take a look at [this.](https://developer.couchbase.com/tutorial-crewai-couchbase-rag-with-fts/)"
"In this guide, we will walk you through building a powerful semantic search engine using [Couchbase](https://www.couchbase.com) as the backend database and [CrewAI](https://github.com/crewAIInc/crewAI) for agent-based RAG operations. CrewAI allows us to create specialized agents that can work together to handle different aspects of the RAG workflow, from document retrieval to response generation. This tutorial uses Couchbase's **Global Secondary Index (GSI)** vector search capabilities, which offer high-performance vector search optimized for large-scale applications. This tutorial is designed to be beginner-friendly, with clear, step-by-step instructions that will equip you with the knowledge to create a fully functional semantic search system from scratch. Alternatively if you want to perform semantic search using the FTS index, please take a look at [this tutorial.](https://developer.couchbase.com/tutorial-crewai-couchbase-rag-with-fts/)"

"- OpenAI for embeddings and text generation\n",
"\n",
"The system allows users to ask questions about current events and get AI-generated answers based on the latest news articles."
"The system allows users to ask questions about current events and get AI-generated answers based on the latest news articles. Alternatively if you want to perform semantic search using the GSI index, please take a look at [this.](https://developer.couchbase.com/tutorial-openai-haystack-rag-with-global-secondary-index)\n",

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

medium

The link text this. is not very descriptive. Consider using more descriptive text like this tutorial to improve readability and accessibility.

Suggested change
"The system allows users to ask questions about current events and get AI-generated answers based on the latest news articles. Alternatively if you want to perform semantic search using the GSI index, please take a look at [this.](https://developer.couchbase.com/tutorial-openai-haystack-rag-with-global-secondary-index)\n",
"The system allows users to ask questions about current events and get AI-generated answers based on the latest news articles. Alternatively if you want to perform semantic search using the GSI index, please take a look at [this tutorial.](https://developer.couchbase.com/tutorial-openai-haystack-rag-with-global-secondary-index)\n",

short_title: "RAG with Openai and Haystack"
path: "/tutorial-openai-haystack-rag-with-fts"
title: "Retrieval-Augmented Generation (RAG) with OpenAI and Haystack using FTS Service"
short_title: "RAG with Openai and Haystack using FTS"

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

medium

For consistency with the title field and proper branding, Openai should be capitalized to OpenAI.

Suggested change
short_title: "RAG with Openai and Haystack using FTS"
short_title: "RAG with OpenAI and Haystack using FTS"

"- OpenAI for embeddings and text generation\n",
"\n",
"We leverage Couchbase's Global Secondary Index (GSI) vector search capabilities to create and manage vector indexes, enabling efficient semantic search capabilities. GSI provides high-performance vector search with support for both Hyperscale Vector Indexes and Composite Vector Indexes, designed to scale to billions of vectors with low memory footprint and optimized concurrent operations.\n",
"We leverage Couchbase's Global Secondary Index (GSI) vector search capabilities to create and manage vector indexes, enabling efficient semantic search capabilities. GSI provides high-performance vector search with support for both Hyperscale Vector Indexes and Composite Vector Indexes, designed to scale to billions of vectors with low memory footprint and optimized concurrent operations. Alternatively if you want to perform semantic search using the FTS, please take a look at [this.](https://developer.couchbase.com/tutorial-openai-haystack-rag-with-fts)\n",

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

medium

The link text this. is not very descriptive. Consider using more descriptive text like this tutorial to improve readability and accessibility.

Suggested change
"We leverage Couchbase's Global Secondary Index (GSI) vector search capabilities to create and manage vector indexes, enabling efficient semantic search capabilities. GSI provides high-performance vector search with support for both Hyperscale Vector Indexes and Composite Vector Indexes, designed to scale to billions of vectors with low memory footprint and optimized concurrent operations. Alternatively if you want to perform semantic search using the FTS, please take a look at [this.](https://developer.couchbase.com/tutorial-openai-haystack-rag-with-fts)\n",
"We leverage Couchbase's Global Secondary Index (GSI) vector search capabilities to create and manage vector indexes, enabling efficient semantic search capabilities. GSI provides high-performance vector search with support for both Hyperscale Vector Indexes and Composite Vector Indexes, designed to scale to billions of vectors with low memory footprint and optimized concurrent operations. Alternatively if you want to perform semantic search using the FTS, please take a look at [this tutorial.](https://developer.couchbase.com/tutorial-openai-haystack-rag-with-fts)\n",

short_title: "RAG with Openai and Haystack"
path: "/tutorial-openai-haystack-rag-with-global-secondary-index"
title: "Retrieval-Augmented Generation (RAG) with OpenAI and Haystack with GSI"
short_title: "RAG with Openai and Haystack with GSI"

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

medium

For consistency with the title field and proper branding, Openai should be capitalized to OpenAI.

Suggested change
short_title: "RAG with Openai and Haystack with GSI"
short_title: "RAG with OpenAI and Haystack with GSI"

"metadata": {},
"source": [
"This tutorial demonstrates building a high-performance semantic search engine using Couchbase's GSI (Global Secondary Index) vector search and Jina AI for embeddings and language models. We'll show measurable performance improvements with GSI optimization and implement a complete RAG (Retrieval-Augmented Generation) system.\n",
"This tutorial demonstrates building a high-performance semantic search engine using Couchbase's GSI (Global Secondary Index) vector search and Jina AI for embeddings and language models. We'll show measurable performance improvements with GSI optimization and implement a complete RAG (Retrieval-Augmented Generation) system. Alternatively if you want to perform semantic search using the FTS, please take a look at [this.](https://developer.couchbase.com/tutorial-jina-couchbase-rag-with-fts)\n",

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

medium

The link text this. is not very descriptive. Consider using more descriptive text like this tutorial to improve readability and accessibility.

Suggested change
"This tutorial demonstrates building a high-performance semantic search engine using Couchbase's GSI (Global Secondary Index) vector search and Jina AI for embeddings and language models. We'll show measurable performance improvements with GSI optimization and implement a complete RAG (Retrieval-Augmented Generation) system. Alternatively if you want to perform semantic search using the FTS, please take a look at [this.](https://developer.couchbase.com/tutorial-jina-couchbase-rag-with-fts)\n",
"This tutorial demonstrates building a high-performance semantic search engine using Couchbase's GSI (Global Secondary Index) vector search and Jina AI for embeddings and language models. We'll show measurable performance improvements with GSI optimization and implement a complete RAG (Retrieval-Augmented Generation) system. Alternatively if you want to perform semantic search using the FTS, please take a look at [this tutorial.](https://developer.couchbase.com/tutorial-jina-couchbase-rag-with-fts)\n",

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants