fix:stmt2 memleak #34169

Pengrongkun · 2026-01-05T08:28:09Z

Description

Issue(s)

Close/close/Fix/fix/Resolve/resolve: Issue Link

Checklist

Please check the items in the checklist if applicable.

Is the user manual updated?
Are the test cases passed and automated?
Is there no significant decrease in test coverage?

gemini-code-assist · 2026-01-05T08:28:45Z

Summary of Changes

Hello @Pengrongkun, I'm Gemini Code Assist¹! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request primarily focuses on resolving a memory leak within the stmt2 component. The changes involve a significant refactoring of how bound column values are managed, transitioning from array-based storage to hash maps to facilitate better ownership transfer and deallocation. Key cleanup functions have been centralized and enhanced, and the statement reset logic has been streamlined to ensure all associated resources are properly freed, thereby preventing memory leaks. A new test case has also been added to validate the corrected behavior.

Highlights

Memory Leak Resolution: Addresses a memory leak in the stmt2 component by ensuring proper resource deallocation, particularly for column values and bound information, throughout the statement lifecycle.
Data Structure Modernization: Transitions the internal handling of bound column values from SArray to SSHashObj (hash map) for improved management, ownership transfer, and efficient lookup.
Cleanup Function Centralization and Enhancement: The qDestroyBoundColInfo function has been moved to a more central utility and enhanced to provide a comprehensive cleanup mechanism for both bound column and associated tag information.
Refactored Statement Reset Logic: The stmtDeepReset function has been significantly refactored to streamline resource cleanup and reinitialization, improving code maintainability and correctness by consolidating redundant deallocation logic.

🧠 New Feature in Public Preview: You can now enable Memory to help Gemini Code Assist learn from your team's feedback. This makes future code reviews more consistent and personalized to your project's style. Click here to enable Memory in your admin console.

Using Gemini Code Assist

The full guide for Gemini Code Assist can be found on our documentation page, here are some quick tips.

Invoking Gemini

You can request assistance from Gemini at any point by creating a comment using either /gemini <command> or @gemini-code-assist <command>. Below is a summary of the supported commands on the current page.

Feature	Command	Description
Code Review	`/gemini review`	Performs a code review for the current pull request in its current state.
Pull Request Summary	`/gemini summary`	Provides a summary of the current pull request in its current state.
Comment	@gemini-code-assist	Responds in comments when explicitly tagged, both in pull request comments and review comments.
Help	`/gemini help`	Displays a list of available commands.

Customization

To customize Gemini Code Assist for GitHub experience, repository maintainers can create a configuration file and/or provide a custom code review style guide (such as PEP-8 for Python) by creating and adding files to a .gemini/ folder in the base of the repository. Detailed instructions can be found here.

Limitations & Feedback

Gemini Code Assist may make mistakes. Please leave feedback on any instances where its feedback is incorrect or counter productive. You can react with 👍 and 👎 on @gemini-code-assist comments. If you're interested in giving your feedback about your experience with Gemini Code Assist for Github and other Google products, sign up here.

You can also get AI-powered code generation, chat, as well as code reviews directly in the IDE at no cost with the Gemini Code Assist IDE Extension.

Review the Privacy Notices, Generative AI Prohibited Use Policy, Terms of Service, and learn how to configure Gemini Code Assist in GitHub here. Gemini can make mistakes, so double check it and use code with caution. ↩

Copilot

Pull request overview

This PR fixes memory leaks in the stmt2 (prepared statement version 2) implementation by improving memory cleanup across multiple components.

Key Changes:

Moved qDestroyBoundColInfo from parser library to qcom library and enhanced it to properly clean up tag information
Changed parsed values data structure from SArray to SSHashObj (hash map) for more efficient storage and cleanup
Added missing memory cleanup calls for aRowP array and pCreateTbReq structures

Reviewed changes

Copilot reviewed 12 out of 12 changed files in this pull request and generated 3 comments.

Show a summary per file

File	Description
source/libs/qcom/src/queryUtil.c	Added `destroySTagsInfo` and `qDestroyBoundColInfo` functions to properly clean up bound column info including nested tag data
source/libs/parser/src/parInsertUtil.c	Removed `qDestroyBoundColInfo` (moved to qcom), made `destroyColVal` non-static for reuse, added `taosArrayDestroy` call for `aRowP`
source/libs/parser/src/parInsertSql.c	Changed `pParsedValues` from `SArray` to `SSHashObj`, updated all references to use `qDestroyBoundColInfo` and hash operations
source/libs/parser/src/parInsertSml.c	Updated to call `qDestroyBoundColInfo` instead of `insDestroyBoundColInfo`
source/libs/parser/inc/parInsertUtil.h	Removed `insDestroyBoundColInfo` declaration (no longer needed)
source/client/src/clientStmt2.c	Renamed `boundCols` to `fixedValueCols`, implemented ownership transfer pattern, added cleanup for `pCreateTbReq`, improved `stmtDeepReset` cleanup logic
source/client/src/clientStmt.c	Updated function signature for `stmtUpdateInfo` to use `SSHashObj*` instead of `SArray`
source/client/inc/clientStmt.h	Updated field name from `boundCols` to `fixedValueCols` with clarified comment
source/client/test/stmt2Test.cpp	Added test case for issue TD-6581610626 to verify fix, removed commented-out code
include/libs/qcom/query.h	Added `qDestroyBoundColInfo` function declaration
include/libs/parser/parser.h	Removed `qDestroyBoundColInfo` declaration (moved to qcom), updated `SStmtCallback` signature
include/common/tdataformat.h	Added `destroyColVal` function declaration for external use

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

source/libs/qcom/src/queryUtil.c

source/libs/parser/src/parInsertSql.c

gemini-code-assist

Code Review

This pull request effectively addresses a memory leak in stmt2 by improving memory management practices across several files. Key changes include:

Refactoring cleanup logic in stmtDeepReset to use a centralized stmtCleanSQLInfo function, which improves code maintainability.
Correctly freeing pCreateTbReq in stmtSetTbTags2 to prevent leaks.
Fixing a memory leak in checkAndMergeSVgroupDataCxtByTbname by destroying aRowP.
Implementing a more complete cleanup in qDestroyBoundColInfo to free parseredTags, which was a source of memory leaks.
Changing pParsedValues from SArray to SSHashObj and setting a proper free function, enhancing both performance and memory safety.

The changes are well-implemented and consistent. The addition of a new test case for the fixed issue is also a good practice.

gemini-code-assist · 2026-01-05T08:31:34Z

source/libs/qcom/src/queryUtil.c

+void qDestroyBoundColInfo(void* pInfo) {
+  if (NULL == pInfo) {
+    return;
+  }
+  SBoundColInfo* pBoundInfo = (SBoundColInfo*)pInfo;
+
+  taosMemoryFreeClear(pBoundInfo->pColIndex);
+  destroySTagsInfo(pBoundInfo->parseredTags);
+}


This is a crucial fix for a memory leak. The previous implementation of qDestroyBoundColInfo (or insDestroyBoundColInfo) only freed pColIndex. By adding the call to destroySTagsInfo(pBoundInfo->parseredTags), you are now correctly cleaning up the parseredTags and its contents, which was a source of the memory leak. The new destroySTagsInfo function also appears to be comprehensive in its cleanup.

gemini-code-assist · 2026-01-05T08:31:34Z

source/libs/parser/src/parInsertUtil.c


    parserDebug("merge same uid data: %" PRId64 ", vgId:%d", pTbCtx->pData->uid, pVgCxt->vgId);

+    taosArrayDestroy(pTbCtx->pData->aRowP);


Adding taosArrayDestroy(pTbCtx->pData->aRowP); here is a critical fix. Previously, aRowP was not being destroyed when merging data for the same table, leading to a memory leak. This change correctly frees the array.

gemini-code-assist · 2026-01-05T08:31:34Z

source/client/src/clientStmt2.c

  if (pStmt->sql.fixValueTags) {
+    pStmt->sql.fixValueTags = false;
    tdDestroySVCreateTbReq(pStmt->sql.fixValueTbReq);
+    pStmt->sql.fixValueTbReq = NULL;
  }


Good improvement. Resetting fixValueTags to false and nulling out fixValueTbReq after freeing it is a good defensive programming practice. It prevents potential double-frees or use-after-free issues if stmtCleanSQLInfo is called multiple times.

gemini-code-assist · 2026-01-05T08:31:34Z

source/client/src/clientStmt2.c

 static int32_t stmtDeepReset(STscStmt2* pStmt) {
+  // Save state that needs to be preserved
  char*             db = pStmt->db;
  TAOS_STMT2_OPTION options = pStmt->options;
  uint32_t          reqid = pStmt->reqid;
+  bool              stbInterlaceMode = pStmt->stbInterlaceMode;

  pStmt->errCode = 0;
+
+  // Wait for async execution to complete
  if (pStmt->options.asyncExecFn && !pStmt->execSemWaited) {
    if (tsem_wait(&pStmt->asyncExecSem) != 0) {
      STMT2_ELOG_E("bind param wait asyncExecSem failed");
    }
    pStmt->execSemWaited = true;
  }

-  if (pStmt->stbInterlaceMode) {
-    if (pStmt->bindThreadInUse) {
-      while (0 == atomic_load_8((int8_t*)&pStmt->sql.siInfo.tableColsReady)) {
-        taosUsleep(1);
-      }
-      (void)taosThreadMutexLock(&pStmt->queue.mutex);
-      pStmt->queue.stopQueue = true;
-      (void)taosThreadCondSignal(&(pStmt->queue.waitCond));
-      (void)taosThreadMutexUnlock(&pStmt->queue.mutex);
-
-      (void)taosThreadJoin(pStmt->bindThread, NULL);
-      pStmt->bindThreadInUse = false;
-      pStmt->queue.head = NULL;
-      pStmt->queue.tail = NULL;
-      pStmt->queue.qRemainNum = 0;
-
-      (void)taosThreadCondDestroy(&pStmt->queue.waitCond);
-      (void)taosThreadMutexDestroy(&pStmt->queue.mutex);
+  // Stop bind thread if in use (similar to stmtClose2)
+  if (stbInterlaceMode && pStmt->bindThreadInUse) {
+    while (0 == atomic_load_8((int8_t*)&pStmt->sql.siInfo.tableColsReady)) {
+      taosUsleep(1);
    }
-  }
-
-  pStmt->sql.autoCreateTbl = false;
-  taosMemoryFree(pStmt->sql.pBindInfo);
-  pStmt->sql.pBindInfo = NULL;
-
-  taosMemoryFree(pStmt->sql.queryRes.fields);
-  pStmt->sql.queryRes.fields = NULL;
-
-  taosMemoryFree(pStmt->sql.queryRes.userFields);
-  pStmt->sql.queryRes.userFields = NULL;
-
-  pStmt->sql.type = 0;
-  pStmt->sql.runTimes = 0;
-  taosMemoryFree(pStmt->sql.sqlStr);
-  pStmt->sql.sqlStr = NULL;
-
-  qDestroyQuery(pStmt->sql.pQuery);
-  pStmt->sql.pQuery = NULL;
-
-  taosArrayDestroy(pStmt->sql.nodeList);
-  pStmt->sql.nodeList = NULL;
-
-  taosHashCleanup(pStmt->sql.pVgHash);
-  pStmt->sql.pVgHash = NULL;
-
-  if (pStmt->sql.fixValueTags) {
-    tdDestroySVCreateTbReq(pStmt->sql.fixValueTbReq);
-    pStmt->sql.fixValueTbReq = NULL;
-  }
-  pStmt->sql.fixValueTags = false;
-
-  void* pIter = taosHashIterate(pStmt->sql.pTableCache, NULL);
-  while (pIter) {
-    SStmtTableCache* pCache = (SStmtTableCache*)pIter;
+    (void)taosThreadMutexLock(&pStmt->queue.mutex);
+    pStmt->queue.stopQueue = true;
+    (void)taosThreadCondSignal(&(pStmt->queue.waitCond));
+    (void)taosThreadMutexUnlock(&pStmt->queue.mutex);

-    qDestroyStmtDataBlock(pCache->pDataCtx);
-    qDestroyBoundColInfo(pCache->boundTags);
-    taosMemoryFreeClear(pCache->boundTags);
+    (void)taosThreadJoin(pStmt->bindThread, NULL);
+    pStmt->bindThreadInUse = false;
+    pStmt->queue.head = NULL;
+    pStmt->queue.tail = NULL;
+    pStmt->queue.qRemainNum = 0;

-    pIter = taosHashIterate(pStmt->sql.pTableCache, pIter);
+    (void)taosThreadCondDestroy(&pStmt->queue.waitCond);
+    (void)taosThreadMutexDestroy(&pStmt->queue.mutex);
  }
-  taosHashCleanup(pStmt->sql.pTableCache);

-  if (pStmt->sql.stbInterlaceMode) {
+  // Clean all SQL and execution info (stmtCleanSQLInfo already handles most cleanup)
+  pStmt->bInfo.boundColsCached = false;
+  if (stbInterlaceMode) {
    pStmt->bInfo.tagsCached = false;
  }
-  STMT_ERR_RET(stmtCleanExecInfo(pStmt, false, true));
-
-  resetRequest(pStmt);
-
-  if (pStmt->sql.siInfo.pTableCols) {
-    taosArrayDestroyEx(pStmt->sql.siInfo.pTableCols, stmtFreeTbCols);
-    pStmt->sql.siInfo.pTableCols = NULL;
-  }
-
-  if (pStmt->sql.siInfo.tbBuf.pBufList) {
-    taosArrayDestroyEx(pStmt->sql.siInfo.tbBuf.pBufList, stmtFreeTbBuf);
-    pStmt->sql.siInfo.tbBuf.pBufList = NULL;
-  }
-
-  if (pStmt->sql.siInfo.pTableHash) {
-    tSimpleHashCleanup(pStmt->sql.siInfo.pTableHash);
-    pStmt->sql.siInfo.pTableHash = NULL;
-  }
-
-  if (pStmt->sql.siInfo.pTableRowDataHash) {
-    tSimpleHashCleanup(pStmt->sql.siInfo.pTableRowDataHash);
-    pStmt->sql.siInfo.pTableRowDataHash = NULL;
-  }
-
-  if (pStmt->sql.siInfo.pVgroupHash) {
-    taosHashCleanup(pStmt->sql.siInfo.pVgroupHash);
-    pStmt->sql.siInfo.pVgroupHash = NULL;
-  }
-
-  if (pStmt->sql.siInfo.pVgroupList) {
-    taosArrayDestroy(pStmt->sql.siInfo.pVgroupList);
-    pStmt->sql.siInfo.pVgroupList = NULL;
-  }
-
-  if (pStmt->sql.siInfo.pDataCtx) {
-    qDestroyStmtDataBlock(pStmt->sql.siInfo.pDataCtx);
-    pStmt->sql.siInfo.pDataCtx = NULL;
-  }
+  STMT_ERR_RET(stmtCleanSQLInfo(pStmt));

-  if (pStmt->sql.siInfo.pTSchema) {
-    taosMemoryFree(pStmt->sql.siInfo.pTSchema);
-    pStmt->sql.siInfo.pTSchema = NULL;
-  }
+  // Reinitialize resources (similar to stmtInit2)
+  if (stbInterlaceMode) {
+    pStmt->sql.siInfo.transport = pStmt->taos->pAppInfo->pTransporter;
+    pStmt->sql.siInfo.acctId = pStmt->taos->acctId;
+    pStmt->sql.siInfo.dbname = pStmt->taos->db;
+    pStmt->sql.siInfo.mgmtEpSet = getEpSet_s(&pStmt->taos->pAppInfo->mgmtEp);

-  if (pStmt->sql.siInfo.pRequest) {
-    taos_free_result(pStmt->sql.siInfo.pRequest);
-    pStmt->sql.siInfo.pRequest = NULL;
-  }
+    if (NULL == pStmt->pCatalog) {
+      STMT_ERR_RET(catalogGetHandle(pStmt->taos->pAppInfo->clusterId, &pStmt->pCatalog));
+    }
+    pStmt->sql.siInfo.pCatalog = pStmt->pCatalog;

-  if (pStmt->stbInterlaceMode) {
    STMT_ERR_RET(stmtResetStbInterlaceCache(pStmt));
+
+    int32_t code = stmtIniAsyncBind(pStmt);
+    if (TSDB_CODE_SUCCESS != code) {
+      STMT2_ELOG("fail to reinit async bind in stmtDeepReset:%s", tstrerror(code));
+      return code;
+    }
  }

+  // Restore preserved state
  pStmt->db = db;
  pStmt->options = options;
  pStmt->reqid = reqid;
+  pStmt->stbInterlaceMode = stbInterlaceMode;

  pStmt->sql.pTableCache = taosHashInit(100, taosGetDefaultHashFunction(TSDB_DATA_TYPE_BIGINT), false, HASH_NO_LOCK);
  if (NULL == pStmt->sql.pTableCache) {
-    STMT2_ELOG("fail to allocate memory for pTableCache in stmtResetStmt:%s", tstrerror(terrno));
+    STMT2_ELOG("fail to allocate memory for pTableCache in stmtDeepReset:%s", tstrerror(terrno));
    return terrno;
  }

+  pStmt->bInfo.needParse = true;
  pStmt->sql.status = STMT_INIT;
+  pStmt->sql.siInfo.tableColsReady = true;

  return TSDB_CODE_SUCCESS;
 }


The refactoring of stmtDeepReset to use stmtCleanSQLInfo is a good improvement. It centralizes cleanup logic and removes a significant amount of duplicated code, which enhances maintainability and reduces the risk of introducing new bugs during future modifications.

Pengrongkun added 5 commits January 4, 2026 14:54

fix stmt2 fixed value bind memory error

0cf746b

fix memleak 1.0

c13c4a9

fix memleak 2.0

b653837

fix memleak 3.0

f0e2280

fix memleak 4.0

77cac20

Copilot AI review requested due to automatic review settings January 5, 2026 08:28

Pengrongkun requested review from dapan1121, guanshengliang and hzcheng as code owners January 5, 2026 08:28

taosdata-bot bot added the team engine label Jan 5, 2026

Copilot started reviewing on behalf of Pengrongkun January 5, 2026 08:28 View session

Copilot AI reviewed Jan 5, 2026

View reviewed changes

source/libs/qcom/src/queryUtil.c Show resolved Hide resolved

source/libs/parser/src/parInsertSql.c Show resolved Hide resolved

source/libs/parser/src/parInsertSql.c Show resolved Hide resolved

gemini-code-assist bot reviewed Jan 5, 2026

View reviewed changes

Pengrongkun added 3 commits January 6, 2026 09:26

fix review

781c7a5

Merge remote-tracking branch 'origin/main' into fix/main/6581610626

1828e45

fix CI

95008e5

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix:stmt2 memleak #34169

fix:stmt2 memleak #34169

Pengrongkun commented Jan 5, 2026

Uh oh!

gemini-code-assist bot commented Jan 5, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

gemini-code-assist bot Jan 5, 2026

Uh oh!

gemini-code-assist bot Jan 5, 2026

Uh oh!

gemini-code-assist bot Jan 5, 2026

Uh oh!

gemini-code-assist bot Jan 5, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants


		parserDebug("merge same uid data: %" PRId64 ", vgId:%d", pTbCtx->pData->uid, pVgCxt->vgId);

		taosArrayDestroy(pTbCtx->pData->aRowP);

fix:stmt2 memleak #34169

Are you sure you want to change the base?

fix:stmt2 memleak #34169

Conversation

Pengrongkun commented Jan 5, 2026

Description

Issue(s)

Checklist

Uh oh!

gemini-code-assist bot commented Jan 5, 2026

Summary of Changes

Highlights

Footnotes

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist bot Jan 5, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist bot Jan 5, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist bot Jan 5, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist bot Jan 5, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants