Andrey Karpov

May 07 2025

Tags:

#Cpp #Embedded

Why SSDLC needs static analysis: a case study of 190 bugs in TDengine

May 07 2025

Author: Andrey Karpov

130 shades of null pointers
Resource leaks
Buffer/array overflow
Typos
Other errors
Conclusion
Additional links

Static code analysis is one of the most important components of secure software development. It detects errors and potential vulnerabilities early in the development process, when they're cheaper and easier to fix. It also enables developers to detect security issues and flaws that they aren't aware of.

This article provides examples to show why static analysis is important, rather than simply speculates on the idea of using analyzers for safety. So, let's take a practical look at how static analysis can make your code safer, more reliable, and neater.

We'll continue examining the TDengine project, which we've covered in three small notes on code refactoring:

Breaking down bugs in TDengine to master refactoring, part 1: sausage code.
Breaking down bugs in TDengine to master refactoring, part 2: stack-consuming macro.
Breaking down bugs in TDengine to master refactoring, part 3: price of laziness.

TDengine is a database designed for IoT systems, where reliability and security are especially critical, which makes this project stand-out more. Checking the code using PVS-Studio static analyzer was especially interesting since it detects not only typos but also potential vulnerabilities.

TDengine is an open source, high-performance, cloud native time-series database optimized for Internet of Things (IoT), Connected Cars, and Industrial IoT. It enables efficient, real-time data ingestion, processing, and monitoring of TB and even PB scale data per day, generated by billions of sensors and data collectors.

The fourth part of the series is released much long after the project check. So, some code snippets may look different now, and some errors may already be fixed. However, it's not a big deal—this article is intended to highlight the value of static analysis as a practice, rather than find and fix as many errors as possible. Such one-time checks show the capabilities of PVS-Studio analyzer but they don't contribute meaningfully to long-term quality and reliability of the project. Static analysis should be used regularly. Introduce Static Analysis in the Process, Don't Just Search for Bugs with It.

Usually we only check the project code, discarding third-party libraries that it uses. However, this time I deliberately checked the entire codebase with external libraries. I'd like you to think over the following take:

From the user's point of view, it doesn't matter whether a bug or vulnerability originates in our own code or in the code of a library we use. If we use third-party code, we take responsibility for it. Errors and issues in third-party code become ours.

It is useful, and sometimes necessary, to perform static analysis of the third-party components that we use:

This will help us choose safer and more reliable libraries for future use;
We can proactively find and eliminate zero-day vulnerabilities that might affect the product reputation. Errors from third-party components don't seem to be our fault, but this fact doesn't make it any easier.
By fixing bugs in third-party libraries, we contribute to the development of open-source code.

130 shades of null pointers

Dereferencing a null pointer is a very frequent error. In this regard, the TDengine project code is no exception.

Error N1. Selection error

The taosArrayGetLast function can return NULL:

void* taosArrayGetLast(const SArray* pArray) {
  if (pArray->size == 0) {
    terrno = TSDB_CODE_INVALID_PARA;
    return NULL;
  }

  return TARRAY_GET_ELEM(pArray, pArray->size - 1);
}

The author of the following code attempted to handle this scenario, but failed.

static int32_t walInitWriteFile(SWal *pWal) {
  int64_t       fileFirstVer = -1;
  ....
  SWalFileInfo *pRet = taosArrayGetLast(pWal->fileInfoSet);
  if (pRet == NULL) {
    fileFirstVer = pWal->vers.lastVer + 1;
  }
  fileFirstVer = pRet->firstVer;
  ....
}

Regardless of the pRet pointer value, the pointer is still dereferenced. In addition, the value of the fileFirstVer variable is always overwritten. So, the analyzer issues two warnings at once:

V519 The 'fileFirstVer' variable is assigned values twice successively. Perhaps this is a mistake. Check lines: 696, 698. walWrite.c 698
V1004 The 'pRet' pointer was used unsafely after it was verified against nullptr. Check lines: 695, 698. walWrite.c 698

Perhaps the developer should have added else to remedy the situation:

SWalFileInfo *pRet = taosArrayGetLast(pWal->fileInfoSet);
if (pRet == NULL) {
  fileFirstVer = pWal->vers.lastVer + 1;
} else {
  fileFirstVer = pRet->firstVer;
}

Errors N2–N6. Errors in error handlers

Bugs often turn up in error handlers, and null pointer dereferences are no exception. It's hardly surprising since almost no one ever tests these parts of code. Neither does anyone write unit tests for them—it's just too tedious.

int32_t ctgGetFetchName(SArray* pNames, SCtgFetch* pFetch, SName** ppName) {
  STablesReq* pReq = (STablesReq*)taosArrayGet(pNames, pFetch->dbIdx);
  if (NULL == pReq) {
    qError("fail to get the %dth tb in pTables, tbNum:%d",
           pFetch->tbIdx, (int32_t)taosArrayGetSize(pReq->pTables));
    return TSDB_CODE_CTG_INTERNAL_ERROR;
  }
  ....
}

The PVS-Studio warning: V522 Dereferencing of the null pointer 'pReq' might take place. ctgUtil.c 1769

If the pReq pointer is null, it's dereferenced to print the number of elements in the table:

STablesReq* pReq = ;
if (NULL == pReq) {
  ....(pReq->pTables));

A dubious idea :)

On the one hand, the error doesn't seem crucial: it's unlikely to come up, otherwise it would have been noticed and fixed.

On the other hand, it's critical:

In case of a failure, the program will crash instead of providing a reasonable message to identify and fix the error. Perhaps, users have complained about such crashes before, but the developers might not have realized that the bug lies here. We're expecting a message ... but there isn't one :)
Dereferencing a null pointer makes behavior undefined. An optimizing compiler can do anything with this code. For example, it can delete the check and the message printing altogether, assuming that a pointer can't be null :)

Here are similar defects in error handlers:

V522 Dereferencing of the null pointer 'pBufInfo' might take place. groupcacheoperator.c 391
V522 Dereferencing of the null pointer 'item' might take place. scanoperator.c 4756
V522 Dereferencing of the null pointer 'pTrans' might take place. mndCompact.c 710
V522 Dereferencing of the null pointer 'pEntry' might take place. syncPipeline.c 885

Errors N7–N10. Incorrect assert usage

template <typename It>
static void
linkResultDirectedEdges(It first, It last)
// throw(TopologyException);
{
  for(; first != last; ++first) {
    Node* node = *first;
    assert(node);

    EdgeEndStar* ees = node->getEdges();
    assert(ees);
    DirectedEdgeStar* des = dynamic_cast<DirectedEdgeStar*>(ees);
    assert(des);

    // this might throw an exception
    des->linkResultDirectedEdges();
  }
}

The PVS-Studio warning: V522 There might be dereferencing of a potential null pointer 'des'. PlanarGraph.h 98

The dynamic_cast operator may return a null pointer, and therefore must be checked. However, using assert is clearly incorrect in this case. Such a check is of little use. If the pointer is null when running the debug version, the error will be noticed even without the assert operator. In the case of a release build, the assert macro will turn into nothing, and the use of a null pointer, leading to undefined behavior in the future.

Assertions (assert) are intended to ensure that the data is within the expected ranges while testing the application. But in this case, by using dynamic_cast, the programmer implies that the object casting may fail and the pointer will be null. In other words, a null pointer is an expected option. If a developer wants type casting to always perform successfully, they should have used static_cast. Don't hesitate to read the article on a related topic: "Why it is bad idea to check result of malloc call with assert".

It's better to replace assert with the if operator and write the code that handles the case when the pointer is null.

These are other similar errors:

V522 There might be dereferencing of a potential null pointer 'nextedge'. LineMergeDirectedEdge.cpp 64
V522 There might be dereferencing of a potential null pointer 'edge'. EdgeRing.cpp 225
V522 There might be dereferencing of a potential null pointer 'point'. PointGeometryUnion.cpp 52

Errors N11–N13. dynamic_cast gets bolder

There is no check at all after dynamic_cast is executed. A null pointer can be dereferenced while evaluating a condition when nextedge->getEdgeDirection() is called.

LineMergeDirectedEdge*
LineMergeDirectedEdge::getNext(bool checkDirection)
{
  ....
  if(getToNode()->getOutEdges()->getEdges()[0] == getSym()) {
    auto nextedge = dynamic_cast<LineMergeDirectedEdge*>(
      getToNode()->getOutEdges()->getEdges()[1]);
    return (!checkDirection || nextedge->getEdgeDirection()) ?
      nextedge : nullptr;
  }
  ....
}

The PVS-Studio warning: V522 There might be dereferencing of a potential null pointer 'nextedge'. LineMergeDirectedEdge.cpp 57

The result of dynamic_cast must be checked. If the type casting is expected to succeed, the faster static_cast operator should be used instead. This will add clarity to those who maintain the code.

In this case, it seems logical to me to refine the condition:

auto nextedge = dynamic_cast<LineMergeDirectedEdge*>(
  getToNode()->getOutEdges()->getEdges()[1]);
return (!checkDirection ||
        (nextedge && nextedge->getEdgeDirection())) ?
  nextedge : nullptr;

However, this code looks complicated, so let's make it more readable:

auto nextedge = dynamic_cast<LineMergeDirectedEdge*>(
  getToNode()->getOutEdges()->getEdges()[1]);

if (!checkDirection ||
    (nextedge && nextedge->getEdgeDirection()))
{
  return nextedge;
}
return nullptr;

Other warnings:

V522 There might be dereferencing of a potential null pointer. EdgeRing.cpp 300
V522 There might be dereferencing of a potential null pointer. EdgeRing.cpp 318

Error N14. Macros...

Do you remember the article about a stack-consuming macro? It said that macros can harbor unpleasant surprises that are hard to notice on a code review. Here comes another example of a "macro mess".

int32_t qWorkerInit(....) {
  ....
  if (NULL == mgmt->schHash) {
    taosMemoryFreeClear(mgmt);
    qError("init %d scheduler hash failed", mgmt->cfg.maxSchedulerNum);
    QW_ERR_JRET(terrno);
  }
  ....
}

Did you spot the error? I guess not. Reviewing this code, one may find it hard to see what's wrong here.

The issue is that taosMemoryFreeClear is not a function call to free memory, but a macro that also nullifies the pointer.

#define taosMemoryFreeClear(ptr)   \
  do {                             \
    if (ptr) {                     \
      taosMemoryFree((void *)ptr); \
      (ptr) = NULL;                \
    }                              \
  } while (0)

So, an attempt to print a message will result in dereferencing a null pointer.

The PVS-Studio warning: V522 Dereferencing of the null pointer 'mgmt' might take place. qworker.c 1442

To fix the code, the author should place the qError function call before the macro.

if (NULL == mgmt->schHash) {
  qError("init %d scheduler hash failed", mgmt->cfg.maxSchedulerNum);
  taosMemoryFreeClear(mgmt);
  QW_ERR_JRET(terrno);
}

Oh, those macros...

Errors N15–N112. No checks when allocating memory

The TDengine project devs should keep an eye on memory allocation checks after malloc functions (or similar ones). Sometimes the checks are there:

void* buf = taosMemoryMalloc(tlen);
if (NULL == buf) {
  taosArrayDestroy(reqNew.pArray);
  tDeleteSVCreateTbBatchReq(&req);
  goto end;
}

But often, there are none. This is quite sad and bad given it's a library for IoT devices:

Ignoring memory allocation errors is generally a no-go for libraries. We can't predict how and where the library will be used. So, if something goes wrong, the library authors must notify the app devs so they could handle a situation.
IoT often involves embedded devices with a relatively limited memory, where memory shortages are not such an exotic situation to be handled.
Just imagine this awkward moment—an app randomly crashes because the library developer didn't account for memory issues. Worse, this may lead to database inconsistency.

Learn more details in the article: "Four reasons to check what the malloc function returned". If someone in your team doesn't check pointers after malloc, I suggest you gently make them read this article—repeatedly—until they are fully aware and enlightened.

What does the lack of checks look like in TDengine? Diverse, yet boring to delve into. I'm going to give you a couple examples.

taos_linked_list_t *taos_linked_list_new(void) {
  taos_linked_list_t *self =
    (taos_linked_list_t *)taos_malloc(sizeof(taos_linked_list_t));
  self->head = NULL;
  self->tail = NULL;
  self->free_fn = NULL;
  self->compare_fn = NULL;
  self->size = 0;
  return self;
}

The PVS-Studio warning: V522 There might be dereferencing of a potential null pointer 'self'. Check lines: 28, 27. taos_linked_list.c 28

unsigned char*
SZ_skip_compress_double(double* data, size_t dataLength, size_t* outSize)
{
  *outSize = dataLength*sizeof(double);
  unsigned char* out = (unsigned char*)malloc(dataLength*sizeof(double));
  memcpy(out, data, dataLength*sizeof(double));
  return out;
}

The PVS-Studio warning: V575 The potential null pointer is passed into 'memcpy' function. Inspect the first argument. Check lines: 28, 27. sz_double.c 28

Other similar warnings. open icon

V522 There might be dereferencing of a potential null pointer '* coeff_array'. Check lines: 304, 303. dataCompression.c 304
V522 There might be dereferencing of a potential null pointer 'keys'. Check lines: 344, 333. iniparser.c 344
V522 Dereferencing of the null pointer 'vce' might take place. The potential null pointer is passed into 'compressSingleFloatValue' function. Inspect the first argument. Check lines: 209, 439, 433. dataCompression.c 209
V522 Dereferencing of the null pointer 'lce' might take place. The potential null pointer is passed into 'addExactData' function. Inspect the fourth argument. Check lines: 275, 442, 434. dataCompression.c 275
V522 There might be dereferencing of a potential null pointer '* decData'. Check lines: 514, 463. dataCompression.c 514
V522 Dereferencing of the null pointer 'vce' might take place. The potential null pointer is passed into 'compressSingleDoubleValue' function. Inspect the first argument. Check lines: 234, 575, 569. dataCompression.c 234
V522 Dereferencing of the null pointer 'lce' might take place. The potential null pointer is passed into 'addExactData' function. Inspect the fourth argument. Check lines: 275, 578, 570. dataCompression.c 275
V522 There might be dereferencing of a potential null pointer 'result'. Check lines: 143, 134. CompressElement.c 143
V522 There might be dereferencing of a potential null pointer 'type'. Check lines: 152, 122. szd_double.c 152
V522 There might be dereferencing of a potential null pointer '* dia'. Check lines: 18, 17. DynamicIntArray.c 18
V522 There might be dereferencing of a potential null pointer 'type'. Check lines: 130, 107. sz_double.c 130
V522 There might be dereferencing of a potential null pointer 'vce'. Check lines: 132, 126. sz_double.c 132
V522 There might be dereferencing of a potential null pointer 'types'. Check lines: 160, 129. szd_float.c 160
V522 There might be dereferencing of a potential null pointer 'type2code'. Check lines: 48, 43. transcode.c 48
V522 There might be dereferencing of a potential null pointer 'diff'. Check lines: 49, 44. transcode.c 49
V522 There might be dereferencing of a potential null pointer 'tp_code'. Check lines: 63, 36. transcode.c 63
V522 There might be dereferencing of a potential null pointer 'tp_code'. Check lines: 146, 106. transcode.c 146
V522 There might be dereferencing of a potential null pointer 'type'. Check lines: 138, 114. sz_float.c 138
V522 There might be dereferencing of a potential null pointer 'vce'. Check lines: 141, 134. sz_float.c 141
V522 There might be dereferencing of a potential null pointer '* dba'. Check lines: 18, 17. DynamicByteArray.c 18
V522 There might be dereferencing of a potential null pointer 'huffmanTree->code[n->c]'. Check lines: 129, 125. Huffman.c 129
V522 There might be dereferencing of a potential null pointer '* out'. Check lines: 425, 424. Huffman.c 425
V522 There might be dereferencing of a potential null pointer 'symbol'. Check lines: 633, 632. dumper.c 633
V522 There might be dereferencing of a potential null pointer 'stackTrace'. Check lines: 716, 715. dumper.c 716
V522 There might be dereferencing of a potential null pointer 'subgeomArray'. Check lines: 2084, 2082. geos_ts_c.cpp 2084
V522 There might be dereferencing of a potential null pointer '* vgroup_ids'. Check lines: 97, 83. taos_counter.c 97
V522 There might be dereferencing of a potential null pointer '* keys'. Check lines: 98, 88. taos_counter.c 98
V522 There might be dereferencing of a potential null pointer 'node'. Check lines: 92, 90. taos_linked_list.c 92
V522 There might be dereferencing of a potential null pointer 'node'. Check lines: 108, 106. taos_linked_list.c 108
V522 There might be dereferencing of a potential null pointer 'self'. Check lines: 42, 41. taos_map.c 42
V522 There might be dereferencing of a potential null pointer 'self'. Check lines: 78, 77. taos_map.c 78
V522 There might be dereferencing of a potential null pointer 'self->addrs'. Check lines: 98, 94. taos_map.c 98
V522 There might be dereferencing of a potential null pointer 'new_addrs'. Check lines: 287, 283. taos_map.c 287
V522 There might be dereferencing of a potential null pointer 'self'. Check lines: 35, 34. taos_metric_formatter.c 35
V522 There might be dereferencing of a potential null pointer 'k'. Check lines: 60, 47. taos_metric.c 60
V522 There might be dereferencing of a potential null pointer 'self'. Check lines: 45, 44. taos_string_builder.c 45
V522 There might be dereferencing of a potential null pointer 'self->str'. Check lines: 59, 58. taos_string_builder.c 59
V522 There might be dereferencing of a potential null pointer 'self'. Check lines: 49, 47. taos_collector_registry.c 49
V522 There might be dereferencing of a potential null pointer 'self'. Check lines: 39, 38. taos_metric_sample.c 39
V522 There might be dereferencing of a potential null pointer 'e'. Check lines: 532, 530. lru_cache.cc 532
V522 There might be dereferencing of a potential null pointer 'column_families'. Check lines: 1038, 1036. c.cc 1038
V522 There might be dereferencing of a potential null pointer 'cf_names'. Check lines: 2526, 2522. c.cc 2526
V522 There might be dereferencing of a potential null pointer 'cf_options'. Check lines: 2527, 2523. c.cc 2527
V522 There might be dereferencing of a potential null pointer 'level_meta'. Check lines: 5308, 5307. c.cc 5308
V522 There might be dereferencing of a potential null pointer 'file_meta'. Check lines: 5339, 5338. c.cc 5339
V522 There might be dereferencing of a potential null pointer 'buf'. Check lines: 5599, 5596. c.cc 5599
V522 There might be dereferencing of a potential null pointer 'wi'. Check lines: 5627, 5626. c.cc 5627
V522 There might be dereferencing of a potential null pointer 'result'. Check lines: 5672, 5671. c.cc 5672
V575 The potential null pointer is passed into 'memset' function. Inspect the first argument. Check lines: 51, 50. sz_double.c 51
V575 The potential null pointer is passed into 'memset' function. Inspect the first argument. Check lines: 273, 272. sz_double.c 273
V575 The potential null pointer is passed into 'memset' function. Inspect the first argument. Check lines: 380, 379. sz_double.c 380
V575 The potential null pointer is passed into 'memset' function. Inspect the first argument. Check lines: 47, 46. sz_float.c 47
V575 The potential null pointer is passed into 'memset' function. Inspect the first argument. Check lines: 275, 274. sz_float.c 275
V575 The potential null pointer is passed into 'memset' function. Inspect the first argument. Check lines: 402, 401. sz_float.c 402
V575 The potential null pointer is passed into 'memcpy' function. Inspect the first argument. Check lines: 33, 27. DynamicByteArray.c 33
V575 The potential null pointer is passed into 'memset' function. Inspect the first argument. Check lines: 20, 19. Huffman.c 20
V575 The potential null pointer is passed into 'memset' function. Inspect the first argument. Check lines: 29, 24. Huffman.c 29
V575 The potential null pointer is passed into 'memset' function. Inspect the first argument. Check lines: 30, 25. Huffman.c 30
V575 The potential null pointer is passed into 'memset' function. Inspect the first argument. Check lines: 31, 26. Huffman.c 31
V575 The potential null pointer is passed into 'memset' function. Inspect the first argument. Check lines: 32, 27. Huffman.c 32
V575 The potential null pointer is passed into 'memset' function. Inspect the first argument. Check lines: 172, 171. Huffman.c 172
V575 The potential null pointer is passed into 'memset' function. Inspect the first argument. Check lines: 194, 193. Huffman.c 194
V575 The potential null pointer is passed into 'memset' function. Inspect the first argument. Check lines: 413, 412. Huffman.c 413
V575 The potential null pointer is passed into 'memset' function. Inspect the first argument. Check lines: 415, 414. Huffman.c 415
V575 The potential null pointer is passed into 'memset' function. Inspect the first argument. Check lines: 417, 416. Huffman.c 417
V575 The potential null pointer is passed into 'memset' function. Inspect the first argument. Check lines: 419, 418. Huffman.c 419
V575 The potential null pointer is passed into 'memset' function. Inspect the first argument. Check lines: 440, 439. Huffman.c 440
V575 The potential null pointer is passed into 'memset' function. Inspect the first argument. Check lines: 442, 441. Huffman.c 442
V575 The potential null pointer is passed into 'memset' function. Inspect the first argument. Check lines: 444, 443. Huffman.c 444
V575 The potential null pointer is passed into 'memset' function. Inspect the first argument. Check lines: 446, 445. Huffman.c 446
V575 The potential null pointer is passed into 'memset' function. Inspect the first argument. Check lines: 464, 463. Huffman.c 464
V575 The potential null pointer is passed into 'memset' function. Inspect the first argument. Check lines: 466, 465. Huffman.c 466
V575 The potential null pointer is passed into 'memset' function. Inspect the first argument. Check lines: 468, 467. Huffman.c 468
V575 The potential null pointer is passed into 'memset' function. Inspect the first argument. Check lines: 470, 469. Huffman.c 470
V575 The potential null pointer is passed into 'memset' function. Inspect the first argument. Check lines: 566, 565. Huffman.c 566
V575 The potential null pointer is passed into 'memset' function. Inspect the first argument. Check lines: 568, 567. Huffman.c 568
V575 The potential null pointer is passed into 'memset' function. Inspect the first argument. Check lines: 570, 569. Huffman.c 570
V575 The potential null pointer is passed into 'memset' function. Inspect the first argument. Check lines: 572, 571. Huffman.c 572
V575 The potential null pointer is passed into 'memset' function. Inspect the first argument. Check lines: 603, 602. Huffman.c 603
V575 The potential null pointer is passed into 'memset' function. Inspect the first argument. Check lines: 605, 604. Huffman.c 605
V575 The potential null pointer is passed into 'memset' function. Inspect the first argument. Check lines: 607, 606. Huffman.c 607
V575 The potential null pointer is passed into 'memset' function. Inspect the first argument. Check lines: 609, 608. Huffman.c 609
V575 The potential null pointer is passed into 'memset' function. Inspect the first argument. Check lines: 656, 655. Huffman.c 656
V575 The potential null pointer is passed into 'memset' function. Inspect the first argument. Check lines: 658, 657. Huffman.c 658
V575 The potential null pointer is passed into 'memset' function. Inspect the first argument. Check lines: 660, 659. Huffman.c 660
V575 The potential null pointer is passed into 'memset' function. Inspect the first argument. Check lines: 662, 661. Huffman.c 662
V575 The potential null pointer is passed into 'memcpy' function. Inspect the first argument. Check lines: 710, 708. Huffman.c 710
V575 The potential null pointer is passed into 'memset' function. Inspect the first argument. Check lines: 22, 21. TightDataPointStorageF.c 22
V575 The potential null pointer is passed into 'memset' function. Inspect the first argument. Check lines: 194, 193. TightDataPointStorageF.c 194
V575 The potential null pointer is passed into 'memset' function. Inspect the first argument. Check lines: 22, 21. TightDataPointStorageD.c 22
V575 The potential null pointer is passed into 'memset' function. Inspect the first argument. Check lines: 194, 193. TightDataPointStorageD.c 194
V575 The potential null pointer is passed into 'memcpy' function. Inspect the first argument. Check lines: 3320, 3319. geos_ts_c.cpp 3320
V575 The potential null pointer is passed into 'memcpy' function. Inspect the first argument. Check lines: 3337, 3336. geos_ts_c.cpp 3337
V575 The potential null pointer is passed into 'memset' function. Inspect the first argument. Check lines: 42, 41. taos_metric.c 42
V575 The potential null pointer is passed into 'memcpy' function. Inspect the first argument. Check lines: 145, 144. taos_string_builder.c 145
V575 The potential null pointer is passed into 'memcpy' function. Inspect the first argument. Check lines: 533, 532. c.cc 533

Note. Some may point out that some errors stem from libraries rather than TDengine itself. But as I've already mentioned, when it comes to secure development, it doesn't matter where exactly the bug is found. Let me repeat once again: there is no difference for a user whether the application crashes because of an error in TDengine or in another library that was needed to build TDengine. Developers are responsible not only for the quality of their own code, but also for the quality of the third-party code they use.

Errors N113–N130 (in fact, more). Pointer dereference before the check

It is a common error when a pointer is used before it's checked. The simplest error of this type is as follows:

bool
RectangleIntersection::clip_linestring_parts(
  const geom::LineString* gi, ....)
{
  auto n = gi->getNumPoints();

  if(gi == nullptr || n < 1) {
    return false;
  }
  ....
}

The PVS-Studio warning: V595 The 'gi' pointer was utilized before it was verified against nullptr. Check lines: 137, 139. RectangleIntersection.cpp 137

The gi pointer check should have been written earlier. I think there is no point in elaborating on this case.

Here is a similar case:

int32_t tsortOpen(SSortHandle* pHandle) {
  int32_t code = 0;
  if (pHandle->opened) {
    return code;
  }

  if (pHandle == NULL || pHandle->fetchfp == NULL ||
      pHandle->comparFn == NULL) {
    return TSDB_CODE_INVALID_PARA;
  }
  ....
}

V595 The 'pHandle' pointer was utilized before it was verified against nullptr. Check lines: 2883, 2887. tsort.c 2883

It's the same here. There are some variations, but hopefully the point is clear. If the logic behind the V595 diagnostic rule doesn't seem quite obvious to you, you may learn more about it from this post: "Explanation on Diagnostic V595". It's been 10 years since this note was written. Since then, its data flow analysis has grown more advanced, and it now evaluates possible pointer values with greater precision. However, this does not make this diagnostic rule any less useful, and it still effectively finds bugs caused by incorrect ordering of pointer dereference and null checks.

Other errors of this type. open icon

V595 The 'col' pointer was utilized before it was verified against nullptr. Check lines: 2075, 2076. geos_ts_c.cpp 2075
V595 The 'keys' pointer was utilized before it was verified against nullptr. Check lines: 88, 89. taos_counter.c 88
V595 The 'dbCache' pointer was utilized before it was verified against nullptr. Check lines: 196, 199. ctgCache.c 196
V595 The 'pDbCache' pointer was utilized before it was verified against nullptr. Check lines: 1631, 1633. ctgCache.c 1631
V595 The 'pInfo->pState' pointer was utilized before it was verified against nullptr. Check lines: 155, 158. streamfilloperator.c 155
V595 The 'pFillInfo' pointer was utilized before it was verified against nullptr. Check lines: 1582, 1588. streamfilloperator.c 1582
V595 The 'string' pointer was utilized before it was verified against nullptr. Check lines: 869, 870. mndTopic.c 869
V595 The 'pFile' pointer was utilized before it was verified against nullptr. Check lines: 1351, 1353. osFile.c 1351
V595 The 'bins' pointer was utilized before it was verified against nullptr. Check lines: 4267, 4268. sclfunc.c 4267
V595 The 'pCtx->freeFunc' pointer was utilized before it was verified against nullptr. Check lines: 351, 358. schUtil.c 351
V595 The 'pNodeList' pointer was utilized before it was verified against nullptr. Check lines: 481, 482. clientImpl.c 481
V595 The 'pReq' pointer was utilized before it was verified against nullptr. Check lines: 2831, 2832. clientImpl.c 2831
V595 The 'vgroup_ids' pointer was utilized before it was verified against nullptr. Check lines: 83, 84. taos_counter.c 83
V595 The 'pReq' pointer was utilized before it was verified against nullptr. Check lines: 1214, 1215. transCli.c 1214
V595 The 'pReq' pointer was utilized before it was verified against nullptr. Check lines: 2755, 2758. transCli.c 2755
V595 The 'pReq' pointer was utilized before it was verified against nullptr. Check lines: 3364, 3380. transCli.c 3364
This is where I got bored and stopped writing out warnings. In fact, there are more errors than that.

Resource leaks

Manual memory management is fraught with errors. The TDengine project is mostly written in C, so it's not surprising that we may encounter errors of this type.

Errors N131–N140. Memory leak on a program execution flow

taos_map_t *taos_map_new() {
  int r = 0;

  taos_map_t *self = (taos_map_t *)taos_malloc(sizeof(taos_map_t));
  self->size = 0;
  self->max_size = TAOS_MAP_INITIAL_SIZE;

  self->keys = taos_linked_list_new();
  if (self->keys == NULL) return NULL;
  ....
}

The PVS-Studio warning: V773 The function was exited without releasing the 'self' pointer. A memory leak is possible. taos_map.c 82

If a new key can't be created, the taos_map_new function terminates prematurely. This does not free the memory buffer pointed to by self.

Check out similar errors. open icon

V773 The function was exited without releasing the 'new_addrs' pointer. A memory leak is possible. taos_map.c 289
V773 The function was exited without releasing the 'k' pointer. A memory leak is possible. taos_metric.c 52
V773 The function was exited without releasing the 'new_nexts' pointer. A memory leak is possible. regex_internal.c 1421
V773 The function was exited without releasing the 'new_indices' pointer. A memory leak is possible. regex_internal.c 1421
V773 The function was exited without releasing the 'new_edests' pointer. A memory leak is possible. regex_internal.c 1421
V773 The function was exited without releasing the 'new_eclosures' pointer. A memory leak is possible. regex_internal.c 1421
V773 The function was exited without releasing the 'self' pointer. A memory leak is possible. taos_collector_registry.c 53
V773 The function was exited without releasing the 'new_start' pointer. A memory leak is possible. regexec.c 534
V773 The function was exited without releasing the 'new_end' pointer. A memory leak is possible. regexec.c 534

Errors N141–N144. Careless use of the realloc function

INLINE void addDIA_Data(DynamicIntArray *dia, int value)
{
  if(dia->size==dia->capacity)
  {
    dia->capacity = dia->capacity << 1;
    dia->array = (unsigned char *)
      realloc(dia->array, dia->capacity*sizeof(unsigned char));
  }
  dia->array[dia->size] = (unsigned char)value;
  dia->size ++;
}

The PVS-Studio warning: V701 realloc() possible leak: when realloc() fails in allocating memory, original pointer 'dia->array' is lost. Consider assigning realloc() to a temporary pointer. DynamicIntArray.c 54

If the realloc function fails to allocate a new buffer, it will return NULL. The old value of the dia->array pointer will be lost and it will be impossible to free the buffer whose address was previously stored in it.

Given the lack of a pointer check further, the memory leak is minor. However, I'd like to highlight this improper handling of the realloc function.

Note. In this fragment, the memory allocation error will lead to interesting consequences. Data will be written...

dia->array[dia->size] = (unsigned char)value;

...not to a null pointer, but to some remote memory area, the address of which depends on the previous array size. Consequences are unpredictable. Access violation is possible. Perhaps, some data in memory will be corrupted but the program will continue to work, at least for a while. As they say, happy debugging :)

Other similar errors:

V701 realloc() possible leak: when realloc() fails in allocating memory, original pointer 'dba->array' is lost. Consider assigning realloc() to a temporary pointer. DynamicByteArray.c 57
V701 realloc() possible leak: when realloc() fails in allocating memory, original pointer 'dba->array' is lost. Consider assigning realloc() to a temporary pointer. DynamicByteArray.c 68
V701 realloc() possible leak: when realloc() fails in allocating memory, original pointer 'self->str' is lost. Consider assigning realloc() to a temporary pointer. taos_string_builder.c 84

Errors N145–N157. Careless use of the emplace_back function

Status SstFileWriter::Open(const std::string& file_path) {
  ....
  for (size_t i = 0; i < user_collector_factories.size(); i++) {
    int_tbl_prop_collector_factories.emplace_back(
        new UserKeyTablePropertiesCollectorFactory(
            user_collector_factories[i]));
  }
  ....
}

The PVS-Studio warning: V1023 A pointer without owner is added to the 'int_tbl_prop_collector_factories' container by the 'emplace_back' method. A memory leak will occur in case of an exception. sst_file_writer.cc 298

If the container is full, memory is reallocated. This operation may fail, resulting in a std::bad_alloc exception. In this case, the pointer will be lost and the created object will never be deleted.

A secure construction that protects against potential memory leaks:

int_tbl_prop_collector_factories.emplace_back(
  std::make_unique<UserKeyTablePropertiesCollectorFactory>(
    user_collector_factories[i]));

Other warnings. open icon

V1023 A pointer without owner is added to the 'locations' container by the 'emplace_back' method. A memory leak will occur in case of an exception. ConnectedElementLocationFilter.cpp 54
V1023 A pointer without owner is added to the 'locations' container by the 'emplace_back' method. A memory leak will occur in case of an exception. ConnectedElementLocationFilter.cpp 67
V1023 A pointer without owner is added to the 'outOERs' container by the 'emplace_back' method. A memory leak will occur in case of an exception. MaximalEdgeRing.cpp 117
V1023 A pointer without owner is added to the 'edgeRings' container by the 'emplace_back' method. A memory leak will occur in case of an exception. PolygonBuilder.cpp 87
V1023 A pointer without owner is added to the 'copied_operands_' container by the 'emplace_back' method. A memory leak will occur in case of an exception. merge_context.h 41
V1023 A pointer without owner is added to the 'copied_operands_' container by the 'emplace_back' method. A memory leak will occur in case of an exception. merge_context.h 57
V1023 A pointer without owner is added to the 'jobs' container by the 'emplace_back' method. A memory leak will occur in case of an exception. db_impl_compaction_flush.cc 458
V1023 A pointer without owner is added to the 'parent_iters_' container by the 'emplace_back' method. A memory leak will occur in case of an exception. range_del_aggregator.cc 373
V1023 A pointer without owner is added to the 'builder_guards' container by the 'emplace_back' method. A memory leak will occur in case of an exception. version_set.cc 5026
V1023 A pointer without owner is added to the 'table_properties_collectors' container by the 'emplace_back' method. A memory leak will occur in case of an exception. block_based_table_builder.cc 533
V1023 A pointer without owner is added to the 'table_properties_collectors' container by the 'emplace_back' method. A memory leak will occur in case of an exception. block_based_table_builder.cc 540
V1023 A pointer without owner is added to the 'int_tbl_prop_collector_factories' container by the 'emplace_back' method. A memory leak will occur in case of an exception. sst_file_writer.cc 290

Buffer/array overflow

The high performance of the C and C++ languages comes at the cost of many automatic checks being absent. There are also no built-in checks for allocated buffer overflows. Compilers partially compensate for the lack of checks by performing static analysis and issuing warnings for obvious cases. However, it is also useful to use specialized tools such as PVS-Studio that can detect even more errors.

Error N158. Buffer overflow

const char* rocksdb_iter_value(const rocksdb_iterator_t* iter, size_t* vlen) {
  Slice s = iter->rep->value();
  *vlen = s.size();
  return s.data();
}

int32_t streamDefaultIterGet_rocksdb(....) {
  ....
  while (rocksdb_iter_valid(pIter)) {
    const char* key = rocksdb_iter_key(pIter, &klen);
    int32_t     vlen = 0;
    const char* vval = rocksdb_iter_value(pIter, (size_t*)&vlen);
  ....
}

The PVS-Studio warning: V512 A call of the 'rocksdb_iter_value' function will lead to overflow of the buffer '& vlen'. streamBackendRocksdb.c 4390

This code will work on the 32-bit app version but fail on a 64-bit build.

The address of the vlen 32-bit variable is interpreted as a pointer to the type size_t:

int32_t     vlen = 0;
const char* vval = rocksdb_iter_value(pIter, (size_t*)&vlen);

In the rocksdb_iter_value function, a value of type size_t will be written at this address.

In a 32-bit program—if we do not consider exotic architectures—the size of the size_t variable is 4 bytes and coincides with the size of the int32_t type. The code will work correctly.

In a 64-bit program, the size of size_t type equals 8 bytes. So, writing a 64-bit value by the address of the vlen variable causes some data to be written outside this variable. As a result, 4 bytes will be written to the stack after this variable, resulting in undefined program behavior.

Errors N159. Buffer overflow due to confusion in constants

First, let's keep in mind that MAX_QUERY_VALUE_LEN is 1024:

#define MAX_QUERY_VALUE_LEN       1024

Next, note that the third dimension of the array char data[100][100][100][1024] is also 1024:

typedef struct _script_t {
  ....
  char              cols[12];
  char              data[100][100][1024];
  char              system_exit_code[12];
  ....
} SScript;

Finally, here is the code containing an error:

bool simExecuteNativeSqlCommand(SScript *script, char *rest, bool isSlow) {
  ....
  char *value = NULL;
  if (i < MAX_QUERY_COL_NUM) {
    value = script->data[numOfRows][i];
  }
  if (value == NULL) {
    continue;
  }
  ....
  int32_t    *length = taos_fetch_lengths(pSql);
  ....
  if (length[i] < 0 || length[i] > 1 << 20) {
    fprintf(stderr, "Invalid length(%d) of BINARY or NCHAR\n", length[i]);
    exit(-1);
  }
  memset(value, 0, MAX_QUERY_VALUE_LEN);
  memcpy(value, row[i], length[i]);
  value[length[i]] = 0;  
  ....
}

The PVS-Studio warning: V512 A call of the 'memcpy' function will lead to overflow of the buffer 'value'. simExec.c 786

There is some value in an element of the length[i] array. It's pre-checked:

if (length[i] < 0 || length[i] > 1 << 20) {
  fprintf(stderr, "Invalid length(%d) of BINARY or NCHAR\n", length[i]);
  exit(-1);
}

This value is then used as the size of the data to be copied:

memcpy(value, row[i], length[i]);

The problem is that 1 << 20 is not 1024, but 1048576. So, the check doesn't save from a potential buffer overflow. I think the correct thing to do is to use a named constant MAX_QUERY_VALUE_LEN in the condition, rather than a magic number. This constant should also be used when declaring the array.

char        data[100][100][MAX_QUERY_VALUE_LEN];  
....
if (length[i] < 0 || length[i] > MAX_QUERY_VALUE_LEN) {
  fprintf(stderr, "Invalid length(%d) of BINARY or NCHAR\n", length[i]);
  exit(-1);
}
memset(value, 0, MAX_QUERY_VALUE_LEN);
memcpy(value, row[i], length[i]);

Oops, I just realized there's another error in the code:

value[length[i]] = 0;

This line is unnecessary and even harmful. First, the array is already pre-filled with zeros after the memset function call and the additional null terminator is not needed. Second, if length[i] == MAX_QUERY_VALUE_LEN, the array overflows. Basically, the author should delete this line.

Let's dwell on this error a bit more. Since we assume the null terminator, we cannot copy MAX_QUERY_VALUE_LEN bytes. Then there will be no room for the null terminator. Hence, we need to modify the check once again and use the >= operator instead of >.

if (length[i] < 0 || length[i] >= MAX_QUERY_VALUE_LEN)

Error N160. Potential array overflow

Let's take a moment to recap. Can you imagine we've already gotten to 160 bugs?!

And do you realize that we've already found 160 errors in the database? That's so unfortunate.

I prescribe the immediate introduction of static code analyzers into the TDengine development process :)

dictionary * iniparser_load(const char * ininame)
{
  ....
  char line    [ASCIILINESZ+1] ;
  ....
  memset(line,    0, ASCIILINESZ);
  ....
  last=0 ;

  while (fgets(line+last, ASCIILINESZ-last, in)!=NULL) {
    lineno++ ;
    len = (int)strlen(line)-1;
    if (len==0)
      continue;
    /* Safety check against buffer overflows */
    if (line[len]!='\n') {
      fprintf(stderr,
              "iniparser: input line too long in %s (%d)\n",
              ininame,
              lineno);
      dictionary_del(dict);
      fclose(in);
      return NULL ;
    }
    ....
}

The PVS-Studio warning: V557 Array underrun is possible. The value of 'len' index could reach -1. iniparser.c 695

If the input for fgets is an empty string, the len variable will be -1, which leads to an array overflow. We've gone through it in the article: "Shoot yourself in the foot when handling input data". This note considers identical reproduced errors.

Given the above, this comment in the code looks even more ironic:

/* Safety check against buffer overflows */

Typos

Various programming languages have different protection measures against null pointers/references, array overflows, and division by zero. Some, like C and C++, rely entirely on a developer. Others throw exceptions. However, no language is immune to typos. It's hard to determine what a typo is, so there are no standard, language-level methods for dealing with them.

But that doesn't mean that nothing can be done. PVS-Studio analyzer detects a large number of common types of typos. While it doesn't implement a universal approach, it handles a wide range of typical cases—and that makes it highly effective in practice.

Error N161. Overwriting value

static int32_t getRowsBlockWithinMergeLimit(....) {
  ....
  if (keepRows == 0) {
    *pSkipBlock = true;
    *pRes = pOrigBlk;
  }

  *pSkipBlock = false;
  ....
}

The PVS-Studio warning: V519 The '* pSkipBlock' variable is assigned values twice successively. Perhaps this is a mistake. Check lines: 2198, 2202. tsort.c 2202

The line *pSkipBlock = true makes no sense, as the value will change to false anyway. Most likely, the author should have added else:

if (keepRows == 0) {
  *pSkipBlock = true;
  *pRes = pOrigBlk;
}
else {
  *pSkipBlock = false;
}

Error N162. Copy-paste repeat

static void processSimpleMeta(SMqMetaRsp* pMetaRsp, cJSON** meta) {
  ...
  } else if (pMetaRsp->resMsgType == TDMT_VND_ALTER_TABLE) {
    processAlterTable(pMetaRsp, meta);
  } else if (pMetaRsp->resMsgType == TDMT_VND_DROP_TABLE) {
    processDropTable(pMetaRsp, meta);
  } else if (pMetaRsp->resMsgType == TDMT_VND_DROP_TABLE) {
    processDropTable(pMetaRsp, meta);
  } else if (pMetaRsp->resMsgType == TDMT_VND_DELETE) {
  ....
}

The PVS-Studio warning: V517 The use of 'if (A) {...} else if (A) {...}' pattern was detected. There is a probability of logical error presence. Check lines: 2316, 2318. clientRawBlockWrite.c 2316

The above blocks of the same-type test might have been written using the copy-paste method. At some point, the developer probably got distracted, and repeated this snippet:

} else if (pMetaRsp->resMsgType == TDMT_VND_DROP_TABLE) {
  processDropTable(pMetaRsp, meta);

This is a fairly common type of error. We have plenty of them in our collection. If one of the blocks is redundant, the author can safely remove it. In this case, the typo doesn't affect the program operation. In the worst case, the developer implied another check or another action here.

Error N163. Forgotten dangerous code with possible division by 0

OffsetSegmentGenerator::OffsetSegmentGenerator(....) : ....
{
  ....
  // compute intersections in full precision, to provide accuracy
  // the points are rounded as they are inserted into the curve line
  filletAngleQuantum = MATH_PI / 2.0 / bufParams.getQuadrantSegments();

  int quadSegs = bufParams.getQuadrantSegments();
  if (quadSegs < 1) quadSegs = 1;
  filletAngleQuantum = MATH_PI / 2.0 / quadSegs;
  ....
}

The PVS-Studio warning: V519 The 'filletAngleQuantum' variable is assigned values twice successively. Perhaps this is a mistake. Check lines: 82, 86. OffsetSegmentGenerator.cpp 86

The author wrote the following code:

filletAngleQuantum = MATH_PI / 2.0 / bufParams.getQuadrantSegments();

The getQuadrantSegments function can return 0, so the code was rewritten to protect against division by zero:

int quadSegs = bufParams.getQuadrantSegments();
if (quadSegs < 1) quadSegs = 1;
filletAngleQuantum = MATH_PI / 2.0 / quadSegs;

Except, they forgot to delete the previous line. As a result, we can end up with the division by zero and, as a result, undefined behavior.

Error N164. Identical functions

typedef struct {
  int64_t firstVer;
  int64_t lastVer;
  int64_t createTs;
  int64_t closeTs;
  int64_t fileSize;
  int64_t syncedOffset;
} SWalFileInfo;

static inline int64_t walGetCurFileFirstVer(SWal* pWal) {
  if (pWal->writeCur == -1) return -1;
  SWalFileInfo* pInfo =
    (SWalFileInfo*)taosArrayGet(pWal->fileInfoSet, pWal->writeCur);
  return pInfo->firstVer;
}

static inline int64_t walGetCurFileLastVer(SWal* pWal) {
  if (pWal->writeCur == -1) return -1;
  SWalFileInfo* pInfo =
    (SWalFileInfo*)taosArrayGet(pWal->fileInfoSet, pWal->writeCur);
  return pInfo->firstVer;
}

The PVS-Studio warning: V524 It is odd that the body of 'walGetCurFileLastVer' function is fully equivalent to the body of 'walGetCurFileFirstVer' function. walInt.h 97

There are two data members in the SWalFileInfo structure:

firstVer;
lastVer.

There are two functions to obtain values from these data members:

walGetCurFileFirstVer;
walGetCurFileLastVer.

But the function bodies are identical and both return the value of the firstVer data member.

Errors N165–N170. "Parentheses curse"

int32_t dmInit() {
  dInfo("start to init dnode env");
  int32_t code = 0;
  ....
  if ((code = dmCheckDiskSpace()) != 0) return code;
  if ((code = dmCheckRepeatInit(dmInstance())) != 0) return code;
  if ((code = dmInitSystem()) != 0) return code;
  if ((code = dmInitMonitor()) != 0) return code;
  if ((code = dmInitAudit()) != 0) return code;
  if ((code = dmInitDnode(dmInstance())) != 0) return code;
  if ((code = InitRegexCache() != 0)) return code;
  ....
}

The PVS-Studio warning: V593 Consider reviewing the expression of the 'A = B != C' kind. The expression is calculated as following: 'A = (B != C)'. dmEnv.c 182

Who spotted the error? ;)

A nice typo, I think. The parenthesis is misplaced in the last check. The result of the InitRegexCache function call is compared to 0, and only after 0 or 1, it is written to the code variable.

Here are a few other similar typos:

V593 Consider reviewing the expression of the 'A = B != C' kind. The expression is calculated as following: 'A = (B != C)'. mndArbGroup.c 299
V593 Consider reviewing the expression of the 'A = B != C' kind. The expression is calculated as following: 'A = (B != C)'. mndConfig.c 430
V593 Consider reviewing the expression of the 'A = B != C' kind. The expression is calculated as following: 'A = (B != C)'. mndUser.c 418
V593 Consider reviewing the expression of the 'A = B < C' kind. The expression is calculated as following: 'A = (B < C)'. streamMeta.c 411
V593 Consider reviewing the expression of the 'A = B != C' kind. The expression is calculated as following: 'A = (B != C)'. transCli.c 1876

Error N171. Pointless check

void
CoordinateSequence::add(const CoordinateSequence& cs,
                        std::size_t from, std::size_t to)
{
  if (cs.stride() == stride() && cs.hasM() == cs.hasM()) {
      m_vect.insert(m_vect.end(),
                    std::next(cs.m_vect.cbegin(),
                    static_cast<std::ptrdiff_t>(from * stride())),
                    std::next(cs.m_vect.cbegin(),
                    static_cast<std::ptrdiff_t>((to + 1u)*stride())));
  } else {
  ....
}

The PVS-Studio warning: V501 There are identical sub-expressions to the left and to the right of the '==' operator: cs.hasM() == cs.hasM() CoordinateSequence.cpp 154

The developer's hand trembled and accidentally added an extra cs.. As a result, some part of the condition will always be true:

cs.hasM() == cs.hasM()

Error N172. Not an error but still an error

bool startsWith(const std::string & s, char prefix) {
  if (s.empty() == 0) {
    return false;
  }

  return s[0] == prefix;
}

The PVS-Studio warning: V557 Array overrun is possible. The '0' index is pointing beyond array bound. string.cpp 53

The analyzer's warning isn't quite correct here. In fact, there is no array overflow. Even an empty string (the null-terminated string) contains at least one character. However, the analyzer points out that if the string is empty, there is no point to access a null element. It's weird and it's probably some kind of a typo.

Note N1. Perhaps, we should refine this diagnostic rule to show another message for such a case. I'll open a task for my colleagues.

Note N2. Before C++11, such code was considered erroneous as it causes undefined behavior.

The error is that the result of the empty function call is compared to 0. The comparison is obviously unnecessary and the correct code should look like this:

bool startsWith(const std::string & s, char prefix) {
  if (s.empty()) {
    return false;
  }

  return s[0] == prefix;
}

Error N173. Typo when using similar variable names

int32_t streamTaskUpdateTaskCheckpointInfo(....) {
  ....
  bool valid = (pInfo->checkpointId  <= pReq->checkpointId &&
                pInfo->checkpointVer <= pReq->checkpointVer &&
                pInfo->processedVer  <= pReq->checkpointVer);
  ....
}

The PVS-Studio warning: V1013 Suspicious subexpression in a sequence of similar comparisons. streamCheckpoint.c 654

The variable names look similar, so the typo is not surprising.

The pInfo->processedVer variable should be compared to pReq->processedVer instead of comparing with pReq->checkpointVer.

Errors N174 and N175. Sausage and the price for laziness

I've already broken down two other typos in previous articles on refactoring sloppy code:

Breaking down bugs in TDengine to master refactoring, part 1: sausage code
Breaking down bugs in TDengine to master refactoring, part 3: price of laziness

Other errors

The project will benefit greatly from fixing the errors we've reviewed in this article. However, this piece isn't meant to serve as a guide to the errors. I've already mentioned that in the introduction, but let me say it again:

I skimmed through the report and noted only a portion of these issues. My purpose was to show the benefits of static code analysis as a practice rather than find as many errors as possible.
Static analysis is most efficient when used regularly, not occasionally.

As a conclusion, let's look at a few more various errors not mentioned in the above sections.

Errors N176 and N177. Errors when using the shift operator <<

uint64_t unpackUint64(uint8_t* ch, uint8_t sz) {
  uint64_t n = 0;
  for (uint8_t i = 0; i < sz; i++) {
    n = n | (ch[i] << (8 * i));
  }
  return n;
}

The PVS-Studio warning: V629 Consider inspecting the 'ch[i] << (8 * i)' expression. Bit shifting of the 32-bit value with a subsequent expansion to the 64-bit type. indexFstUtil.c 55

A 64-bit value must be created from a byte array. However, a failure will occur when attempting to set the upper 32-bits in a 64-bit variable. On shift, the left operand—an 8-bit unsigned character—will be implicitly converted to a 32-bit int. However, it's not enough: if the int value is shifted by more than 31 bits, an overflow will occur. The developer must explicitly cast the operand to a 64-bit type beforehand:

n = n | ((uint64_t)(ch[i]) << (8 * i));

Here is a similar error:

static int hashset_add(hashset_t set, void *item) {
  int ret = hashset_add_member(set, item);

  size_t old_capacity = set->capacity;
  if (set->nitems >= (double)old_capacity * set->load_factor) {
    size_t *old_items = set->items;
    ++set->nbits;
    set->capacity = (size_t)(1 << set->nbits);
  ....
}

But the PVS-Studio warning is different: V1028 Possible overflow. Consider casting operands of the '1 << set->nbits' operator to the 'size_t' type, not the result. tdbPager.c 88

Error N178. Unreachable code

How about a little attention test? Try to find the bug in this function:

static int32_t getBlkFromSessionCache(struct SOperatorInfo* pOperator,
  int64_t sessionId, SGcSessionCtx* pSession, SSDataBlock** ppRes)
{
  int32_t code = TSDB_CODE_SUCCESS;
  SGroupCacheOperatorInfo* pGCache = pOperator->info;
  bool locked = false;
  SGcDownstreamCtx* pCtx = &pGCache->pDownstreams[pSession->downstreamIdx];
  
  while (true) {
    bool got = false;
    code = getBlkFromSessionCacheImpl(pOperator, sessionId,
                                      pSession, ppRes, &got);
    if (TSDB_CODE_SUCCESS != code || got) {
      goto _return;
    }
    
    if ((atomic_load_64(&pCtx->fetchSessionId) == sessionId)
      || (-1 == atomic_val_compare_exchange_64(
                  &pCtx->fetchSessionId, -1, sessionId))) {
      if (locked) {
        (void)taosThreadMutexUnlock(&pSession->pGroupData->mutex);
        locked = false;
      }
      
      code = getCacheBlkFromDownstreamOperator(pOperator, pCtx,
                                               sessionId, pSession, ppRes);
      goto _return;
    } else {
      // FOR NOW, SHOULD NOT REACH HERE
      qError("Invalid fetchSessionId:%" PRId64 ",
             currentSessionId:%" PRId64, pCtx->fetchSessionId, sessionId);
      return TSDB_CODE_QRY_EXECUTOR_INTERNAL_ERROR;
    }

    if (locked) {
      code = groupCacheSessionWait(pOperator, pCtx, sessionId,
                                   pSession, ppRes);
      locked = false;
      if (TSDB_CODE_SUCCESS != code) {
        goto _return;
      }
      
      break;
    }
    
    (void)taosThreadMutexLock(&pSession->pGroupData->mutex);
    locked = true;
  };


_return:

  if (locked) {
    (void)taosThreadMutexUnlock(&pSession->pGroupData->mutex);
  }

  return code;
}

The PVS-Studio warning: V779 Unreachable code detected. It is possible that an error is present. groupcacheoperator.c 1227

Any luck? If not, the error is hiding here:

if (....)        // << (A)
{
  ....
  goto _return;  // << (B)
} else {
  ....
  return TSDB_CODE_QRY_EXECUTOR_INTERNAL_ERROR; // << (C)
}
....
if (locked) {    // << (D)
....
_return:         // << (E)

Regardless of the condition (A), the code (D) will never get control. We will either move (B) to the label (E) or exit the function (C).

Errors N179-N184. Potential overflow

typedef struct SFilePage {
  int32_t num;
  ....
} SFilePage;

typedef struct tMemBucket {
  ....
  int32_t            bytes;
  ....
} tMemBucket;

static int32_t loadDataFromFilePage(tMemBucket *pMemBucket, ....) {
  ....
  SFilePage *pg = getBufPage(pMemBucket->pBuffer, *pageId);
  ....
  (void)memcpy((*buffer)->data + offset, pg->data,
               (size_t)(pg->num * pMemBucket->bytes));
  ....
}

The PVS-Studio warning: V1028 Possible overflow. Consider casting operands of the 'pg->num * pMemBucket->bytes' operator to the 'size_t' type, not the result. tpercentile.c 64

Explicit type casting doesn't help avoid an overflow when multiplying 32-bit variables.

(size_t)(pg->num * pMemBucket->bytes)

The author should perform type casting before multiplication, not after:

(size_t)(pg->num) * pMemBucket->bytes

Look at similar warnings:

V1028 Possible overflow. Consider casting operands of the 'pColData->nVal + 1' operator to the 'int64_t' type, not the result. tdataformat.c 1904
V1028 Possible overflow. Consider casting operands of the 'pColData->nVal + 1' operator to the 'int64_t' type, not the result. tdataformat.c 1904
V1028 Possible overflow. Consider casting operands, not the result. compaction_picker_level.cc 818
V1028 Possible overflow. Consider casting operands of the 'vlen * 4' operator to the 'size_t' type, not the result. tbase64.c 23
V1028 Possible overflow. Consider casting operands of the 'inlen * 3' operator to the 'size_t' type, not the result. tbase64.c 59

Error N185. Incorrect std namespace extension

namespace std {
inline void swap(ROCKSDB_NAMESPACE::port::WindowsThread& th1,
                 ROCKSDB_NAMESPACE::port::WindowsThread& th2) {
  th1.swap(th2);
}
}  // namespace std

The PVS-Studio warning: V1061 Extending the 'std' namespace may result in undefined behavior. win_thread.h 110

You may learn more about this defect in the documentation if you'd like to. Honestly, I'm getting tired of going over all these bugs :) I'm going to hit the 190 mark and then I'm done. Or shall I reach 200 anyway? No, down with perfectionism! I need a break.

Error N186. Comparing of "garbage" bytes

typedef struct STreeNode {
  int32_t index;
  void   *pData;  // TODO remove it?
} STreeNode;

int32_t tMergeTreeAdjust(SMultiwayMergeTreeInfo* pTree, int32_t idx) {
  ....
  STreeNode kLeaf = pTree->pNode[idx];
  ....
  if (memcmp(&kLeaf, &pTree->pNode[1], sizeof(kLeaf)) != 0) {
  ....
}

The PVS-Studio warning: V1103 The values of padding bytes are unspecified. Comparing objects with padding using 'memcmp' may lead to unexpected result. tlosertree.c 127

In a 64-bit program, there are 4 additional bytes between the index variable and the pointer intended to align the 64-bit pointer to the 8-byte boundary. It's a bad idea to compare such structures using the memcmp function. The additional bytes for alignment contain random values. The memcmp function may consider the structures as different even though the values of all fields are the same.

Errors N187-N190. Use of obsolete functions related to cryptography

uint32_t taosSafeRand(void) {
  ....
  if (!CryptGenRandom(hCryptProv, 4, &seed)) return seed;
  ....
}

The PVS-Studio warning: V1109 The 'CryptGenRandom' function is deprecated. Consider switching to an equivalent newer function. osRand.c 56

Other similar warnings:

V1109 The 'CryptAcquireContextA' function is deprecated. Consider switching to an equivalent newer function. osRand.c 50
V1109 The 'CryptAcquireContextA' function is deprecated. Consider switching to an equivalent newer function. osRand.c 51
V1109 The 'CryptReleaseContext' function is deprecated. Consider switching to an equivalent newer function. osRand.c 58

Conclusion

Static analysis enables users to meet various technical needs listed below.

Enhance software quality and reliability while minimizing reputational risks associated with zero-day vulnerabilities.
Detect bugs and potential vulnerabilities at the development stage: the earlier a defect is detected, the cheaper it is to fix.
Identify error patterns that your team may not even be aware of.
Shift the focus during code reviews toward algorithms and high-level bugs rather than scrutinize variable names and parentheses placement.
Write simpler and more reliable code. Even the analyzer's false positives may be useful. If the code confuses an analyzer, it's likely to confuse a human too. It's better to rewrite this code.
Maintain overall code quality. For example, an increase in error density might indicate the need for better training for new team members.
Build a secure software development lifecycle (SSDLC).

PVS-Studio can be used for all these tasks. It supports code analysis for C, C++, C#, and Java. It runs on Windows, Linux, and macOS. PVS-Studio is a SAST solution to enhance quality, reliability, and security of your projects.

Additional links

#Cpp #Embedded

Tags:

#Cpp #Embedded

Why SSDLC needs static analysis: a case study of 190 bugs in TDengine

130 shades of null pointers

Error N1. Selection error

Errors N2–N6. Errors in error handlers

Errors N7–N10. Incorrect assert usage

Errors N11–N13. dynamic_cast gets bolder

Error N14. Macros...

Errors N15–N112. No checks when allocating memory

Errors N113–N130 (in fact, more). Pointer dereference before the check

Resource leaks

Errors N131–N140. Memory leak on a program execution flow

Errors N141–N144. Careless use of the realloc function

Errors N145–N157. Careless use of the emplace_back function

Buffer/array overflow

Error N158. Buffer overflow

Errors N159. Buffer overflow due to confusion in constants

Error N160. Potential array overflow

Typos

Error N161. Overwriting value

Error N162. Copy-paste repeat

Error N163. Forgotten dangerous code with possible division by 0

Error N164. Identical functions

Errors N165–N170. "Parentheses curse"

Error N171. Pointless check

Error N172. Not an error but still an error

Error N173. Typo when using similar variable names

Errors N174 and N175. Sausage and the price for laziness

Other errors

Errors N176 and N177. Errors when using the shift operator <<

Error N178. Unreachable code

Errors N179-N184. Potential overflow

Error N185. Incorrect std namespace extension

Error N186. Comparing of "garbage" bytes

Errors N187-N190. Use of obsolete functions related to cryptography

Conclusion

Additional links

Posts: articles

Poll:

Comments (0)

Your contact information:

Desired license type:

Want to try PVS‑Studio for free?