为什么在类型定义结构时使用不同的标识符？

Question

考虑这段代码：

typedef struct _Node Node;

struct _Node {
    struct _Node * node;
};

或者这个：

typedef struct _Node {
    struct _Node * node;
} Node;

有什么理由不将它们重写为

typedef struct Node Node;

struct Node {
    Node * node;
};

和

typedef struct Node {
    Node * node;
} Node;

据我所知，它们是等价的。这些下划线前缀的原因是什么？我还看到了其他变体，其中它不是下划线，而是第一次出现时是大写字母，第二次出现时是小写字母，或者其他使它们不同的东西。一个例子在这里：

typedef struct Books {
   char title[50];
   char author[50];
   char subject[100];
   int book_id;
} Book;

此示例来自 tutorialspoint，我知道它们通常不应被视为可靠来源。但是这样做有什么好的理由吗？

Answer 1

人们使用像 struct _Node 这样的名称来故意忽略标准中设置的规则（或者，更常见的是，因为他们不知道标准中设置的规则）。粗略地说，该标准表示以下划线开头的名称大多保留给 'the implementation'，表示编译器和系统库。详见C11§7.1.3 Reserved identifiers：

All identifiers that begin with an underscore and either an uppercase letter or another underscore are always reserved for any use.

All identifiers that begin with an underscore are always reserved for use as identifiers with file scope in both the ordinary and tag name spaces.

另请注意 §6.2.1 Scopes of identifiers — 添加强调：

An identifier can denote an object; a function; a tag or a member of a structure, union, or enumeration; a typedef name; a label name; a macro name; or a macro parameter. The same identifier can denote different entities at different points in the program. A member of an enumeration is called an enumeration constant. Macro names and macro parameters are not considered further here, because prior to the semantic phase of program translation any occurrences of macro names in the source file are replaced by the preprocessing token sequences that constitute their macro definitions.

另请注意，POSIX 也保留了 _t 后缀 — 参见 The Compilation Environment。

也就是说，思维过程似乎是"this name shouldn't be used much; prefix it with an underscore to discourage its use"。还有 "I've seen it used in system headers; I'll copy that style"，没有意识到系统 headers 是这样编码的，以避免踩到为普通程序员保留的名称 space 并使用为普通程序员保留的名称 space实施。

您的重写是明智的；这就是我通常做的。

解决扩展问题：

不是每个人都知道结构标签与普通标识符有不同的名称space，所以他们不知道 typedef struct Book Book; 是完全安全和明确的（第一个 Book 在标签名称 space 中并且必须以 struct 开头；第二个 Book 在普通标识符名称 space 中并且必须 而不是 前面有 struct).

此外，人们查看系统 headers，看看他们做了什么，并认为他们应该复制那里的样式，而没有意识到实施有不同的规则强加于它可以和不能用于什么名字。

请注意，Linux kernel coding standards 不鼓励对结构类型使用 typedef；他们要求你在任何地方都使用 struct WhatEver。该规则有一些优点和一些缺点 - self-consistency 可能比您使用的约定更重要。这意味着 'go with the flow' 对于现有项目，但在您自己的新项目中采用哪种方式并不重要，只要您始终如一。

您还可以在圣经中找到使用结构标记的替代名称和相应的 typedef 名称的先例——Kernighan 和 Ritchie 的意思是 "The C Programming Language"。（有趣的是，他们的示例在 1978 年第一版和 1988 年第二版之间发生了很大变化。）

第二版：

typedef struct tnode *Treeptr;

typedef struct tnode {
    …
    Treeptr left;
    Treeptr right;
} Treenode;

第一版：

typedef struct tnode {
    …
    struct tnode *left;
    struct tnode *right;
} TREENODE, *TREEPTR;

请注意，现代风格倾向于避免对指针使用 typedef。参见 Is it a good idea to typedef pointers?。

为什么在类型定义结构时使用不同的标识符？

Why use different identifiers when typedefing structs?

c

struct

typedef