Support custom types by davispuh · Pull Request #303 · jacob-carlborg/dstep

davispuh · 2026-03-19T18:57:12Z

Consider C code like:

#define FIELD_PTR(Record, TYPE, Field) ((TYPE *)&((Record)->Field))

Currently dstep cannot parse such a macro definition, because the TYPE used in the cast is not known at definition time and only becomes known at the point of use. Such definitions are therefore simply ignored.
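To make the "only known at usage time" point concrete, here is a minimal C sketch of the macro in use. The `struct record` type, its fields, and the `field_ptr_demo` helper are hypothetical names introduced only for this illustration; they are not part of the PR.

```c
#include <assert.h>

#define FIELD_PTR(Record, TYPE, Field) ((TYPE *)&((Record)->Field))

/* Hypothetical record type, used only for this illustration. */
struct record {
    int id;
    double weight;
};

/* TYPE is supplied by each caller, so the macro definition alone
   never says which type the cast refers to. */
static void field_ptr_demo(void)
{
    struct record r = { 7, 2.5 };

    int *id_ptr = FIELD_PTR(&r, int, id);            /* TYPE = int    */
    double *weight_ptr = FIELD_PTR(&r, double, weight); /* TYPE = double */

    *id_ptr += 1;
    assert(r.id == 8);
    assert(*weight_ptr == 2.5);
}
```

The same macro body yields an `int *` in one expansion and a `double *` in the other, which is why a translator that needs a concrete type at definition time has nothing to resolve `TYPE` against.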

This PR implements support for custom types that don't need to be known at definition time.
Note that it's not trivial to determine whether (SYMBOL) is a cast to a type or simply a parenthesized parameter/value, so after parsing we use fixCasts to try to rewrite these places according to their actual usage.
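The ambiguity can be reproduced in plain C. In the sketch below, `APPLY` and `cast_or_subtract_demo` are hypothetical names chosen for illustration; the point is only that the same token sequence is a cast or a binary expression depending on what `SYMBOL` turns out to be.

```c
#include <assert.h>

/* Hypothetical macro: whether (SYMBOL) is a cast depends on the argument. */
#define APPLY(SYMBOL, v) ((SYMBOL) - (v))

static void cast_or_subtract_demo(void)
{
    /* SYMBOL is a type: expands to ((int) - (3)),
       i.e. a cast applied to unary minus. */
    assert(APPLY(int, 3) == -3);

    /* SYMBOL is a value: expands to ((10) - (3)),
       i.e. an ordinary binary subtraction. */
    assert(APPLY(10, 3) == 7);
}
```

A parser that sees only the definition has to guess between the two readings, which is presumably why a corrective pass over the parsed tree is needed at all.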

We also repurpose CXTypeKind.unexposed for these custom type definitions.

With this PR, the result for such a macro definition will be:

extern (D) auto FIELD_PTR(T0, T1, T2)(auto ref T0 Record, auto ref T1 TYPE, auto ref T2 Field)
{
    return cast(TYPE*) &(Record.Field);
}

Note that this change has a side effect. Definitions like:

#define __unused  __attribute__((__unused__))
#define CONST  const
#define STATIC  static

will be converted to:

enum __unused = __attribute__(__unused__);
enum CONST = const;
enum STATIC = static;

whereas previously they were simply dropped.
I consider this a desirable change, since the goal is to convert as much code as possible, and I will submit another PR that comments out invalid code.

Copilot

Pull request overview

This PR extends dstep’s macro parsing and type translation to support “custom” type placeholders (types not known at macro definition time) by representing them as CXTypeKind.unexposed and post-processing ambiguous cast-like patterns.

Changes:

- Update type translation helpers to take a full clang.Type and emit unexposed type spellings when no declaration is available.
- Extend macro type-name parsing to optionally accept arbitrary identifiers as type-names in cast/sizeof contexts.
- Add a fixCasts post-pass to rewrite some ambiguous (SYMBOL) <unary-op> ... parses into binary operator expressions when SYMBOL matches a macro parameter.

Reviewed changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 3 comments.

| File | Description |
| --- | --- |
| `dstep/translator/Type.d` | Adjusts low-level type translation to accept `Type` and emit spelling for `unexposed` when needed for custom-type macros. |
| `dstep/translator/MacroParser.d` | Adds “any type” parsing in cast/sizeof, accepts keywords as identifiers in primary expressions, and introduces `fixCasts` post-processing. |

Comments suppressed due to low confidence (1)

dstep/translator/MacroParser.d:981

parseCastExpr now calls parseTypeName(..., anyType=true), which will treat any identifier in parentheses as a type (via CXTypeKind.unexposed). This makes patterns like (bar)(a) parse as a cast to the unknown type bar instead of a function call bar(a), regressing existing behavior (there are unit tests for translating (bar)(a) as a call). Consider tightening the heuristic: only accept an unexposed identifier as a type in cast context when it can be proven to be a type (e.g., present in the type table) or when it matches a macro type-parameter, and otherwise fall back to parsing as a parenthesized expression/call.

Expression parseCastExpr(ref Token[] tokens, Cursor[string] table, bool defined)
{
    auto local = tokens;

    if (!accept!("(")(local, TokenKind.punctuation))
        return parseUnaryExpr(tokens, table, defined);

    Type type = parseTypeName(local, table, true);

    if (!type.isValid)
        return parseUnaryExpr(tokens, table, defined);

    if (!accept!(")")(local, TokenKind.punctuation))
        return parseUnaryExpr(tokens, table, defined);
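The call-versus-cast concern raised in this comment can be seen directly in C. The sketch below is an illustration only: `bar` follows the name used in the comment, while `baz` and `call_or_cast_demo` are hypothetical additions.

```c
#include <assert.h>

/* bar names a function, so (bar)(a) is a call. */
static int bar(int a) { return a + 1; }

/* baz names a type, so (baz)(a) is a cast. */
typedef long baz;

static void call_or_cast_demo(void)
{
    assert((bar)(41) == 42);   /* call through a parenthesized function name */
    assert((baz)(41) == 41L);  /* cast of the parenthesized value 41 */
}
```

Both expressions have the shape `(identifier)(expression)`, so only knowledge of what the identifier denotes can separate them. That is the reason the comment suggests consulting the type table or the macro parameter list before treating the identifier as a type.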


Copilot · 2026-03-19T19:12:58Z

+    local = tokens;
+    if (accept(local, spelling, TokenKind.identifier) ||
+        accept(local, spelling, TokenKind.keyword))
+    {
+        tokens = local;
        return Expression(Identifier(spelling));
-
-    auto local = tokens;
+    }


parsePrimaryExpr now accepts TokenKind.keyword as an Identifier. This will cause macro bodies that expand to keywords (e.g., __declspec, static, const, struct, etc.) to start producing output instead of being dropped, and the resulting D expression is often invalid (e.g., enum STATIC = static; or enum CINDEX_LINKAGE = __declspec(dllexport);). If the goal is “convert as much as possible”, consider emitting these as commented-out/disabled nodes or gating keyword-as-identifier behind an option so generated bindings remain compilable and functional snapshot tests don’t regress.
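As a reminder of why these macros have no value-like translation, here is a short C sketch of how they are typically used. The variable `answer` and the helper `keyword_macro_demo` are hypothetical and serve only as illustration.

```c
#include <assert.h>

#define CONST  const
#define STATIC static

/* The macros expand to a qualifier and a storage class, not to values:
   this line is exactly `static const int answer = 42;` after expansion. */
STATIC CONST int answer = 42;

static void keyword_macro_demo(void)
{
    assert(answer == 42);
}
```

Because `CONST` and `STATIC` only ever appear in declaration position, an `enum STATIC = static;` on the D side has nothing meaningful to initialize, which matches the concern about generated bindings no longer compiling.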

Copilot · 2026-03-19T19:12:59Z

+                } else if (["+", "-"].canFind(unaryExpr.operator))
+                {
+                    auto newExpr = new AddExpr();
+                    newExpr.operator = unaryExpr.operator;
+                    newExpr.left = Identifier(castExpr.type.spelling);
+                    newExpr.right = unaryExpr.subexpr;
+                    expr = newExpr;
+                } else if (["*", "/", "%"].canFind(unaryExpr.operator))
+                {
+                    auto newExpr = new MulExpr();
+                    newExpr.operator = unaryExpr.operator;
+                    newExpr.left = Identifier(castExpr.type.spelling);
+                    newExpr.right = unaryExpr.subexpr;
+                    expr = newExpr;


In fixCasts, the MulExpr branch checks for "/" and "%", but parseUnaryExpr never produces unary operators for / or %, so these cases are unreachable. Consider narrowing this to just "*" (dereference) to reduce confusion, or expanding the logic to match the actual operators that can appear here.
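The asymmetry between `*` and `/` comes from C itself: `*` has both a unary (dereference) and a binary (multiplication) form, while `/` and `%` are binary only. A small sketch, using the hypothetical macros `SCALE` and `RATIO` and the hypothetical helper `star_demo`:

```c
#include <assert.h>

/* Hypothetical macros for illustration. */
#define SCALE(SYMBOL, p) ((SYMBOL) * (p))
#define RATIO(SYMBOL, p) ((SYMBOL) / (p))

static void star_demo(void)
{
    double value = 2.5;
    double *ptr = &value;

    /* SYMBOL is a type: ((int) * (ptr)) casts the dereferenced pointer. */
    assert(SCALE(int, ptr) == 2);

    /* SYMBOL is a value: ((3) * (4)) is a multiplication. */
    assert(SCALE(3, 4) == 12);

    /* '/' has no unary form, so (SYMBOL) / (p) can only be a division;
       RATIO(int, 4) would not compile. */
    assert(RATIO(12, 4) == 3);
}
```

Under that reading, only `*` can reach the cast-followed-by-unary-operator shape, so the `/` and `%` entries would never match.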

Copilot · 2026-03-19T19:12:59Z

@@ -1411,6 +1418,9 @@ Type parseTypeName(ref Token[] tokens, Cursor[string] table)
    if (!parseSpecifierQualifierList(local, type, table))
        return type;

+    if (anyType && acceptIdentifier(local, type.spelling))
+        type.kind = CXTypeKind.unexposed;
+
    parseAbstractDeclarator(local, type, table);

    tokens = local;


This change introduces new behavior (treating unknown identifiers in cast/sizeof type-names as unexposed custom types, plus fixCasts). There doesn’t appear to be unit test coverage for the motivating case (e.g. #define FIELD_PTR(Record, TYPE, Field) ((TYPE *)&((Record)->Field))) or for ambiguous (SYMBOL) cases that fixCasts is meant to correct. Adding focused MacroTranslTests for these scenarios would help prevent regressions (especially around distinguishing casts vs calls).

Support custom types

619ab95

Copilot AI review requested due to automatic review settings March 19, 2026 18:57

Copilot started reviewing on behalf of davispuh March 19, 2026 18:57 View session

davispuh mentioned this pull request Mar 19, 2026

Comment out compiler builtins #304

Open

davispuh wants to merge 1 commit into jacob-carlborg:master from davispuh:types
