double cpp expansion?

Discussion:

(too old to reply)

s***@amu.edu.pl

2004-02-27 15:31:55 UTC

#define A(x) expandA()+B(x)
#define B expandB+A
B(Barg)
A(Aarg)

my preprocessors give:

expandB+ expandA()+ expandB+A ( Barg )
expandA()+ expandB+A ( Aarg )

In the first case `B' macro is expanded twice, unlike
the second case, where each macro is expanded once.

With a little simplified definitions:
#define A() B()
#define B A
my solution is:

B ( ) //replace "B"

A ( ) //initial rescan leaves "A" as is
|B|
// "(" found after "A" - replace "A" in
// context of "B" - nested expansion

B ( ) // start nested rescan
| A |
| B |

B'( ) // rescan finds B in "B" context - paints token,
| A | // nothing else changes
| B |

B() // result

Could you please check my expansion and tell me
what error I do, if any? Thank you.

--
S.Tobias

Douglas A. Gwyn

2004-02-27 21:28:16 UTC

Permalink

I'm not sure what bug your preprocessor has. During the
expansion of B in B(Barg) there is an expansion of A(Barg)
that should not further recurse on B since B is still
"being replaced" (most readily implemented as a flag in
the symbol table). The nested B resulting from expanding
A(Barg) should be flagged ("painted blue", another flag in
the symbol table) and never replaced. The result of
expanding B(Barg) should thus be "expB+expA()+B(Barg)".
If you have the source code for the preprocessor you could
try "instrumenting" it (adding carefully placed printouts)
to watch how it is making its decisions while expanding
your test code.

s***@amu.edu.pl

2004-02-28 01:05:46 UTC

Permalink

Post by Douglas A. Gwyn
the symbol table) and never replaced. The result of
expanding B(Barg) should thus be "expB+expA()+B(Barg)".

Thank you a lot! You have reassured me.

Compilers/preprocessors I have tested were:
latest gcc, latest como, msvc7.1, Digital unix cc (probably old)
and latest mcpp - all of them expand to expB+expA()+expB+A(Barg)

Did the standard change, ambiguities, or nobody (few?) did it
the right way? Is there at least one exemplary implementation?

Post by Douglas A. Gwyn
If you have the source code for the preprocessor you could
try "instrumenting" it (adding carefully placed printouts)

I've looked into gcc's cpp code, sounds like a little too much
for me just now. But mcpp (announced these days) has an interesting
feature #pragma __debug_cpp __expand.

I think I'll start sending bug reports.
Thanks again!

--
S.Tobias

Paul Mensonides

2004-02-28 03:21:20 UTC

Permalink

This is incorrect. When a macro invocation spans the end of a replacement list
it is not necessarily considered nested. The traditional approach is to
consider it *not* nested, and a ton of existing code requires that behavior.
For this reason and the wording of the standard, virtually every major
preprocessor implementation has this behavior--despite the non-normative note in
the appendix that says it is unspecified whether it is nested or not.

The viewpoint that macro expansion forms an invocation hierarchy is faulty which
leads to interpretations like this one. Macro expansion is conceptually
(regardless of a particular implementation strategy) an "in place", iterative
operation. As such, the expansion procedes as follows (ignoring argument
expansion because there is none involved):

#define A(x) expandA()+B(x)
#define B expandB+A

B(Barg)

B (Barg)
| ^ |
|___|
|
B invocation

expandB+A (Barg)
|^ |
|_________|
|
B context

expandB+ A (Barg)
| |^| |
| |_|______|
|__________| |
| A invocation (#1)
B context

expandB+ expandA()+B(Barg)
|^ |
|_________________|
|
A context

expandB+ expandA()+ B (Barg)
| | ^ | |
| |___| |
| | |
| B invocation (#2)
|_____________________|
|
A context

expandB+ expandA()+ expandB+A (Barg)
| |^ | |
| |_________| |
| | |
| B context |
|___________________________|
|
A context

expandB+ expandA()+ expandB+A' (Barg)
| | ^ | | (#3)
| |__________| |
| | |
| B context |
|____________________________|
|
A context

Note in particular the points #1, #2, and #3. The invocation of A at point #1
is not considered nested within the disabling context established by the
replacement list of B. At point #2, B's invocation *is* nested within the
context established by the replacement list of A. Hence, when the A
preprocessing token is found at point #3, the A-disabling context is still
active, and the token is painted (i.e. the apostrophe). The second example
procedes as follows:

// #define A(x) expandA()+B(x)
// #define B expandB+A

A(Aarg)

A(Aarg)
|^ |
|_______|
|
A expansion

expandA()+B(Aarg)
|^ |
|_________________|
|
A context

expandA()+ B (Aarg)
| | ^ | |
| |___| |
| | |
| B invocation
|_____________________|
|
A context

expandA()+ expandB+A (Aarg)
| |^ | |
| |_________| |
| | |
| B context |
|___________________________|
|
A context

expandA()+ expandB+A' (Aarg)
| | ^ | |
| |__________| |
| | |
| B context |
|____________________________|
|
A context

Regards,
Paul Mensonides

Douglas A. Gwyn

2004-02-28 07:25:02 UTC

Permalink

Post by Paul Mensonides
This is incorrect. When a macro invocation spans the end of a replacement list
it is not necessarily considered nested.

The wording in the C standard makes it pretty clear that
the inner replacement occurs *during* the process of the
original replacement (thus setting the condition for blue
paint), and that it is only whether a macro *name* is
seen within a replacement buffer that determines the
onset of a nested replacement (which can involve argument
tokens beyond the span of the higher-level macro-plus-
arguments).

Post by Paul Mensonides
a ton of existing code requires that behavior.

Really? I find that surprising.

Post by Paul Mensonides
The viewpoint that macro expansion forms an invocation hierarchy is faulty which
leads to interpretations like this one.

The specification is explicitly recursive.

Post by Paul Mensonides
expandB+ A (Barg)
| |^| |
| |_|______|
|__________| |
| A invocation (#1)
B context
expandB+ expandA()+B(Barg)
|^ |
|_________________|
|
A context

No, you dropped a context there. The original expansion
of B has not yet concluded, because part of the process
requires expansion of a nested macro A, which has not
quite concluded...

Post by Paul Mensonides
expandB+ expandA()+ B (Barg)
| | ^ | |
| |___| |
| | |
| B invocation (#2)
|_____________________|
|
A context

...until the A-replacement (sub)buffer is examined for
further nested macros to replace, *during* which
examination an occurrence of B (or of A) shall not be
replaced. The specification practically tracks this
very example step by step, so it is easy to apply.

Note that the process described in 6.10.3.4 is one
*component* of the process 6.10.3, not something that
happens after 6.10.3 is complete. In fact the reference
to nested macros encountering the name being replaced
makes sense only with that understanding. I think what
may be throwing you is the use of the term "nested",
which you might think means geographically nested instead
of procedurally nested. Unfortunately the term is used
colloquially with no further elucidation; however, there
is no clue that it should be thought of as geographic
nesting, and in its immediate context the only already
defined nesting process it could be referring to is the
logical recursive expansion.

The only intentional ambiguity in this area is the one
referred to in the response to DR #017 (Question 9),
which refers to the situation only after *all* expansion
is complete, and a different situation than in this
example. Further, the specification makes it very clear
that blue paint is permanent, so there is no way for the
newly created "B" to ever trigger macro replacement.

Paul Mensonides

2004-02-28 09:05:36 UTC