(This is a follow-up from "Are there any realistic use cases for `decltype(auto)` variables?")
Consider the following scenario - I want to pass a function f
to another function invoke_log_return
which will:
Invoke
f
;Print something to stdout;
Return the result of
f
, avoiding unnecessary copies/moves and allowing copy elision.
Note that, if f
throws, nothing should be printed to stdout. This is what I have so far:
template <typename F>
decltype(auto) invoke_log_return(F&& f)
{
decltype(auto) result{std::forward<F>(f)()};
std::printf(" ...logging here...\n");
if constexpr(std::is_reference_v<decltype(result)>)
{
return decltype(result)(result);
}
else
{
return result;
}
}
Let's consider the various possibilities:
When
f
returns a prvalue:result
will be an object;invoke_log_return(f)
will be a prvalue (eligible for copy elision).
When
f
returns an lvalue or xvalue:result
will be a reference;invoke_log_return(f)
will be a lvalue or xvalue.
You can see a test application here on godbolt.org. As you can see, g++
performs NRVO for the prvalue case, while clang++
doesn't.
Questions:
Is this the shortest possible way of "perfectly" returning a
decltype(auto)
variable out of a function? Is there a simpler way to achieve what I want?Can the
if constexpr { ... } else { ... }
pattern be extracted to a separate function? The only way to extract it seems to be a macro.Is there any good reason why
clang++
does not perform NRVO for the prvalue case above? Should it be reported as a potential enhancement, or isg++
's NRVO optimization not legal here?
Here's an alternative using a on_scope_success
helper (as suggested by Barry Revzin):
template <typename F>
struct on_scope_success : F
{
int _uncaught{std::uncaught_exceptions()};
on_scope_success(F&& f) : F{std::forward<F>(f)} { }
~on_scope_success()
{
if(_uncaught == std::uncaught_exceptions()) {
(*this)();
}
}
};
template <typename F>
decltype(auto) invoke_log_return_scope(F&& f)
{
on_scope_success _{[]{ std::printf(" ...logging here...\n"); }};
return std::forward<F>(f)();
}
While invoke_log_return_scope
is much shorter, this requires a different mental model of the function behavior and the implementation of a new abstraction. Surprisingly, both g++
and clang++
perform RVO/copy-elision with this solution.
One major drawback of this approach, as mentioned by Ben Voigt, is that the return value of f
cannot be part of the log message.
We can use a modified version of
std::forward
: (the name forward is avoided to prevent ADL problems)This function template is used to forward a
decltype(auto)
variable. It can be used like this:This way, if
std::forward<F>(f)()
returnsa prvalue, then
result
is a non-reference, andinvoke_log_return
returns a non-reference type;an lvalue, then
result
is an lvalue-reference, andinvoke_log_return
returns an lvalue reference type;an xvalue, then
result
is an rvalue-reference, andinvoke_log_return
returns an rvalue reference type.(Essentially copied from my https://stackoverflow.com/a/57440814)
That's the simplest and most clear way to write it:
The GCC gets the right (no needless copies or moves) expected result:
So if code is clear, have ever the same functionality but is't optimized to run as much as the competitors does it's a compiler optimization failure and clang should work it out. That's the kind of problem that make lot more sense solved in the tool instead the application layer implementation.
https://gcc.godbolt.org/z/50u-hT