Stephan T. Lavavej created ARROW-15520:
------------------------------------------
Summary: Unqualified format() calls are ambiguous in C++20
Key: ARROW-15520
URL: https://issues.apache.org/jira/browse/ARROW-15520
Project: Apache Arrow
Issue Type: Bug
Components: C++
Affects Versions: 7.0.0
Environment: Visual Studio 2022 17.2 Preview 2
Reporter: Stephan T. Lavavej
I work on MSVC's C++ Standard Library implementation, and we regularly build
open-source projects, including Apache Arrow, with development versions of the
MSVC toolset in order to find and fix compiler/library bugs before they can
cause problems for our programmer-users like you. This also allows us to
provide advance notice of breaking changes in the C++ Standard that will affect
you, which is the case here.
We recently implemented C++20's std::format(), followed by the C++20 Defect
Report [P2418R2|https://wg21.link/P2418R2] "Add Support For std::generator-like
Types To std::format" with
[microsoft/STL#2323|https://github.com/microsoft/STL/pull/2323]. This prevents
Apache Arrow from compiling with the latest C++ Standard mode enabled, because
MakeTimeFormatter() in diff.cc contains unqualified calls to format():
[https://github.com/apache/arrow/blob/56e270fda7f5647a157acd1e428d9735d6399881/cpp/src/arrow/array/diff.cc#L636-L681]
(The issue involves Argument-Dependent Lookup. The using-declaration `using
arrow_vendored::date::format;` means that the following unqualified calls to
`format()` will consider the overload in the `arrow_vendored::date` namespace,
which is the desired overload. However, because the arguments are
`std::chrono` types (durations and time points), `std` is considered an
"associated namespace", so it will also be searched for overloads. Our
implementation of the chrono header includes the new format header (as
permitted by the Standard - we do this because chrono types are formattable in
C++20), so the `std::format` overload is visible. Finally, the signature change
required by the P2418R2 paper makes `std::format` a highly greedy "perfect
forwarding" signature: its `Args&&` parameter binds to the rvalue chrono
argument better than the `const Streamable&` parameter of
`arrow_vendored::date::format`, while the latter still matches the `const char*`
format string better. Neither overload wins on every argument, so the call is
ambiguous.)
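The lookup behavior described above can be reproduced without any C++20 library support. The following is only a sketch under stated assumptions: `stdlike`, `vendored`, `format_string`, `duration`, `qualified_call` and the `SHOW_AMBIGUITY` macro are all hypothetical names that stand in for `std`, `arrow_vendored::date` and their real declarations, and the signatures are simplified.

```cpp
#include <string>

namespace stdlike {  // hypothetical stand-in for namespace std

struct duration { long long ticks; };  // stand-in for a std::chrono type

// Implicitly constructible from const char*, loosely like std::format_string.
struct format_string {
  const char* text;
  format_string(const char* s) : text(s) {}
};

// Greedy perfect-forwarding overload, loosely modeled on the post-P2418R2
// std::format signature.
template <class... Args>
std::string format(format_string fmt, Args&&...) {
  return std::string("stdlike: ") + fmt.text;
}

}  // namespace stdlike

namespace vendored {  // hypothetical stand-in for arrow_vendored::date

template <class Streamable>
std::string format(const char* fmt, const Streamable&) {
  return std::string("vendored: ") + fmt;
}

}  // namespace vendored

#ifdef SHOW_AMBIGUITY
// Defining SHOW_AMBIGUITY is expected to make this function fail to compile:
// ADL adds stdlike::format to the candidate set, which is better for the
// second argument (rvalue reference binding) but worse for the first
// (user-defined conversion), so neither candidate wins.
std::string unqualified_call(const char* fmt) {
  using vendored::format;
  return format(fmt, stdlike::duration{1});
}
#endif

// A qualified name disables ADL, so only vendored::format is considered.
std::string qualified_call(const char* fmt) {
  return vendored::format(fmt, stdlike::duration{1});
}
```

With the macro left undefined, `qualified_call("%H")` selects the `vendored` overload regardless of what `stdlike` declares.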
The full steps to reproduce are:
# Build the microsoft/STL repo, and update the INCLUDE/LIB/etc. environment
variables so that it can be consumed. (Or wait for Visual Studio 2022 17.2
Preview 2 to ship, as it will contain these changes.)
# Configure Apache Arrow with the latest C++ Standard version:
`cmake -G Ninja -S . -B build -DCMAKE_CXX_STANDARD=23`
(note that C++23 must be selected at this time even though std::format() is
C++20 - it's a long story)
# diff.cc fails to compile with:
{noformat}
ninja: Entering directory `build'
[13/191] Building CXX object
src\arrow\CMakeFiles\arrow_static.dir\array\diff.cc.obj
FAILED: src/arrow/CMakeFiles/arrow_static.dir/array/diff.cc.obj
C:\PROGRA~1\MIB055~1\2022\Preview\VC\Tools\MSVC\1431~1.311\bin\Hostx64\x64\cl.exe
/nologo /TP -DARROW_EXPORTING -DARROW_HAVE_RUNTIME_AVX2
-DARROW_HAVE_RUNTIME_AVX512 -DARROW_HAVE_RUNTIME_BMI2
-DARROW_HAVE_RUNTIME_SSE4_2 -DARROW_HAVE_SSE4_2 -DARROW_STATIC
-DARROW_WITH_TIMING_TESTS -DURI_STATIC_BUILD -D_CRT_SECURE_NO_WARNINGS
-D_ENABLE_EXTENDED_ALIGNED_STORAGE -IC:\Temp\arrow\cpp\build\src
-IC:\Temp\arrow\cpp\src -IC:\Temp\arrow\cpp\src\generated
-IC:\Temp\arrow\cpp\thirdparty\flatbuffers\include
-IC:\Temp\arrow\cpp\build\boost_ep-prefix\src\boost_ep
-IC:\Temp\arrow\cpp\build\xsimd_ep\src\xsimd_ep-install\include
-IC:\Temp\arrow\cpp\thirdparty\hadoop\include /DWIN32 /D_WINDOWS /GR /EHsc
/D_SILENCE_TR1_NAMESPACE_DEPRECATION_WARNING /EHsc /wd5105 /bigobj /utf-8 /W3
/wd4800 /wd4996 /wd4065 /MD /O2 /Ob2 /DNDEBUG -std:c++latest /showIncludes
/Fosrc\arrow\CMakeFiles\arrow_static.dir\array\diff.cc.obj
/Fdsrc\arrow\CMakeFiles\arrow_static.dir\arrow_static.pdb /FS -c
C:\Temp\arrow\cpp\src\arrow\array\diff.cc
C:\Temp\arrow\cpp\src\arrow\array\diff.cc(652): error C2666:
'arrow_vendored::date::format': 2 overloads have similar conversions
C:\Temp\arrow\cpp\src\arrow/vendored/datetime/date.h(6264): note: could be
'std::basic_string<char,std::char_traits<char>,std::allocator<char>>
arrow_vendored::date::format<_Elem,std::chrono::time_point<std::chrono::system_clock,std::chrono::duration<__int64,std::nano>>>(const
CharT *,const Streamable &)'
with
[
_Elem=char,
CharT=char,
Streamable=std::chrono::time_point<std::chrono::system_clock,std::chrono::duration<__int64,std::nano>>
]
C:\GitHub\STL\out\build\x64\out\inc\format(3100): note: or 'std::wstring
std::format<std::chrono::time_point<std::chrono::system_clock,std::chrono::duration<__int64,std::nano>>>(const
std::_Basic_format_string<wchar_t,std::chrono::time_point<std::chrono::system_clock,std::chrono::duration<__int64,std::nano>>>,std::chrono::time_point<std::chrono::system_clock,std::chrono::duration<__int64,std::nano>>
&&)' [found using argument-dependent lookup]
C:\GitHub\STL\out\build\x64\out\inc\format(3095): note: or 'std::string
std::format<std::chrono::time_point<std::chrono::system_clock,std::chrono::duration<__int64,std::nano>>>(const
std::_Basic_format_string<char,std::chrono::time_point<std::chrono::system_clock,std::chrono::duration<__int64,std::nano>>>,std::chrono::time_point<std::chrono::system_clock,std::chrono::duration<__int64,std::nano>>
&&)' [found using argument-dependent lookup]
C:\Temp\arrow\cpp\src\arrow\array\diff.cc(680): note: while trying to match the
argument list '(const _Elem *,
std::chrono::time_point<std::chrono::system_clock,std::chrono::duration<__int64,std::nano>>)'
with
[
_Elem=char
]
C:\Temp\arrow\cpp\src\arrow\array\diff.cc(680): note: note: qualification
adjustment (const/volatile) may be causing the ambiguity
C:\Temp\arrow\cpp\src\arrow\array\diff.cc(451): note: see reference to function
template instantiation 'arrow::Formatter
arrow::MakeFormatterImpl::MakeTimeFormatter<arrow::TimestampType,true>(const
std::string &)' being compiled
[...]{noformat}
The fix is very simple: remove the using-declaration and explicitly qualify
each call. I will submit a pull request on GitHub for this.
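As an illustration of the qualified-call pattern (not the actual patch), here is a minimal sketch. It assumes a hypothetical namespace `vendored_date` standing in for `arrow_vendored::date`, with a simplified `format` that merely streams the format string followed by the tick count; `format_nanos` is likewise a made-up helper.

```cpp
#include <chrono>
#include <cstdint>
#include <sstream>
#include <string>

namespace vendored_date {  // hypothetical stand-in for arrow_vendored::date

// Same parameter shape as the vendored overload shown in the error output,
// but with a deliberately trivial body for illustration.
template <class CharT, class Streamable>
std::basic_string<CharT> format(const CharT* fmt, const Streamable& value) {
  std::basic_ostringstream<CharT> os;
  os << fmt << value.count();
  return os.str();
}

}  // namespace vendored_date

// No using-declaration: the call is explicitly qualified, so namespace std is
// never searched via ADL even though the argument is a std::chrono type.
std::string format_nanos(const char* fmt, std::int64_t value) {
  using std::chrono::nanoseconds;
  return vendored_date::format(fmt, nanoseconds{value});
}
```

Because the call no longer relies on unqualified lookup, this form should compile the same way whether or not the Standard Library in use declares `std::format`.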
(Even if Apache Arrow isn't planning on using C++20 any time soon, making this
change will make it easier for MSVC to continue validating Apache Arrow with
the latest toolset and Standard changes, and it will remove a potential future
headache if and when Apache Arrow does migrate to later Standard versions.)