-
Notifications
You must be signed in to change notification settings - Fork 767
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
SYCL free function namespace support #17585
base: sycl
Are you sure you want to change the base?
SYCL free function namespace support #17585
Conversation
[SYCL] fix formatting [SYCL] remove unnecessary space
…namespace-support
[SYCL] include additional header in tests [SYCL] do not iterate through arguments twice
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Initial feedback. What about the templates though?
I left a bunch of mechanical comments, perhaps this link might be useful
https://llvm.org/docs/CodingStandards.html
You might also consider adding a release note for your change.
clang/test/CodeGenSYCL/free_function_default_template_arguments.cpp
Outdated
Show resolved
Hide resolved
clang/lib/Sema/SemaSYCL.cpp
Outdated
[](raw_ostream &OS, const NamespaceDecl *NS) {}, OS, DC); | ||
} | ||
|
||
static bool insertFreeFunctionDeclaration(const PrintingPolicy &Policy, |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Maybe a documentation comment here makes sense? The function is named insertFreeFunctionDeclaration
yet it returns true if a namespace was inserted. Normally functions that are named like that return success status in a bool flag. Alternatively the bool return value can be converted to a reference parameter with a good name.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Reworked to have a separate class to emit free function and shim for that. Added comments for each method.
Policy.SuppressTagKeyword = true; | ||
Param->getType().print(ParmListWithNamesOstream, Policy); | ||
Policy.SuppressTagKeyword = false; | ||
ParmListWithNamesOstream << " " << Param->getNameAsString(); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Could you please elaborate what this particular addition is trying to achieve, why the previous code did not suffice and how does it relate to namespace printing?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
In general, ParamList
contains only parameter types, i.e. for the function
void some_func(float a, float* b)
ParamList
contains {float, float*}
It was added new list to have additional list of parameters with names to pass already existed tests for free function which checked generated header.
flag
Policy.SuppressTagKeyword = true;
forces printing without type tags, i.e. without words class
and struct
.
clang/lib/Sema/SemaSYCL.cpp
Outdated
PrintNamespaces(O, FD); | ||
NSInserted = true; | ||
} | ||
O << FD->getReturnType().getAsString() << " "; |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I wonder why doing FD->print() does not suffice anymore?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
FD->print()
works fine for free function declared outside of namespace. If it is declared in it, print
method will emit:
void NS1::NS2::some_func(...)
for function
namespace NS1::NS2{ void some_func(...){} }
The form
void NS1::NS2::some_func(...){}
works fine but forward declaration
void NS1::NS2::some_func(...)
does not work. That is why it is needed to emit namespace first into intermediate header for function declaration.
P.S.
I reviewed Policy to use FD->print()
but did not find.
// CHECK-NEXT: } | ||
|
||
// CHECK: Forward declarations of kernel and its argument types: | ||
// CHECK: namespace ns { | ||
// CHECK: namespace ns1 { | ||
// CHECK-NEXT: template <typename A> class hasDefaultArg; | ||
// CHECK-NEXT: } | ||
// CHECK-NEXT: }} |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Can we add more FE tests? Like with various combinations of namespaces around the free function kernel declaration? With inline namespace and not. Can we also test that codegen and semantic analysis is ok for free function kernels defined in a (maybe nested) namespace?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I added new e2e tests to check any possible namespaces: nested, anonymous, inline etc. Is it enough or add in these tests too? New tests do not check header directly but if something is emitted wrong, they will fail.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
SYCL compiler is complicated and has a lot of components. If we only have a e2e test and it fails suddenly (for example after a pulldown), it may take a while to identify which component now has a problem.
This is one of reasons why we normally check each component separately with unit tests and everything together in e2e tests. FE-only tests are "unit" tests in this scenario. They will help more quickly to identify that the problem is in FE. They will also help people to fix any FE problems without needing to have sycl rt and device. So, I still encourage to add FE-only tests.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Good point, I did not see that these tests are units. Added new checks.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Well they are "unit" because clang is enormous itself and has its own unit tests but in terms of SYCL compiler we can consider them as unit tests.
sycl/test/e2e_test_requirements/no-unsupported-without-info.cpp
Outdated
Show resolved
Hide resolved
de6c1a4
to
bdb3967
Compare
concept NumericType = std::is_arithmetic_v<std::remove_reference_t<T>>; | ||
|
||
template <typename T> | ||
requires NumericType<T> |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I don't know if we allow unconditional C++20 in tests. I remember someone from Codeplay coming to me saying they have runners with old gcc on it. On the other hand, this is core compiler feature, not STL, so maybe nobody would complain.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I looked at the current tests before adding flag %cxx_std_optionc++20
, few of them use it already. I guess it is ok.
const float expected = 3.14f + static_cast<float>(i); | ||
assert(ptr[i] == expected && |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Will it work with fast-math (or downstream)?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yes, it works with fast math
SYCL free function docs:
https://github.com/intel/llvm/blob/sycl/sycl/doc/extensions/proposed/sycl_ext_oneapi_free_function_kernels.asciidoc
Changes should generate right forward declarations of any function(not template) and shim functions in namespace or not.