Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

SYCL free function namespace support #17585

Open
wants to merge 26 commits into
base: sycl
Choose a base branch
from

Conversation

dklochkov-emb
Copy link
Contributor

SYCL free function docs:
https://github.com/intel/llvm/blob/sycl/sycl/doc/extensions/proposed/sycl_ext_oneapi_free_function_kernels.asciidoc

Changes should generate right forward declarations of any function(not template) and shim functions in namespace or not.

@dklochkov-emb dklochkov-emb self-assigned this Mar 21, 2025
@dklochkov-emb dklochkov-emb requested review from a team as code owners March 21, 2025 17:01
@dklochkov-emb dklochkov-emb marked this pull request as draft March 21, 2025 17:01
@dklochkov-emb dklochkov-emb marked this pull request as ready for review March 26, 2025 09:49
Copy link
Contributor

@Fznamznon Fznamznon left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Initial feedback. What about the templates though?

I left a bunch of mechanical comments, perhaps this link might be useful
https://llvm.org/docs/CodingStandards.html

You might also consider adding a release note for your change.

[](raw_ostream &OS, const NamespaceDecl *NS) {}, OS, DC);
}

static bool insertFreeFunctionDeclaration(const PrintingPolicy &Policy,
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Maybe a documentation comment here makes sense? The function is named insertFreeFunctionDeclaration yet it returns true if a namespace was inserted. Normally functions that are named like that return success status in a bool flag. Alternatively the bool return value can be converted to a reference parameter with a good name.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Reworked to have a separate class to emit free function and shim for that. Added comments for each method.

Comment on lines +6776 to +6779
Policy.SuppressTagKeyword = true;
Param->getType().print(ParmListWithNamesOstream, Policy);
Policy.SuppressTagKeyword = false;
ParmListWithNamesOstream << " " << Param->getNameAsString();
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Could you please elaborate what this particular addition is trying to achieve, why the previous code did not suffice and how does it relate to namespace printing?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

In general, ParamList contains only parameter types, i.e. for the function
void some_func(float a, float* b)
ParamList contains {float, float*}
It was added new list to have additional list of parameters with names to pass already existed tests for free function which checked generated header.
flag
Policy.SuppressTagKeyword = true;
forces printing without type tags, i.e. without words class and struct.

PrintNamespaces(O, FD);
NSInserted = true;
}
O << FD->getReturnType().getAsString() << " ";
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I wonder why doing FD->print() does not suffice anymore?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

FD->print() works fine for free function declared outside of namespace. If it is declared in it, print method will emit:
void NS1::NS2::some_func(...)
for function
namespace NS1::NS2{ void some_func(...){} }

The form
void NS1::NS2::some_func(...){}
works fine but forward declaration
void NS1::NS2::some_func(...) does not work. That is why it is needed to emit namespace first into intermediate header for function declaration.
P.S.
I reviewed Policy to use FD->print() but did not find.

// CHECK-NEXT: }

// CHECK: Forward declarations of kernel and its argument types:
// CHECK: namespace ns {
// CHECK: namespace ns1 {
// CHECK-NEXT: template <typename A> class hasDefaultArg;
// CHECK-NEXT: }
// CHECK-NEXT: }}
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can we add more FE tests? Like with various combinations of namespaces around the free function kernel declaration? With inline namespace and not. Can we also test that codegen and semantic analysis is ok for free function kernels defined in a (maybe nested) namespace?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I added new e2e tests to check any possible namespaces: nested, anonymous, inline etc. Is it enough or add in these tests too? New tests do not check header directly but if something is emitted wrong, they will fail.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

SYCL compiler is complicated and has a lot of components. If we only have a e2e test and it fails suddenly (for example after a pulldown), it may take a while to identify which component now has a problem.
This is one of reasons why we normally check each component separately with unit tests and everything together in e2e tests. FE-only tests are "unit" tests in this scenario. They will help more quickly to identify that the problem is in FE. They will also help people to fix any FE problems without needing to have sycl rt and device. So, I still encourage to add FE-only tests.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Good point, I did not see that these tests are units. Added new checks.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Well they are "unit" because clang is enormous itself and has its own unit tests but in terms of SYCL compiler we can consider them as unit tests.

@dklochkov-emb dklochkov-emb force-pushed the sycl-free-function-namespace-support branch from de6c1a4 to bdb3967 Compare April 1, 2025 15:31
concept NumericType = std::is_arithmetic_v<std::remove_reference_t<T>>;

template <typename T>
requires NumericType<T>
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I don't know if we allow unconditional C++20 in tests. I remember someone from Codeplay coming to me saying they have runners with old gcc on it. On the other hand, this is core compiler feature, not STL, so maybe nobody would complain.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I looked at the current tests before adding flag %cxx_std_optionc++20, few of them use it already. I guess it is ok.

Comment on lines +97 to +98
const float expected = 3.14f + static_cast<float>(i);
assert(ptr[i] == expected &&
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Will it work with fast-math (or downstream)?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yes, it works with fast math

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants