Bug summary Cray CCE 15.0.0 objects to how <code class="notransla

The OpenMP specification says in <a href="https://www.openmp.org/spec-html/5.0/openmps

Thanks for the insights <a class="user-mention notranslate" data-hovercard-type="user"

<a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/us

omp.library-only with Cray OpenMP about adaptivecpp HOT 6 OPEN

will-saunders-ukaea commented on June 10, 2024

omp.library-only with Cray OpenMP

from adaptivecpp.

Comments (6)

tomdeakin commented on June 10, 2024 3

Ah yes, it's the same behaviour as Will sketched out using a worker thread. I'll ask for some advice on this from our local friendly neighbourhood OpenMP runtime expert... Standby!

from adaptivecpp.

illuhad commented on June 10, 2024

The OpenMP specification says in https://www.openmp.org/spec-html/5.0/openmpsu112.html

The omp_get_max_threads routine returns an upper bound on the number of threads that could be used to form a new team if a parallel construct without a num_threads clause were encountered after execution returns from this routine.

It only talks about future parallel constructs, not about current parallel constructs, so to me it does not sound like it is illegal to use outside of a parallel region.

Perhaps we can ask someone who knows more about OpenMP - @tomdeakin perhaps?

from adaptivecpp.

will-saunders-ukaea commented on June 10, 2024

I agree that omp_get_max_threads should be callable outside a parallel region. I did some more digging with CCE - it strongly objects to OpenMP calls in additional threads:

#include <omp.h>
#include <iostream>
#include <thread>

int main(int argc, char ** argv){

auto lambda_max = []() -> void{
  std::cout << omp_get_max_threads() << std::endl;
};

// Works
//lambda_max();
//lambda_max();

// fails
std::thread t1(lambda_max);
t1.join();
std::thread t2(lambda_max);
t2.join();

return 0;
}

Fails with:

$ OMP_NUM_THREADS=1 ./a.out
1
CCE OpenMP fatal error: omp_get_max_threads attempted from non-OpenMP thread

A parallel region is also rejected:

#include <omp.h>
#include <iostream>
#include <thread>

int main(int argc, char ** argv){

auto lambda_omp = []() -> void{
#pragma omp parallel
{
}
};

std::thread t1(lambda_omp);
t1.join();
std::thread t2(lambda_omp);
t2.join();
return 0;
}

Gives:

CCE OpenMP fatal error: OpenMP parallel attempted from non-OpenMP thread

from adaptivecpp.

tomdeakin commented on June 10, 2024

There is one more relevant bit of the OpenMP spec which I think I think is the bit that's causing the problem. The API call says:

The value returned by omp_get_max_threads is the value of the first element of the nthreads-var ICV of the current task.

This implicitly says that an OpenMP thread must call it because there must be a current task (everything in OpenMP is a task). OpenMP bootstraps this by making an initial thread when the program starts.
So in that std::thread example, the thread calling the API routine doesn't have all the OpenMP state.
Calling the lambda works in your example above because it's the initial thread that's calling the API routine.

What thread in AdaptiveCpp is trying to call the OpenMP API functions?

from adaptivecpp.

illuhad commented on June 10, 2024

Thanks for the insights @tomdeakin :)

The only use of this function I've found is here:

AdaptiveCpp/include/hipSYCL/glue/omp/omp_kernel_launcher.hpp

Line 120 in 367cb7a

int max_threads = get_max_num_threads();

which means it's invoked right before the parallel for which executes the kernel. All of this is executed in a worker thread of the runtime, which is then probably where the problem originates: The worker thread is not an OpenMP thread.

This is very unfortunate, because it basically means that OpenMP is unusable in multi-threaded applications. Or have I misunderstood something?

from adaptivecpp.

will-saunders-ukaea commented on June 10, 2024

@tomdeakin Don't suppose there was any news on this?

from adaptivecpp.

omp.library-only with Cray OpenMP about adaptivecpp HOT 6 OPEN

Comments (6)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent