Will you be covering parallelization of code which calls many levels of subroutines?
For example, say within main I've got a function called initialize_arrays that calls many different functions in order to set some initial values of some arrays, A, B, and C. And within these functions many other routines may be called. Say each function takes anywhere from 1-N input values in order to perform it's operations. and routine_1, routine_2, routine_3 are required to matrix A where routine_1 calls routine_1a, routine_1b, routine_1c, etc. At what levels do the parallel commands need to be implemented?
I am curious how the privatization commands should be implemented when filling out the arrays.
Sorry for the extreme generalization, let me know it if is not clear.
Thanks!