
Using ParallelTaskManager in ProjectPoints, ThreeBodyGFunctions and MatrixVectorProduct #1214

Open
wants to merge 12 commits into final-backpropegation
Conversation

gtribello (Member)

Description

@Iximiel this change uses the ParallelTaskManager for some other methods in PLUMED. In making these work, I made some changes to ParallelTaskManager. Can you check whether this revised code still works on the GPU, please? If it does, then I will merge it into the final-backpropegation branch.

In terms of running these things on the GPU, I reckon that you can try with MatrixVectorProductBase and with MatrixProductDiagonal. I think ProjectPoints will never run on the GPU. Similarly, ThreeBodyGFunctions uses lepton, so it will not run on the GPU. I don't think it is important to get this stuff running on the GPU, as none of it is very computationally expensive. I started with these classes because they were relatively easy to convert to the new way of doing things, so I could build some familiarity with this sort of conversion before starting on the hard stuff.

Target release

I would like my code to appear in release 2.11

Type of contribution
  • changes to code or doc authored by PLUMED developers, or additions of code in the core or within the default modules
  • changes to a module not authored by you
  • new module contribution or edit of a module authored by you
Copyright
  • I agree to transfer the copyright of the code I have written to the PLUMED developers or to the author of the code I am modifying.
  • the module I added or modified contains a COPYRIGHT file with the correct license information. Code should be released under an open source license. I also used the command cd src && ./header.sh mymodulename in order to make sure the headers of the module are correct.
Tests
  • I added a new regtest or modified an existing regtest to validate my changes.
  • I verified that all regtests are passed successfully on GitHub Actions.

Iximiel (Member) commented Mar 4, 2025

@gtribello What is the priority between the actions from this PR and volume and secondary structures?

Iximiel (Member) commented Mar 4, 2025

@gtribello matrix view is for working with sparse matrices?

gtribello (Member, Author)

> @gtribello What is the priority between the actions from this PR and volume and secondary structures?

secondary structure is way more important!

gtribello (Member, Author)

> @gtribello matrix view is for working with sparse matrices?

Yes, but only with the way I have implemented sparse matrices in PLUMED. There may be a better way to implement them in the longer term.
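
To sketch the idea (with illustrative names, not the actual PLUMED interface): each row of the matrix stores only its nonzero entries together with their column indices, so a task doing a matrix-vector product only loops over the nonzeros in its row.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical row-compressed view, for illustration only; the real
// bookkeeping in PLUMED may be organised differently.
struct SparseRowView {
  const double* values;        // nonzero values in this row
  const std::size_t* columns;  // column index of each nonzero
  std::size_t nnonzero;        // number of nonzeros in this row
};

// Matrix-vector product restricted to one row: y[row] = sum_k A[row][col_k] * x[col_k]
double rowDotProduct(const SparseRowView& row, const std::vector<double>& x) {
  double sum = 0.0;
  for (std::size_t k = 0; k < row.nnonzero; ++k) {
    sum += row.values[k] * x[row.columns[k]];
  }
  return sum;
}
```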

gtribello (Member, Author) commented Mar 6, 2025

Hi @Iximiel

I added an implementation of the RMSD action that underpins my implementation of the path CVs here, which can potentially be parallelised on the GPU. I think this is something we should work on getting onto the GPU once the secondary structure variables are done, for two reasons:

  1. It is a logical next step. Paths use RMSD much like the secondary structure variables, so if you have RMSD working for the secondary structure variables, it shouldn't be too hard to get it working for the path CVs as well.
  2. This is something that Francesco Gervasio (who is now paying some of the bills) might be very interested in having. I think he uses these PathCVs a lot, so if we have a fast implementation of them on the GPU, it would be a good thing.

I think the first step is to download this branch and check that you can still compile and run everything that you have done so far on the GPU (i.e. multicolvars and secondary structure variables). If that works, we can merge this into the final-backpropegation branch and then start work on the GPU version of RMSDVector.

It is perhaps worth noting that I had to make some changes to the way that ParallelTaskManager operates. The most significant one relaxes an assumption that I made in earlier versions of the code, so it is worth explaining some terminology that I have introduced in this regard.

So, as you know, a multicolvar basically calculates a vector or a set of vectors. These vectors are stored in PLMD::Value objects that are then passed to other actions. The main loop is parallelised over tasks, and each task calculates one element of each of the vectors stored in the PLMD::Value objects that the multicolvar passes on. Consequently, the old code assumed that, if an action has four PLMD::Value components, performTask will return four scalars.
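
In rough pseudo-C++, the old contract looked something like this (a sketch with made-up names, not the real performTask signature):

```cpp
#include <cstddef>
#include <vector>

// Sketch of the old assumption (illustrative only, not PLUMED's actual API):
// an action with N PLMD::Value components has each task write exactly one
// scalar per component, at the slot matching its task index.
void performTaskSketch(std::size_t task_index,
                       std::vector<std::vector<double>>& components) {
  for (std::size_t i = 0; i < components.size(); ++i) {
    // one scalar per PLMD::Value per task under the old contract
    components[i][task_index] = 0.0;  // placeholder for the computed value
  }
}
```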

In this version I have relaxed this assumption. There are thus two important quantities in ParallelActionsInput:

  1. input.ncomponents = Number of PLMD::Value objects that are being calculated
  2. input.nscalars = Number of scalars that are being calculated during the task.

In all the cases we have looked at so far, input.nscalars = input.ncomponents. In RMSDVector, and in some other things that we will get to in the coming weeks, input.nscalars > input.ncomponents. In other words, there will be tasks that calculate more than one of the scalars that are stored in the underlying PLMD::Value objects. I think I have made this change everywhere it needs to be made, but I thought I would explain it just in case I have made a mistake when modifying the GPU code.
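
As a sketch of what the relaxed contract means for the output layout (illustrative names only, not the actual data structures):

```cpp
#include <cstddef>
#include <vector>

// Illustrative sketch only: when input.nscalars > input.ncomponents a single
// task produces several of the scalars that end up in the PLMD::Value objects.
struct TaskBufferSketch {
  std::size_t ncomponents;     // number of PLMD::Value objects being filled
  std::size_t nscalars;        // scalars produced per task, >= ncomponents
  std::vector<double> buffer;  // size: ntasks * nscalars, written task by task
};

// Each task writes into its own contiguous slice of the shared buffer;
// the scalars are scattered into the PLMD::Value objects afterwards.
double* taskSlice(TaskBufferSketch& out, std::size_t task) {
  return out.buffer.data() + task * out.nscalars;
}
```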

To be clear, the secondary structure variables are still the main priority. Once those are done, though, it would be good to move on to getting this branch merged and the RMSDVector stuff working. You can find tests for this action in regtest/mapping. mapping/rt39 is a good place to start.

Iximiel (Member) commented Mar 6, 2025

As of now it compiles, but it crashes on the GPU.
I think there is some problem with how we should manage the ParallelActionsInput in the CPU/GPU data transfer, but I need to dig deeper into this.

gtribello (Member, Author)

OK. I imagine the most likely culprit is the std::vector called args.
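
For the record, the generic failure mode with a std::vector member is that the pointer it hides is only valid on the host. A schematic sketch (hypothetical structs, not the actual ParallelActionsInput definition):

```cpp
#include <cstddef>
#include <vector>

// Schematic only, not the real ParallelActionsInput. A std::vector member
// stores a pointer into host heap memory, so a shallow copy of this struct
// onto the device leaves that pointer dangling there.
struct InputWithVector {
  std::vector<double> args;  // host-only heap pointer hidden inside
};

// A flat layout that survives a byte-wise copy to the device, provided the
// buffer that args points to is allocated or mapped on the device separately.
struct FlatInput {
  const double* args;  // device-visible buffer, managed explicitly
  std::size_t nargs;   // element count travels alongside the pointer
};
```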
