Batched execution #100

Closed
jank324 opened this issue Nov 13, 2023 · 2 comments · Fixed by #116
Labels: enhancement (New feature or request)

Comments

jank324 commented Nov 13, 2023

It would be in the spirit of Cheetah's applications to enable batched execution. For example, a quadrupole's strength could be set to a vector of strengths, which would then result in a 3D tensor of transfer maps and allow parallel evaluation at much higher speed.
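
For concreteness, a minimal sketch of the idea in plain PyTorch, assuming a simple thin-lens quadrupole map (the 7×7 shape and the thin-lens formula are illustrative stand-ins, not Cheetah code):

```python
import torch

# Illustrative only: a vector of quadrupole strengths turns the single
# (7, 7) transfer map into a (n_batch, 7, 7) stack of transfer maps.
k1 = torch.linspace(-30.0, 30.0, 100)  # (n_batch,) candidate strengths
length = 0.1                           # quadrupole length in metres (made up)

tms = torch.eye(7).repeat(len(k1), 1, 1)  # (100, 7, 7) stack of maps
tms[:, 1, 0] = -k1 * length               # thin-lens horizontal focusing
tms[:, 3, 2] = k1 * length                # thin-lens vertical defocusing

# Tracking would then broadcast over the leading batch dimension, so all 100
# settings are evaluated in parallel instead of in a Python loop.
```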

jank324 added the enhancement label on Nov 13, 2023
cr-xu commented Nov 14, 2023

This is a tricky problem, but one that will be very helpful to have solved.

There are two cases to be considered: batch evaluation of the incoming_beam and batch evaluation of the segment_setting.

The incoming_beam situation is easier to consider and implement: we have a fixed Segment and transfer_map, so we would just need to broadcast the transfer map multiplication properly over the batched/stacked incoming beam (which is already halfway done in the ParticleBeam case anyway).
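
In plain PyTorch the broadcasting would look something like this (a sketch with made-up shapes; the 7-dimensional particle coordinates follow Cheetah's convention, the rest is illustrative):

```python
import torch

# Sketch of the incoming_beam case: one fixed transfer map, a batch of beams.
tm = torch.eye(7)                        # (7, 7) map of the fixed Segment
particles = torch.randn(32, 10_000, 7)   # (n_batch, n_particles, 7) coordinates

# torch.matmul broadcasts the (7, 7) map over the leading batch dimension,
# so all 32 beams are tracked in a single vectorized call.
outgoing = torch.matmul(particles, tm.transpose(-2, -1))  # (32, 10_000, 7)
```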

The changing segment_setting case is trickier, but also more important. Ultimately, we are modifying the network (the transfer map) during the forward pass. One concern is whether we will be able to keep the execution speed (somewhat) fast enough.
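
One possible route here (my speculation, not a settled design) is torch.func.vmap, which maps a single-setting tracking function over a batch of settings without a Python loop; the thin-lens map below is again just a stand-in for a real Cheetah transfer map:

```python
import torch
from torch.func import vmap

def track_one(k1: torch.Tensor, particles: torch.Tensor) -> torch.Tensor:
    # Build the 7x7 thin-lens quadrupole map functionally (no in-place
    # writes), which keeps the function vmap-compatible.
    e = torch.eye(7)
    tm = e - k1 * 0.1 * torch.outer(e[1], e[0]) + k1 * 0.1 * torch.outer(e[3], e[2])
    return particles @ tm.T

k1_batch = torch.linspace(-30.0, 30.0, 64)  # (64,) settings to try
particles = torch.randn(10_000, 7)          # one static incoming beam

# vmap vectorizes over the settings axis only; the beam is shared.
outgoing = vmap(track_one, in_dims=(0, None))(k1_batch, particles)  # (64, 10_000, 7)
```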

For bookkeeping purposes, the general feature I would like is the ability to quickly compute outgoing beam parameters for one static incoming beam and vectorized accelerator settings of the general form [..., n_batch, n_dim], where n_dim is the number of accelerator settings being changed.

Currently, I have to do something like:

```python
def forward(self, X: torch.Tensor) -> torch.Tensor:
    input_shape = X.shape
    X = X.reshape(-1, n_dim)
    Y = torch.zeros(X.shape[:-1])
    for i, x in enumerate(X):
        self.segment.Q1.k1 = x[0]  # Set the input parameters
        ...
        # Track the beam
        out_beam = self.segment(self.incoming_beam)
        # Compute the objective
        obj = ...
        Y[i] = obj
    return Y.reshape(input_shape[:-1])
```

which more or less scales linearly (i.e. poorly) with the batch size:

```
i 1.0
CPU times: user 1.55 ms, sys: 1.06 ms, total: 2.61 ms
Wall time: 1.85 ms
i 10.0
CPU times: user 4.6 ms, sys: 14 µs, total: 4.61 ms
Wall time: 4.65 ms
i 100.0
CPU times: user 38.6 ms, sys: 266 µs, total: 38.9 ms
Wall time: 39.4 ms
i 1000.0
CPU times: user 335 ms, sys: 1.22 ms, total: 336 ms
Wall time: 347 ms
i 10000.0
CPU times: user 3.23 s, sys: 15.6 ms, total: 3.25 s
Wall time: 3.26 s
```
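
For contrast, once batched transfer maps exist, the whole loop could collapse into one batched matrix multiplication. This sketch reuses the hypothetical thin-lens map and a made-up beam-size objective from the sketches above, so it illustrates the scaling argument rather than working Cheetah code:

```python
import torch

def forward_batched(X: torch.Tensor, particles: torch.Tensor) -> torch.Tensor:
    input_shape = X.shape
    X = X.reshape(-1, X.shape[-1])               # (n_batch, n_dim)
    tms = torch.eye(7).repeat(X.shape[0], 1, 1)  # (n_batch, 7, 7) maps
    tms[:, 1, 0] = -X[:, 0] * 0.1                # Q1.k1 for every batch entry
    tms[:, 3, 2] = X[:, 0] * 0.1
    out = torch.matmul(particles, tms.transpose(-2, -1))  # (n_batch, n_particles, 7)
    obj = out[..., 0].std(dim=-1)                # stand-in objective: beam size
    return obj.reshape(input_shape[:-1])
```

Because the Python loop is gone, the cost grows with the tensor sizes inside a single batched kernel rather than with the number of settings at the Python level.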

jank324 mentioned this issue Nov 17, 2023
jank324 commented Nov 17, 2023

Maybe we can take inspiration from how this was done here: https://github.com/UM-ARM-Lab/pytorch_kinematics
