Run post-breach phase in separate thread #758

shreyamalviya · 2020-08-03T13:06:37Z

Fixes #696

codecov · 2020-08-03T13:12:12Z

Codecov Report

Merging #758 into develop will not change coverage.
The diff coverage is n/a.

@@           Coverage Diff            @@
##           develop     #758   +/-   ##
========================================
  Coverage    60.31%   60.31%           
========================================
  Files          161      161           
  Lines         4899     4899           
========================================
  Hits          2955     2955           
  Misses        1944     1944

Last update f0428d0...444c2cb.

ShayNehmad · 2020-08-06T10:29:57Z

While this is a good start, we'd like all PBAs to run simultaneously, not just have the PBA phase run asynchronously. So we want to open a thread for each PBA (perhaps with a limit of 4 threads running at the same time).
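A minimal sketch of what is being asked for here, under stated assumptions: the whole phase runs in one background thread, and inside it each PBA becomes its own task with at most 4 running at once. It uses `concurrent.futures.ThreadPoolExecutor` from the standard library, which is not what this PR ended up using. `FakePBA`, `run_all_pbas` and `start_post_breach_phase` are hypothetical names, not Monkey code.

```python
import threading
import time
from concurrent.futures import ThreadPoolExecutor

MAX_PBA_THREADS = 4  # assumed limit, taken from the suggestion in the comment


class FakePBA:
    """Hypothetical stand-in for a post-breach action."""

    def __init__(self, name, seconds):
        self.name = name
        self.seconds = seconds

    def run(self):
        time.sleep(self.seconds)  # simulates I/O-bound work
        return self.name


def run_all_pbas(pba_list, max_threads=MAX_PBA_THREADS):
    # Every PBA is its own task; the executor never runs more than
    # max_threads of them at the same time.
    with ThreadPoolExecutor(max_workers=max_threads) as executor:
        return list(executor.map(lambda pba: pba.run(), pba_list))


def start_post_breach_phase(pba_list):
    # The whole phase runs in a background thread, so the caller can
    # continue with other work and join the thread later.
    results = []
    thread = threading.Thread(
        target=lambda: results.extend(run_all_pbas(pba_list)),
        name="post-breach-phase",
    )
    thread.start()
    return thread, results
```

The caller keeps the returned thread and calls `join()` on it before reading `results`.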

ShayNehmad · 2020-08-07T23:00:26Z

Cool. Can you do some time measurements and see how much time this saves (and what happens if you increase the max thread limit)?

shreyamalviya · 2020-08-08T04:51:08Z

Running with no threading among the PBAs takes 246374421 nanoseconds (about 246 ms). With threading, 5 threads gives the best performance: 241448650 nanoseconds (about 241 ms).
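For reference, a comparison like this could be collected with a small harness along the following lines. This is only a sketch: the comment does not say how the numbers were measured, `fake_pba` and `measure` are hypothetical, and the sleeps stand in for real PBA work.

```python
import time
from multiprocessing.dummy import Pool  # thread-backed Pool


def fake_pba(seconds):
    """Hypothetical PBA: sleeps to imitate I/O-bound work."""
    time.sleep(seconds)
    return seconds


def elapsed_ns(func):
    # perf_counter_ns is a monotonic, integer-nanosecond clock (Python 3.7+).
    start = time.perf_counter_ns()
    func()
    return time.perf_counter_ns() - start


def measure(durations, thread_counts=(1, 2, 4, 5, 8)):
    # Time one sequential pass, then one pooled pass per thread count.
    timings = {"sequential": elapsed_ns(lambda: [fake_pba(d) for d in durations])}
    for count in thread_counts:
        def pooled(count=count):
            with Pool(count) as pool:
                pool.map(fake_pba, durations)
        timings[count] = elapsed_ns(pooled)
    return timings


if __name__ == "__main__":
    for label, nanoseconds in measure([0.05] * 10).items():
        print(label, nanoseconds)
```

Single runs are noisy, so repeating each configuration several times and comparing medians gives a fairer picture than one number per setting.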

monkey/infection_monkey/monkey.py

ShayNehmad · 2020-08-09T11:28:47Z

Hmm... Unfortunately, it seems like this makes almost no difference :(
Well, it seems this will shave 2 seconds off the Monkey's total runtime, since the PBAs themselves are executed async. This is not a huge improvement, but it's something :)

If you're done with testing I think we can merge.

We do learn a very important lesson from this: We should profile the Monkey execution and see how long each part takes, so we can understand where we might be able to parallelize and make it quicker. Please create an issue for that.

shreyamalviya · 2020-08-10T06:06:55Z

Yes, I'll create an issue. I'm also done testing so I'm going to go ahead and merge this in a while.

VakarisZ · 2020-08-10T09:38:45Z

monkey/infection_monkey/post_breach/post_breach_handler.py

+        pool = Pool(5)
+        pool.map(self.run_pba, self.pba_list)


Have you tried this with multithreading? Was performance worse? Processes take longer to spawn, but are truly "parallel". If the I/O takes long, maybe we don't really need parallel execution and just need to initiate all the I/O as early as possible?

shreyamalviya · 2020-08-10

I'm importing Pool from multiprocessing.dummy. From the Python documentation:

"multiprocessing.dummy replicates the API of multiprocessing but is no more than a wrapper around the threading module."

VakarisZ · 2020-08-11

Ahh, I see!
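The point made in this thread can be checked directly. In the sketch below, every task handed to a `multiprocessing.dummy` pool reports the process ID and thread name it ran under. `describe_worker` and `worker_info` are hypothetical helper names used only for illustration.

```python
import os
import threading
from multiprocessing.dummy import Pool  # same API as multiprocessing.Pool


def describe_worker(_):
    # Report where this task actually ran.
    return os.getpid(), threading.current_thread().name


def worker_info(task_count=10, pool_size=5):
    with Pool(pool_size) as pool:
        return pool.map(describe_worker, range(task_count))


if __name__ == "__main__":
    for pid, thread_name in worker_info():
        print(pid, thread_name)
```

Every task reports the parent's PID and a non-main thread name, so `Pool(5)` here means five threads, not five processes. In standard CPython builds these threads share the interpreter lock, which is why this approach mainly helps I/O-bound work.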

VakarisZ · 2020-08-10T09:44:52Z

I agree with Shay. Be wary of issues like this one: we should parallelize code only to increase performance, and if we need more performance, let's get it where it matters. Parallelizing just because we can is not the best way to go, as it introduces a lot of potential problems for a questionable return.

VakarisZ · 2020-08-10T09:45:26Z

monkey/infection_monkey/post_breach/post_breach_handler.py

+            LOG.debug("Executing PBA: '{}'".format(pba.name))
+            pba.run()
+        except Exception as e:
+            LOG.error("PBA {} failed. Error info: {}".format(pba.name, e))


Doesn't the logging get messed up?

shreyamalviya · 2020-08-10

No? It's just that each PBA's log lines are no longer grouped together.

VakarisZ · 2020-08-11

What I'm concerned about is the possibility of the log showing:

Executing PBA X
Output of PBA Y
Errors of PBA Y

This might be a bit misleading, but I think we'll manage.
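One possible way to keep interleaved output readable, sketched here as an assumption rather than something this PR does, is to tag every log line with the PBA it belongs to. The example uses the standard `logging.LoggerAdapter` and the built-in `threadName` record attribute. The logger name, `make_logger`, `run_pba` and `run_all` are hypothetical.

```python
import io
import logging
from multiprocessing.dummy import Pool  # thread-backed Pool


def make_logger(stream):
    logger = logging.getLogger("pba_logging_sketch")  # hypothetical logger name
    logger.setLevel(logging.DEBUG)
    logger.propagate = False
    logger.handlers.clear()
    handler = logging.StreamHandler(stream)
    # threadName is a standard LogRecord attribute; pba comes from the adapter.
    handler.setFormatter(logging.Formatter("[%(threadName)s] [%(pba)s] %(message)s"))
    logger.addHandler(handler)
    return logger


def run_pba(args):
    logger, name = args
    # The adapter attaches the PBA name to every record it emits.
    log = logging.LoggerAdapter(logger, {"pba": name})
    log.debug("Executing PBA")
    log.debug("PBA finished")


def run_all(names, stream):
    logger = make_logger(stream)
    with Pool(5) as pool:
        pool.map(run_pba, [(logger, name) for name in names])
    return stream.getvalue().splitlines()


if __name__ == "__main__":
    print("\n".join(run_all(["X", "Y", "Z"], io.StringIO())))
```

Lines from different PBAs may still appear in any order, but each one names its PBA, so output from Y is not mistaken for output from X.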

shreyamalviya requested review from ShayNehmad and VakarisZ August 3, 2020 13:06

shreyamalviya force-pushed the pba-threading branch 2 times, most recently from 6c51ac0 to d9cc851 Compare August 7, 2020 08:19

ShayNehmad reviewed Aug 9, 2020

monkey/infection_monkey/monkey.py (outdated)

shreyamalviya added 3 commits August 10, 2020 11:28

Run post-breach phase in separate thread

c0bff44

Make PBAs run parallely

7c108e1

Change max threads from 4 to 5 & modify log message

444c2cb

shreyamalviya force-pushed the pba-threading branch from d9cc851 to 444c2cb Compare August 10, 2020 06:00

shreyamalviya marked this pull request as ready for review August 10, 2020 06:06

VakarisZ reviewed Aug 10, 2020

VakarisZ approved these changes Aug 10, 2020

VakarisZ merged commit 62c4eeb into guardicore:develop Aug 11, 2020

shreyamalviya deleted the pba-threading branch September 2, 2020 18:47
