Performance enhancements #372

InnovativeInventor · 2022-01-02T01:42:58Z

This is a work-in-progress, draft PR with some performance enhancements. In particular,

- __slots__ is enabled for a few frequently created classes (Assignment and SubgraphView), resulting in minor performance improvements
- caching for iterating over the neighbors of all flips is implemented to improve flow and cut edge calculation runtimes
- caching for iterating over the neighbors of a node in a graph is implemented (since networkx's neighbors function is quite expensive and a partition's graph will always remain static)
- Graph.nodes and Graph.edges are cached
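
To make the first and third items concrete, here is a minimal sketch of both techniques. The class names (SubgraphViewSketch, StaticGraphSketch) and the plain-dict adjacency are assumptions for illustration; this is not the code in the PR, which works on networkx-backed graphs.

```python
import functools


class SubgraphViewSketch:
    """Hypothetical stand-in for a small, frequently created class."""

    # Declaring __slots__ removes the per-instance __dict__, which saves
    # memory and makes instance creation slightly cheaper.
    __slots__ = ("graph", "nodes")

    def __init__(self, graph, nodes):
        self.graph = graph
        self.nodes = nodes


class StaticGraphSketch:
    """Hypothetical graph whose adjacency never changes after construction."""

    def __init__(self, adjacency):
        # Copy into immutable tuples so cached results cannot go stale.
        self._adjacency = {node: tuple(nbrs) for node, nbrs in adjacency.items()}
        self.lookups = 0  # counts uncached lookups, for illustration only
        # Build the cache per instance so it is garbage-collected with the graph.
        self.neighbors = functools.lru_cache(maxsize=None)(self._neighbors)

    def _neighbors(self, node):
        self.lookups += 1
        return self._adjacency[node]


graph = StaticGraphSketch({0: [1, 2], 1: [0], 2: [0]})
view = SubgraphViewSketch(graph, (0, 1))
first = graph.neighbors(0)
second = graph.neighbors(0)  # served from the cache, no second lookup
```

Caching is only safe here because the graph is treated as static; if the adjacency could change, the cached neighbor tuples would silently go out of date.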

Combined, these speedups improve both standard GerryChain run performance and replay performance. In particular, the time it takes to run 100 steps on PA goes from 29.8 seconds to 26 seconds (~13% decrease) and the time spent on each loop when replaying goes from 610 ms (based on #363) to 469 ms (~23% decrease).

Some caveats:

- I used functools.cache, which is a Python 3.9 feature. We should switch to functools.lru_cache or something with better compatibility with older versions of Python before we merge this in.
- For caching reasons, GerryChain Partitions will no longer accept a networkx graph. To convert a networkx graph to a gerrychain.Graph object, a new method, Graph.from_networkx, is introduced in this PR. I had to rewrite the tests accordingly.
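
For the first caveat, one possible compatibility shim looks like the following. It relies on the fact that functools.cache is equivalent to functools.lru_cache(maxsize=None); the sorted_edge function is a hypothetical example and not part of GerryChain.

```python
import functools

# functools.cache was added in Python 3.9. On older versions, fall back to
# lru_cache(maxsize=None), which provides the same unbounded cache.
try:
    cache = functools.cache
except AttributeError:  # Python < 3.9
    cache = functools.lru_cache(maxsize=None)


@cache
def sorted_edge(u, v):
    """Hypothetical helper: return an edge in canonical (sorted) order."""
    return tuple(sorted((u, v)))


edge = sorted_edge(2, 1)
edge_again = sorted_edge(2, 1)  # second call is a cache hit
```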

InnovativeInventor · 2022-01-02T02:22:03Z

Note that the behavior of Graph.nodes and Graph.edges will not be backwards-compatible. It may be better to rename the cached versions to Graph._cached_nodes and Graph._cached_edges.

InnovativeInventor · 2022-01-04T21:34:05Z

Some more benchmark stats with 11906d0 (replay perf on 1000-step chains, PA congressional):

- this PR (Performance enhancements #372): 220 it/s
- Parker's PR (Use __slots__ in Partition #363): 170 it/s
- main branch: 72 it/s

The FrozenGraph class also opens the door to using faster read-only data structures (generated when the class is instantiated) behind the scenes without breaking backwards-compatibility.
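
As a rough illustration of that idea, the sketch below builds immutable lookup structures once, at construction time. It is an assumption-laden toy (FrozenGraphSketch, a plain-dict adjacency, and orderable node labels are all hypothetical) and not the FrozenGraph implementation from this PR.

```python
from types import MappingProxyType


class FrozenGraphSketch:
    """Hypothetical read-only graph wrapper, for illustration only."""

    __slots__ = ("_neighbors", "nodes", "edges")

    def __init__(self, adjacency):
        # Build immutable lookup structures once, when the object is created.
        frozen = {node: tuple(nbrs) for node, nbrs in adjacency.items()}
        self._neighbors = MappingProxyType(frozen)  # read-only dict view
        self.nodes = tuple(frozen)
        # Keep each undirected edge once; assumes node labels are orderable.
        self.edges = tuple(
            (u, v) for u, nbrs in frozen.items() for v in nbrs if u <= v
        )

    def neighbors(self, node):
        return self._neighbors[node]


g = FrozenGraphSketch({0: [1, 2], 1: [0, 2], 2: [0, 1]})
```

Because callers only see the nodes, edges, and neighbors interface, the containers behind it could later be swapped for something faster without changing calling code.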

InnovativeInventor · 2022-01-05T00:18:32Z

Updated stats (wrote some better benchmarks):

- 247 it/s (my perf tuning + Parker's slots)
- 188 it/s (Parker's slots)
- 82 it/s (main branch)

InnovativeInventor · 2022-01-05T03:36:52Z

Note: I will break this up into multiple PRs, so it'll be easier to review.

InnovativeInventor · 2022-01-05T15:13:29Z

I realized that I was profiling #363 incorrectly -- the performance is now as follows:

- 353 it/s (my perf tuning + Parker's slots)
- 194 it/s (Parker's slots)
- 189 it/s (main branch)

My bad -- looks like __slots__ doesn't help that much after all.
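
Put as ratios against the main branch, the corrected numbers work out to roughly 1.03x for __slots__ alone and 1.87x for the combined changes. A short check of that arithmetic, using only the three figures quoted above:

```python
# Throughput figures from the corrected benchmark, in iterations per second.
main_branch = 189
slots_only = 194
combined = 353


def speedup(new, old):
    """Ratio of new throughput to old throughput."""
    return new / old


slots_gain = speedup(slots_only, main_branch)
combined_gain = speedup(combined, main_branch)
print(f"slots only: {slots_gain:.2f}x, combined: {combined_gain:.2f}x")
```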

codecov-commenter · 2022-01-06T19:15:36Z

Codecov Report

Merging #372 (a318d62) into main (f9f73a5) will decrease coverage by 0.05%.
The diff coverage is 70.00%.

❗ Current head a318d62 differs from pull request most recent head 05233ce. Consider uploading reports for the commit 05233ce to get more accurate results

@@            Coverage Diff             @@
##             main     #372      +/-   ##
==========================================
- Coverage   87.89%   87.83%   -0.06%     
==========================================
  Files          37       37              
  Lines        1660     1669       +9     
==========================================
+ Hits         1459     1466       +7     
- Misses        201      203       +2

| Impacted Files | Coverage Δ | |
| --- | --- | --- |
| gerrychain/graph/graph.py | `85.95% <66.66%> (-0.78%)` | ⬇️ |
| gerrychain/partition/assignment.py | `96.00% <100.00%> (+0.05%)` | ⬆️ |

Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Last update f9f73a5...05233ce.

InnovativeInventor · 2022-01-11T19:01:55Z

Memory leak! See GerryChain/gerrychain/partition/partition.py, line 121 in c55f382:

@functools.cache
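
For context, a common way this kind of leak arises is when the decorator is applied to an instance method: the cache lives on the class and stores self as part of each key, so instances are never freed. The sketch below assumes that is what happened here. Both classes are hypothetical, and lru_cache(maxsize=None) is used as the equivalent of functools.cache.

```python
import functools
import gc
import weakref


class LeakyPartition:
    """Hypothetical class: the class-level cache keeps every `self` alive."""

    @functools.lru_cache(maxsize=None)  # same behavior as functools.cache
    def crosses_parts(self, edge):
        return edge[0] != edge[1]


class PerInstancePartition:
    """Hypothetical alternative: the cache is a dict owned by the instance."""

    def __init__(self):
        self._crosses_cache = {}

    def crosses_parts(self, edge):
        try:
            return self._crosses_cache[edge]
        except KeyError:
            result = self._crosses_cache[edge] = edge[0] != edge[1]
            return result


def survives_deletion(cls):
    """Return True if an instance is still alive after its last name is deleted."""
    obj = cls()
    obj.crosses_parts((0, 1))
    ref = weakref.ref(obj)
    del obj
    gc.collect()
    return ref() is not None


leaky = survives_deletion(LeakyPartition)
per_instance = survives_deletion(PerInstancePartition)
```

With the per-instance dictionary, the cached results are released together with the object that owns them, which matters in a long chain run that creates a new partition at every step.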


InnovativeInventor · 2022-01-19T15:48:21Z

Reminder to self: Partition.crosses_parts caching increases performance (but the previous impl introduced a memory leak).

Note that the "seed_and_freeze" test changed due to the way flows are being cached. Hopefully there are no correctness issues introduced here.


InnovativeInventor · 2022-05-16T15:57:18Z

Closing due to the successful merges of all the broken out PRs. The retworkx PR is still a work-in-progress, but will be ready soon.

InnovativeInventor force-pushed the perf-tuning branch from 7cbf632 to 11906d0 Compare January 4, 2022 17:34

InnovativeInventor force-pushed the perf-tuning branch 4 times, most recently from 02e5d3c to d884a4d Compare January 5, 2022 03:33

InnovativeInventor changed the title ~~Perf tuning~~ Performance enhancements Jan 5, 2022

InnovativeInventor force-pushed the perf-tuning branch 3 times, most recently from e063a39 to 05233ce Compare January 6, 2022 19:12

InnovativeInventor force-pushed the perf-tuning branch 8 times, most recently from 1cc2d7c to 848ea4e Compare January 6, 2022 19:47

This was referenced Jan 6, 2022

Add neighborhood flips caching #373

Merged

More __slots__ #374

Merged

Add option to disable cut_edges updater with use_cut_edges flag #375

Merged

InnovativeInventor added the work-in-progress label Jan 6, 2022

InnovativeInventor force-pushed the perf-tuning branch 3 times, most recently from ab7acfa to a8e1402 Compare January 10, 2022 18:37

InnovativeInventor force-pushed the perf-tuning branch from 350f285 to c55f382 Compare January 10, 2022 21:45

InnovativeInventor force-pushed the perf-tuning branch from ce6e5a6 to fde511f Compare January 19, 2022 15:18

Replace references to assignment[node] to assignment.mapping[node] to reduce __getitem__ overhead

f9c41c9

InnovativeInventor force-pushed the perf-tuning branch from fde511f to 5aa7733 Compare January 19, 2022 15:29

Refactor out Assignment.update (replace with Assignment.update_flows) to prevent double-calculating flows

f47e469

InnovativeInventor force-pushed the perf-tuning branch from 5aa7733 to 564da2a Compare January 19, 2022 15:36

Change flows_from_changes to expect partitions as arguments to enable caching

a09078a

InnovativeInventor force-pushed the perf-tuning branch from 564da2a to f06806b Compare January 19, 2022 15:40

Simplify Assignment iterators (no performance improvement)

4e46230

InnovativeInventor force-pushed the perf-tuning branch 2 times, most recently from 1d4a69e to e387ce4 Compare January 19, 2022 15:47

InnovativeInventor force-pushed the perf-tuning branch 5 times, most recently from cc7be93 to db18c15 Compare January 19, 2022 16:19

InnovativeInventor and others added 3 commits January 19, 2022 12:52

Add FrozenGraph implementation (requires Python 3.8)

1225215

Note that the "seed_and_freeze" test changed due to the way flows are being cached. Hopefully there are no correctness issues introduced here.

Add __slots__ and replace PopulatedGraph.degree with direct dictionary lookup

f885a30

Switch to using more performant/preferred lookup call

a31725e

InnovativeInventor force-pushed the perf-tuning branch from db18c15 to a31725e Compare January 19, 2022 17:53

InnovativeInventor marked this pull request as ready for review January 19, 2022 18:07

InnovativeInventor closed this May 16, 2022
