
Introduce P3 and RocksDB fixes #624

Merged
merged 10 commits into master on Oct 4, 2024
Conversation

@shizzard (Collaborator) commented Oct 2, 2024

This PR covers several lines of work:

  • P3: fix configuration;
  • ar_kv: rework; introduce SST flushes, WAL syncs, and a proper termination sequence.

@shizzard (Collaborator Author) commented Oct 2, 2024

@ldmberman a short explanation of the ar_kv changes for you:

  • The reconnect mechanism introduced a race condition and was not used anyway, so it was removed.
  • The repair mechanism is actually inherited from leveldb and cannot repair CF databases, so it is dangerous to use: in the best case it will drop the data from the WAL; in the worst case it will apply only the default CF changes, corrupting the database. Repair was completely removed for this reason.
  • The termination sequence is now [memtable flush, WAL sync, close], which is sufficient for the data to be persisted (see the sketch below).
  • RocksDB will handle dangling corrupted records in the MANIFEST, SST files, or WAL if the process crashed mid-write, dropping the corrupted entries, so there is no need to do anything about that.
  • To make sure the CF databases are consistent, the atomic_flush option is applied to all of them: it ensures that all CFs are flushed atomically.

Worth reading, even though it is 10 years old: facebook/rocksdb#236 (comment) (the other comments are also useful).
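A minimal sketch of the termination order above, assuming the erlang-rocksdb binding API (rocksdb:flush/2, rocksdb:sync_wal/1, rocksdb:close/1); the actual ar_kv code may differ in the details:

%% Flush memtables to SST files, sync the WAL, then close the handle.
close_db(Db) ->
    %% 1. Flush the memtables to SST files and wait for completion.
    ok = rocksdb:flush(Db, [{wait, true}]),
    %% 2. Make sure whatever still sits only in the WAL reaches disk.
    ok = rocksdb:sync_wal(Db),
    %% 3. Release the handle.
    rocksdb:close(Db).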

@shizzard (Collaborator Author) commented Oct 2, 2024

Update: I found some traces of CF support being added to the repair mechanism, but I cannot make it work either way, neither with the Erlang bindings nor with the RocksDB CLI tool (ldb).
trace commit

{exception, io_lib:format("~p", [Exc])}]),
{error, Exc};
_ ->
case reconnect(Name, Ref) of
Collaborator

The new code discards the retry functionality. Is this what you were referring to in your PR comment about the reconnect functionality not working right? If the retry didn't work or, worse, introduced a race condition, then I agree it's good to remove - but I just want to double-check that we're not accidentally discarding a valuable bit of retry logic.

The reconnect mechanism introduced a race condition and was not used anyway, so it was removed.

Collaborator

Ah, I see. All methods were called with RetryCount set to 1? In that case, I completely agree - kill the retry functionality

@shizzard (Collaborator Author) commented Oct 2, 2024

Yes, but the problem was bigger than that.
The reconnect was called from functions that were executed on the caller side (not in the ar_kv process), so these calls could happen concurrently, several at once. This means that if the database is down for some reason and the db reference is outdated, several other processes will attempt a gen_server:call demanding that the database be reconnected. These calls will be serialized in the process mailbox, and the gen_server will run the close/open sequence several times. While the database is closed, other processes will find the database reference dead, will call for reconnects, and this will never end.
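A hypothetical, simplified illustration of the removed pattern (not the actual ar_kv code; the function and message names are made up): every caller that sees a dead reference issues its own reconnect call, so N concurrent callers queue N reconnect requests in the server mailbox and the gen_server runs the close/open sequence N times.

get(Name, Ref, Key) ->
    case rocksdb:get(Ref, Key, []) of
        {error, _} ->
            %% Caller-side retry: each caller independently asks the
            %% ar_kv server for a fresh reference. With many concurrent
            %% callers this queues many reconnect requests at once.
            {ok, NewRef} = gen_server:call(ar_kv, {reconnect, Name}),
            rocksdb:get(NewRef, Key, []);
        Result ->
            Result
    end.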

Collaborator

ah got it - sounds like a mess. Good call removing.

Member

These calls will be serialized in the process mailbox, and the gen_server will run the close/open sequence several times. While the database is closed, other processes will find the database reference dead, will call for reconnects, and this will never end.

This is not true: the first reconnect changes the reference and the subsequent processes simply pick it up - https://github.com/ArweaveTeam/arweave/blob/master/apps/arweave/src/ar_kv.erl#L171

Nevertheless, we probably do not need the reconnect functionality now so it makes sense to remove it.

@shizzard (Collaborator Author) commented Oct 4, 2024

@ldmberman yes, that's the race condition: several processes will hit the same gen_server:call at the same time and the ar_kv server will run the reconnect sequence several times.
If you're referring to the fact that one database is only used by one process (I'm not sure that this is true), then yes, there is no race condition, simply because the race involves only one process.

@ldmberman (Member) commented:

@ldmberman a short explanation of the ar_kv changes for you:

  • The reconnect mechanism introduced a race condition and was not used anyway, so it was removed.
  • The repair mechanism is actually inherited from leveldb and cannot repair CF databases, so it is dangerous to use: in the best case it will drop the data from the WAL; in the worst case it will apply only the default CF changes, corrupting the database. Repair was completely removed for this reason.
  • The termination sequence is now [memtable flush, WAL sync, close], which is sufficient for the data to be persisted.
  • RocksDB will handle dangling corrupted records in the MANIFEST, SST files, or WAL if the process crashed mid-write, dropping the corrupted entries, so there is no need to do anything about that.
  • To make sure the CF databases are consistent, the atomic_flush option is applied to all of them: it ensures that all CFs are flushed atomically.

Worth reading, even though it is 10 years old: facebook/rocksdb#236 (comment) (the other comments are also useful).

This is a relatively fresh discussion of the repair procedure. The key point from there:

The WAL is processed afterwards, when the insertions for new column families are ignored as they are not represented in the recovered manifest.

This is (so far) in line with what @shizzard observed locally in tests. There are no other caveats mentioned, so I think it is only about flushing the CFs once they are created (no need to flush them explicitly later on). In any case, @shizzard and I had a discussion and decided to remove the repair code for now.

Regarding the atomic_flush option, we do not need it because we do not use RocksDB transactions. I would not introduce it until we need it.
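For reference, a rough sketch of what enabling the option at open time could look like, assuming the erlang-rocksdb binding forwards an atomic_flush DB option to RocksDB (an assumption, and the column family names below are placeholders); as noted above, the PR does not end up needing it:

open_with_cfs(Path) ->
    DbOpts = [
        {create_if_missing, true},
        {create_missing_column_families, true},
        %% assumption: forwarded to RocksDB's atomic_flush DB option
        {atomic_flush, true}
    ],
    CFs = [{"default", []}, {"cf1", []}, {"cf2", []}],
    %% returns {ok, Db, CfHandles} on success
    rocksdb:open_with_cf(Path, DbOpts, CFs).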

@shizzard (Collaborator Author) commented Oct 3, 2024

@ldmberman @JamesPiechota I think it is ready to merge, unless you have any changes in mind.

@shizzard merged commit f59047b into master on Oct 4, 2024
65 checks passed