
Feature/verify tool sample #722

Open · wants to merge 3 commits into master
Conversation

ldmberman (Member)

No description provided.

{seek_offset, SeekOffset},
{reply, io_lib:format("~p", [Reply])},
{is_recorded_unpacked, io_lib:format("~p", [UnpackedReply])}]),
case RequestOrigin of
Collaborator
log_chunk_error should already skip logging for http and tx_data origins - is there an origin that I missed? I can add it to the matching clause

https://github.com/ArweaveTeam/arweave/pull/722/files#diff-2934ddad2ed46ac194a6be7abde4be6d440bc7565e1128bba234fad9fd2bd5d2L1489-L1494
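For reference, a minimal sketch of the clause-based skipping the comment describes; the arity, the extra-logs argument, and the log call are illustrative assumptions, not the PR's actual code:

%% Skip logging for chunk requests originating from HTTP or tx-data lookups;
%% log everything else. (Hypothetical shape; see the linked diff for the real clauses.)
log_chunk_error(http, _Event, _ExtraLogs) ->
    ok;
log_chunk_error(tx_data, _Event, _ExtraLogs) ->
    ok;
log_chunk_error(_RequestOrigin, Event, ExtraLogs) ->
    ?LOG_WARNING([{event, Event} | ExtraLogs]).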

%% such that if an offset is sampled, no other offsets are selected from the
%% open interval (Offset - ?DATA_CHUNK_SIZE, Offset + ?DATA_CHUNK_SIZE).
generate_sample_offsets(Start, End, Count) when is_integer(Start), is_integer(End) ->
Candidates = lists:seq(Start + 1, End, ?DATA_CHUNK_SIZE),
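A minimal sketch of one way the exclusion property stated above can be satisfied (the PR's actual implementation may differ): consecutive candidates are already ?DATA_CHUNK_SIZE apart, so shuffling the candidate list and taking the first Count elements never picks two offsets within the open interval.

generate_sample_offsets(Start, End, Count) when is_integer(Start), is_integer(End) ->
    %% Chunk-aligned candidates; any two distinct entries differ by at least
    %% ?DATA_CHUNK_SIZE, so any Count-subset respects the exclusion interval.
    Candidates = lists:seq(Start + 1, End, ?DATA_CHUNK_SIZE),
    Shuffled = [O || {_, O} <- lists:sort([{rand:uniform(), O} || O <- Candidates])],
    lists:sublist(Shuffled, Count).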
Collaborator

For a full 3.6 TB partition I think Candidates will be a list of about 14.4 million entries, or have I misread?

Could that become a memory issue (e.g. if running verify on multiple storage modules concurrently)?
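A rough check of that estimate, assuming 256 KiB chunks and a 64-bit VM where each list cell costs two words (16 bytes) and the offsets fit in immediate integers:

3.6e12 B / 262,144 B per chunk ≈ 13.7 million candidate offsets
13.7e6 cells × 16 B per cell ≈ 220 MB of list data per storage module

So the quoted figure is the right order of magnitude, and running verify on several storage modules concurrently would multiply that footprint.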

%% Uses generate_sample_offsets/3 to obtain offsets (with exclusion)
%% and then queries ar_data_sync:get_chunk/2 with options to trigger unpacking.
sample_random_chunks(Count, Packing, Start, End, StoreID) ->
Offsets = generate_sample_offsets(Start, End, Count),
Collaborator

Rather than precalculating the list of offsets, what about sampling one offset at a time randomly from the range and, if that chunk exists on disk, adding the offset to a set so we never try it again? Maybe this would avoid the 14.4M-offset list?
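A minimal sketch of that suggestion; the function name, the sets-based bookkeeping, and the assumption that Count does not exceed the number of chunk-aligned candidates in (Start, End] are illustrative, not part of the PR:

%% Draws Count distinct chunk-aligned offsets one at a time instead of
%% materializing the full candidate list; already-seen offsets are kept in a
%% set so they are never retried.
sample_offsets_lazily(Count, Start, End) ->
    sample_offsets_lazily(Count, Start, End, sets:new(), []).

sample_offsets_lazily(0, _Start, _End, _Tried, Acc) ->
    Acc;
sample_offsets_lazily(Count, Start, End, Tried, Acc) ->
    NumCandidates = (End - Start + ?DATA_CHUNK_SIZE - 1) div ?DATA_CHUNK_SIZE,
    Offset = Start + 1 + (rand:uniform(NumCandidates) - 1) * ?DATA_CHUNK_SIZE,
    case sets:is_element(Offset, Tried) of
        true ->
            sample_offsets_lazily(Count, Start, End, Tried, Acc);
        false ->
            sample_offsets_lazily(Count - 1, Start, End,
                    sets:add_element(Offset, Tried), [Offset | Acc])
    end.

This keeps memory proportional to the number of sampled offsets rather than to the partition size.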
