You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I have observed a problem using this library while processing Kinesis stream.
I currently have a stream of 20shards, and I want to read it from trim horizon, therefore I create 20 workers (same application name, different ids) to process the stream. In theory, 20 shards would allow me to read up to 2MB x 20 x 60 = 2400MB per minute, however, I only observer a maximum of ~ 1000MB. This was very weird hence I did an experiment by changing the code (commented out the trackRecords). Then the snapshot build could obtain the maximum read throughput (however, there is no checkpoint on DynamoDB table due to no tracking).
abortStreamOnError("processRecords") {
val records = transformRecords(processRecordsInput.records())
//trackRecords(records)
//checkpointIfNeeded(processRecordsInput.checkpointer())
records.grouped(EnqueueBatchSize).foreach { r =>
enqueueRecords(r)
checkpointIfNeeded(processRecordsInput.checkpointer())
}
}
Could you please let me know is there anyway to increase the throughput without losing correctness?
Many thanks,
Truc
The text was updated successfully, but these errors were encountered:
I have observed a problem using this library while processing Kinesis stream.
I currently have a stream of 20shards, and I want to read it from trim horizon, therefore I create 20 workers (same application name, different ids) to process the stream. In theory, 20 shards would allow me to read up to 2MB x 20 x 60 = 2400MB per minute, however, I only observer a maximum of ~ 1000MB. This was very weird hence I did an experiment by changing the code (commented out the trackRecords). Then the snapshot build could obtain the maximum read throughput (however, there is no checkpoint on DynamoDB table due to no tracking).
Could you please let me know is there anyway to increase the throughput without losing correctness?
Many thanks,
Truc
The text was updated successfully, but these errors were encountered: