Intermittent data ingestion issue #222
Comments
@drnextgis If you are running multiple workers, try to ensure that they are not accessing the same partitions: the partition maintenance tasks take locks and will hinder concurrency on those partitions. PgSTAC does a number of things to mitigate locking issues and allow concurrency, but because we need to actually modify the layout of the database (adding partitions / modifying table constraints), some locking is unavoidable and contention is possible. From that standpoint it will always be safest to run ingests sequentially; if you cannot, chunk the data so that you are not running ingests to the same partition simultaneously.
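As a rough illustration of the chunking idea, here is a minimal Python sketch that groups STAC items by the partition they would most likely land in, so that each group can be handed to a single worker. It assumes items are plain dicts with a `collection` field and a `properties.datetime` value in ISO 8601 form, and that the collection is partitioned by item datetime (by year or by month). `partition_key` and `group_by_partition` are hypothetical helpers, not part of PgSTAC or pypgstac.

```python
from collections import defaultdict
from datetime import datetime


def partition_key(item, by="month"):
    """Return a (collection, period) tuple for a STAC item dict.

    Assumption: the collection is partitioned by item datetime,
    either per year or per month.
    """
    raw = item["properties"]["datetime"]
    # Normalize a trailing "Z" so datetime.fromisoformat accepts it
    # on older Python versions as well.
    dt = datetime.fromisoformat(raw.replace("Z", "+00:00"))
    period = dt.strftime("%Y") if by == "year" else dt.strftime("%Y-%m")
    return item["collection"], period


def group_by_partition(items, by="month"):
    """Group items so that each group targets a single partition."""
    groups = defaultdict(list)
    for item in items:
        groups[partition_key(item, by)].append(item)
    return dict(groups)
```

Each resulting group could then be ingested by exactly one worker at a time, which keeps two workers from writing to the same partition simultaneously.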
I understand that writing to the same partition sequentially is the general guideline. However, because we ingest data into the Catalog daily, and each collection can be further partitioned only by year or month, we inevitably end up ingesting data into the same partition 😥 Furthermore, I observed that invoking
Proposed solution: #223
The risk with pre-generating partitions too aggressively is that many empty partitions can slow down query planning for everything. Can you try this approach and see whether it resolves the issues you are seeing?
Are there any situations that might result in data not being ingested into the database after load_items has been invoked successfully? For example, concurrent ingestions or heavy database load during the ingestion process? We've encountered this issue a few times, but the underlying cause remains uncertain; typically, rerunning the ingestion resolves the problem.
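Until the cause is known, one possible workaround is to verify that the items are actually present after each load and to reload only the missing ones. The sketch below is a generic pattern, not pypgstac API: `load` and `find_existing_ids` are hypothetical callables that you would supply (for example, a wrapper around your load_items call and a query that returns the ids found in the database).

```python
import time


def load_with_verification(items, load, find_existing_ids, retries=3, delay=1.0):
    """Load items, check which ids are present, and retry the missing ones.

    `load(items)` and `find_existing_ids(ids)` are caller-supplied,
    hypothetical callables. Returns the number of attempts that were needed.
    """
    pending = list(items)
    for attempt in range(1, retries + 1):
        load(pending)
        present = set(find_existing_ids([item["id"] for item in pending]))
        # Keep only the items that are still missing after this attempt.
        pending = [item for item in pending if item["id"] not in present]
        if not pending:
            return attempt
        if attempt < retries:
            time.sleep(delay)
    missing = [item["id"] for item in pending]
    raise RuntimeError(f"items still missing after {retries} attempts: {missing}")
```

Logging the attempt count and the missing ids from such a wrapper may also help narrow down whether the failures correlate with concurrent ingests.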