[Discuss] Changed default settings for 3.0 #17170

Open
Tracked by #3351
andrross opened this issue Jan 28, 2025 · 4 comments
Labels
Cluster Manager discuss Issues intended to help drive brainstorming and decision making Indexing Indexing, Bulk Indexing and anything related to indexing Search Search query, autocomplete ...etc untriaged

Comments

@andrross
Member

andrross commented Jan 28, 2025

Is your feature request related to a problem? Please describe

Coming from part of @Bukhtawar's comment here, the 3.0 release is an opportunity to change default settings or behavior. We often introduce new features or guardrails that are generally helpful but are considered breaking behavior to automatically enable by default. This issue is to discuss any such changes that should be made and get them added to the list for the 3.0 release.

Describe the solution you'd like

List changes to defaults that should be made for the 3.0 release.

Proposed Changes:

@andrross andrross added enhancement Enhancement or improvement to existing feature or request untriaged labels Jan 28, 2025
@andrross andrross added Indexing Indexing, Bulk Indexing and anything related to indexing Search Search query, autocomplete ...etc labels Jan 28, 2025
@andrross andrross added discuss Issues intended to help drive brainstorming and decision making and removed enhancement Enhancement or improvement to existing feature or request labels Jan 28, 2025
@andrross
Member Author

I'm intentionally spamming the component labels to get visibility here across all components :)

@msfroh
Collaborator

msfroh commented Jan 29, 2025

Ooh, ooh! Pick me!

  • Increase maxMergeAtOnce parameter in OpenSearch 3.0 #17051 -- We can get a head-start on a new default planned for Lucene 11.
  • I would also change a few TieredMergePolicy settings to get more consistent segment sizes:
    • floorSegmentMB should be much higher than the default 2MB -- say 50 MB
    • segmentsPerTier should probably be lower -- I've had good experience with 5 (down from 10)
    • IMO we should add a setting to cap maxSegmentSize at a fraction of total shard size (e.g. no segment should be larger than 1/5 of the shard size) to see benefits from concurrent segment search even for smaller shards.
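For illustration only, the TieredMergePolicy suggestions above could be sketched as an index settings body. The `index.merge.policy.*` keys are my assumption about which index settings correspond to `floorSegmentMB`, `segmentsPerTier` and `maxSegmentSize`. The fraction-of-shard cap does not exist as a setting today, so the hypothetical helper below just computes a value by hand, and the 5 GB ceiling is assumed to be the current default for the largest merged segment.

```python
# Hypothetical helper, not part of OpenSearch: builds an index settings body
# that mirrors the merge policy values suggested in the comment above.

ASSUMED_DEFAULT_MAX_SEGMENT_MB = 5 * 1024  # assumption: current 5 GB default


def proposed_merge_settings(shard_size_mb: int, shard_fraction_denominator: int = 5) -> dict:
    """Return merge policy settings for a shard of the given size in MB."""
    if shard_size_mb <= 0 or shard_fraction_denominator <= 0:
        raise ValueError("shard size and denominator must be positive")
    # Cap the largest segment at 1/denominator of the shard (1/5 as suggested),
    # but never below 1 MB and never above the assumed existing default.
    capped_mb = max(1, shard_size_mb // shard_fraction_denominator)
    capped_mb = min(capped_mb, ASSUMED_DEFAULT_MAX_SEGMENT_MB)
    return {
        "index.merge.policy.floor_segment": "50mb",    # up from 2 MB
        "index.merge.policy.segments_per_tier": 5,     # down from 10
        "index.merge.policy.max_merged_segment": f"{capped_mb}mb",
    }


if __name__ == "__main__":
    print(proposed_merge_settings(shard_size_mb=1000))
```

With a cap like this, a 1000 MB shard would end up with segments of at most 200 MB, so it should hold at least five segments for concurrent segment search to work on.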

@HenryTheSir

Hi,

I'm not sure whether there are guidelines I need to follow to suggest changes. If this is the wrong place, just delete my comment.

I have two suggestions. First, change the default codec to zstd; the docs say both read and write performance improve with this codec.
Second, enable cluster.allocator.existing_shards_allocator.batch_enabled by default to reduce pressure on clusters at startup.

best regards
Henry
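As a minimal sketch of what opting in to the codec looks like today, assuming `index.codec` is set when the index is created and that `zstd` is an accepted value on the cluster (it may require a codec plugin, depending on the version): the helper name is hypothetical, and the batch allocator setting is left out because how it is applied is not covered here.

```python
import json


def index_body_with_codec(codec: str = "zstd") -> dict:
    """Hypothetical helper: index creation body that opts in to a codec."""
    if not codec:
        raise ValueError("codec must be a non-empty string")
    # Assumption: the codec is chosen per index through the "codec" setting.
    return {"settings": {"index": {"codec": codec}}}


if __name__ == "__main__":
    # Body one would send when creating an index, for example.
    print(json.dumps(index_body_with_codec(), indent=2))
```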

@kkewwei
Contributor

kkewwei commented Feb 2, 2025

My two suggestions:
1. Enable cluster.routing.allocation.balance.prefer_primary and cluster.routing.allocation.rebalance.primary.enable.
2. Increase cluster.max_shards_per_node. 1000 is too small; how about 3000?
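Until any of these become defaults, the two suggestions above can be expressed as an explicit opt-in. The sketch below assumes all three settings can be updated dynamically through a cluster settings request with a `persistent` block; the helper name is hypothetical and the value 3000 is simply the number proposed above.

```python
import json


def opt_in_cluster_settings(max_shards_per_node: int = 3000) -> dict:
    """Hypothetical helper: cluster settings body for the values suggested above."""
    if max_shards_per_node < 1:
        raise ValueError("max_shards_per_node must be at least 1")
    return {
        "persistent": {
            # Suggestion 1: prefer balancing primaries across nodes.
            "cluster.routing.allocation.balance.prefer_primary": True,
            "cluster.routing.allocation.rebalance.primary.enable": True,
            # Suggestion 2: raise the per-node shard limit from 1000.
            "cluster.max_shards_per_node": max_shards_per_node,
        }
    }


if __name__ == "__main__":
    # Body one would send in a cluster settings update request.
    print(json.dumps(opt_in_cluster_settings(), indent=2))
```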
