Skip to content
This repository has been archived by the owner on Nov 13, 2024. It is now read-only.

batch upsert by chunks in cli #132

Merged
merged 3 commits into from
Nov 1, 2023
Merged

batch upsert by chunks in cli #132

merged 3 commits into from
Nov 1, 2023

Conversation

acatav
Copy link
Contributor

@acatav acatav commented Nov 1, 2023

Problem

batch upsert should be determined by the number of chunks and not number of documents for large upsert

Solution

For now we will batch by number of chunks only in the CLI. Later we will refactor the whole upsert pipeline

Type of Change

  • Bug fix (non-breaking change which fixes an issue)
  • New feature (non-breaking change which adds functionality)
  • Breaking change (fix or feature that would cause existing functionality to not work as expected)
  • This change requires a documentation update
  • Infrastructure change (CI configs, etc)
  • Non-code change (docs, etc)
  • None of the above: (explain here)

Test Plan

Manual tests with short documents dataset, and with long documents dataset

Copy link
Contributor

@igiloh-pinecone igiloh-pinecone left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM. See a few comments

src/canopy_cli/cli.py Outdated Show resolved Hide resolved
src/canopy_cli/cli.py Outdated Show resolved Hide resolved
@acatav acatav enabled auto-merge November 1, 2023 13:26
@acatav acatav added this pull request to the merge queue Nov 1, 2023
Merged via the queue into dev with commit 10c0e7d Nov 1, 2023
10 checks passed
@acatav acatav deleted the cli-batch-upsert-by-chunks branch November 1, 2023 13:45
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants