Add StatiqueImporter and related import-statique CLI command #139

Merged 1 commit on Sep 2, 2024

@jmaupetit (Member) commented Jul 29, 2024

Purpose

We need a more efficient way to massively import Statique data.

Proposal

  • add the qualicharge import-statique command
  • use save_statiques_from_dataframe in the /statique/bulk endpoint? Postponed to a later PR (this one is already too big)

@jmaupetit self-assigned this Jul 30, 2024
@jmaupetit added the feature, WIP, and API (Item related to the API service) labels Jul 30, 2024
@jmaupetit force-pushed the add-parquet-import branch from 7a9beae to ad6d0da August 1, 2024 15:54
@jmaupetit marked this pull request as ready for review August 30, 2024 17:18
@jmaupetit changed the title from "add parquet import" to "Add StatiqueImporter and related import-statique CLI command" Aug 30, 2024
@loco-philippe (Collaborator) commented

I had a quick look, but only at a few (non-fundamental) pandas-related points (see above). For the rest, I would need to spend some time to understand it properly (but I don't think that would be very useful).

As for response times, I don't know which parts are the most costly and would be worth digging into (if saving time is worth the effort).

@jmaupetit (Member, Author) commented

@loco-philippe thanks for your review, I have applied all the requested changes. 🙏

@loco-philippe (Collaborator) left a comment

Perfect, thanks!

We designed an efficient, ORM-free, Pandas-based Statique importer. It is
expected to be about 10x faster than the ORM-based routines implemented
in the /statique/bulk endpoint.
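
To illustrate the general idea of an ORM-free, Pandas-based import, here is a minimal sketch. It is not the StatiqueImporter from this PR: the function name, the table names (`operator`, `station`) and the column names are hypothetical, and an in-memory SQLite database stands in for the project's actual database. Related entities are deduplicated in memory, foreign keys are resolved with a vectorized merge, and each table is written in bulk with `DataFrame.to_sql`, with no per-row ORM objects.

```python
import sqlite3

import pandas as pd


def save_rows_without_orm(df: pd.DataFrame, conn: sqlite3.Connection) -> int:
    """Bulk-insert a flat DataFrame into two related tables, bypassing any ORM.

    Hypothetical schema: operator(name, id) and station(station_id, operator_id).
    """
    # Deduplicate the parent entities once, in memory, and give them ids.
    operators = df[["operator_name"]].drop_duplicates().reset_index(drop=True)
    operators["id"] = operators.index + 1

    # Write all parent rows in a single bulk statement.
    operators.rename(columns={"operator_name": "name"}).to_sql(
        "operator", conn, if_exists="append", index=False
    )

    # Resolve foreign keys with a vectorized merge instead of per-row lookups.
    stations = df.merge(operators, on="operator_name")[["station_id", "id"]]
    stations = stations.rename(columns={"id": "operator_id"})

    # Write all child rows in a single bulk statement.
    stations.to_sql("station", conn, if_exists="append", index=False)
    return len(stations)


if __name__ == "__main__":
    conn = sqlite3.connect(":memory:")
    flat = pd.DataFrame(
        {
            "station_id": ["S1", "S2", "S3"],
            "operator_name": ["A", "B", "A"],
        }
    )
    print(save_rows_without_orm(flat, conn), "stations saved")
```

The sketch assumes empty target tables; a real importer would also have to reconcile rows that already exist in the database.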
@jmaupetit merged commit 29f4a43 into main Sep 2, 2024
12 checks passed
@jmaupetit deleted the add-parquet-import branch September 2, 2024 16:45
Labels: API (Item related to the API service), feature, needs review