DM-46747 refactor pqserver per SQR-072 #65
Conversation
```python
# Check SQLAlchemy version
required_version = (2, 0, 0)
```
Maybe we could put this in our requirements.txt or something?
I've removed the version check from the code, but I think setting up requirements.txt or similar calls for a new ticket. The needs of consdb are fairly complicated given that there are several products coming from this repository.
If you wanted to add requirements on such a major refactoring, that seems fine - I don't see that needing its own ticket.
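A declarative alternative (a sketch only; the exact pin and its location would be decided on the follow-up work discussed above) would be to state the SQLAlchemy requirement in `requirements.txt` rather than checking it at runtime:

```text
# requirements.txt (hypothetical location and pin)
SQLAlchemy>=2.0,<3
```

With the requirement declared, pip refuses to install an incompatible version up front, and the runtime check becomes unnecessary.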
Co-authored-by: Valerie <[email protected]>
This organization of dependencies (external, internal, models), with pqserver.py reduced to being the 'main app', seems to fit SQR-072 and matches my familiarity with FastAPI.
We also talked about handling globals and 'requirements.txt' sometime soon.
```python
md.reflect(engine)
self.table_names.update([str(table) for table in md.tables])
self.schemas = md
self.obs_id_column = dict()
```
Redundant (along with the next line).
```python
self.get_db = get_db

self.table_names = set()
self.schemas = dict()
```
Isn't this a potential type problem if you're replacing it with a `MetaData`?
I see the attraction of initializing these all up front, but sometimes initializing to the final, fixed contents can be more readable.
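One way to follow that suggestion, sketched below with a hypothetical helper name (`build_schema_state` is not in the PR), is to construct the final contents in one step so the attribute types never change from `dict` to `MetaData`:

```python
import sqlalchemy

def build_schema_state(engine):
    # Reflect the database once and derive everything from the result,
    # instead of initializing empty placeholders that are later replaced
    # with values of a different type.
    md = sqlalchemy.MetaData()
    md.reflect(engine)
    table_names = {str(table) for table in md.tables}
    return md, table_names
```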
```python
col_name = col_name.value
if col_name in md.tables[table].columns:
    self.obs_id_column[table] = col_name
    break
```
I think it would be good to comment here that this breaks ties based on the ordering in the enum if more than one such column is in a table. (I do think that ordering is correct.)
```python
        columns = self.timestamp_columns[table]
        return columns

    def get_schema_version(self) -> Version:
```
I'm not sure how this might be used, but if we need it, we should pass it through from the source of truth rather than inferring it. For a later ticket.
```python
def get_day_obs_and_seq_num(self, exposure_id: int) -> tuple[int, int]:
    exposure_table_name = f"cdb_{self.instrument}.exposure"
    exposure_table = self.schemas.tables[exposure_table_name]
    query = sqlalchemy.select(exposure_table.c.day_obs, exposure_table.c.seq_num).where(
```
If this ever becomes a bottleneck, we can replace it with per-instrument formulae.
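A per-instrument formula might look like the sketch below. This rests on an unverified assumption: that the integer exposure ID packs the two values as `day_obs * 100000 + seq_num`, which holds for some instruments' ID schemes but must be confirmed per instrument before replacing the query:

```python
def day_obs_and_seq_num(exposure_id: int) -> tuple[int, int]:
    # ASSUMPTION: exposure_id == day_obs * 100000 + seq_num.
    # Only valid for instruments whose IDs are packed this way.
    return exposure_id // 100000, exposure_id % 100000
```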
```python
schema_table_name = table_name + "_schema"
if table_name in md.tables and schema_table_name in md.tables:
    schema_table = md.tables[schema_table_name]
    stmt = sqlalchemy.select(schema_table.c["key", "dtype", "doc", "unit", "ucd"])
```
I guess it's too difficult to share this code with the `refresh_` method below.
```python
    self.postgres_url = url
    return url

return "ERROR DATABASE CONNECTION NOT SPECIFIED"
```
Should this raise a `ValueError`?
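A minimal sketch of that suggestion (function name and environment variable are illustrative, not the PR's actual code): raising makes the failure impossible to mistake for a real URL, whereas a returned error string could be passed along to the engine.

```python
import os

def get_postgres_url() -> str:
    # Hypothetical sketch: fail loudly instead of returning a sentinel
    # string that callers might treat as a valid connection URL.
    url = os.environ.get("POSTGRES_URL")
    if url is None:
        raise ValueError("database connection not specified")
    return url
```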
```python
logger = logging.getLogger()

# Check whether the instrument name is valid
if _instrument_list is None:
```
Call `get_instrument_list()`?
```python
inspector = inspect(engine)
instrument_list = [name[4:] for name in inspector.get_schema_names() if name.startswith("cdb_")]

if instrument not in [i.lower() for i in instrument_list]:
```
Call `validate_instrument_name()`?
```python
    return data


class UnknownInstrumentException(Exception):
```
Actually, it seems like it could inherit from `BadValueException` with a particular `kind`.
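That hierarchy could look like the following sketch. The constructor signature of `BadValueException` is assumed here, not taken from the PR:

```python
class BadValueException(Exception):
    # Hypothetical base class: records what kind of value was bad
    # and the offending value itself.
    def __init__(self, kind: str, value: object):
        super().__init__(f"Unknown {kind}: {value}")
        self.kind = kind
        self.value = value

class UnknownInstrumentException(BadValueException):
    # Reuses the base class with a fixed kind, per the review suggestion.
    def __init__(self, instrument: str):
        super().__init__("instrument", instrument)
```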
This divides the functionality in pqserver into separate units. It also switches from using a `Connection` object to `Session` objects so that SQLAlchemy is managing a connection pool.

Note that the error behavior is not preserved by this change. My impression is that no one is relying on the existing error behavior, so it is to our advantage to move to the error formats that are provided by FastAPI.