Add Empower Connector #902

jburchard · 2023-10-07T21:50:57Z

This connector adds support for the Empower relational app. As of my most recent testing, it includes support for all of the different json objects that are included in their API.

sharinetmc

@jburchard, everything here looks good to me, honestly! I just have a personal question! From what I understand, it seems that the connector basically grabs an object, which contains all the relevant data and then splits it up between the relevant tables that each method is associated with. Everything looks clean and makes sense. And all the documentation and mock tests and everything looks great.

@shaunagm, not sure if you might have any other comments or suggestions? Just marking this as a comment for now in case you might have anything to add before approval?

sharinetmc · 2023-12-01T19:10:23Z

docs/empower.rst

+
+.. note::
+    The Empower API only has a single endpoint to access all account data. As such, it has a very high overhead. This
+    connector employs caching in order to allow the user to specify the tables to extract without additional API calls.


More of a personal question, coming from lack of experience, but wondering what caching is?

So, there is only one endpoint and it grabs a giant JSON blob that is a bit of a pain to parse through and convert to a tabular-ish format. I've broken up that JSON into multiple functions so that folks can grab the bits of data that they want.

However, that means that every single time you call it function, it is grabbing a lot of extraneous data. So, the caching just stores the blob and extracts from it rather than calling the Empower server again.

At the time, it seemed like a cute idea, but perhaps its unnecessary.

shaunagm · 2023-12-04T20:23:05Z

docs/empower.rst

+To instantiate the Empower class, you can either store your ``EMPOWER_API_KEY`` an environment
+variables or pass them in as arguments:
+
+.. code-block:: python


Might be nice to have an example of disabling caching in the quickstart

shaunagm · 2023-12-04T20:26:23Z

parsons/empower/empower.py

+        else:
+            return self.data
+
+    def _unix_convert(self, ts):


You can ignore this, Justin, but just flagging that this is a potential utility method (see #554 )

shaunagm · 2023-12-04T20:28:58Z

parsons/empower/empower.py

+        ts = ts.strftime("%Y-%m-%d %H:%M:%S UTC")
+        return ts
+
+    def _empty_obj(self, obj_name):


Is there is a reason this is a separate method rather than just using this code in the one place it's referenced? You could write it with just that first line - the return True and return False aren't necessary.

shaunagm · 2023-12-04T20:30:33Z

parsons/empower/empower.py

+                See :ref:`parsons-table` for output options.
+        """
+
+        tbl = Table(self.data["profiles"]).long_table("eid", "activeCtaIds")


long_table is a new one on me! Mind sharing what it is and why you're using it here? Thank you!

Sure, this documentation includes a pretty decent example. Basically, it takes a column that has a nested JSON in it -- in this case a list of active call to action ids -- and it creates a new table that is one row for each call to action id and the id of the profile. Then, if you are storing in a DB, you can easily join the two together.

However, this transformation and, many others in parson, was built before JSONs in BigQuery/Redshift became a more widespread thing. So, in a world in which you can store JSONs and query the elements in SQL, this may no longer make sense. Tldr: Should parson's connectors be extracting nested JSONs?

shaunagm · 2023-12-04T20:35:25Z

parsons/empower/empower.py

+        col_list = [v for v in tbl.columns if v.find("value") != -1]
+        tbl.coalesce_columns("answer_id", col_list, remove_source_columns=True)
+        tbl.remove_column("uid")
+        tbl.remove_column("answers")  # Per docs, this is deprecated.


I believe that remove_column errors if the column name isn't found. Is it possible that this column name will stop getting returned? Or do you mean it's deprecated in some other way?

This column is still being returned by Empower, but their docs indicate that it isn't being used for anything any longer. So, I decided to not surface it for the user.

shaunagm · 2023-12-04T20:38:16Z

I added a couple of comments/questions @jburchard but nothing major. Should be pretty close to merging this.

jburchard · 2023-12-04T20:48:09Z

Thanks @shaunagm @sharinetmc for the feedback. I'll take another pass. There are also a few slight modifications (a few errors surfaced) when I started using a forked version of this, that I do want to include in the PR before it is merged.

shaunagm

Just formally requesting changes so it shows up with that status in the pull request list view :)

shaunagm · 2024-04-11T18:12:17Z

Hi Justin, zero pressure - just wanted to see if you have plans to get to this soon?

KasiaHinkson · 2024-11-12T15:23:22Z

TMC has been using Airbyte for our Empower sync and it's no longer working so we're looking for an alternative. This looks like good work, can TMC do anything to help get this over the line?

Also, we were not doing any unnesting at all at the load step, so I'd like to include a function/option to just return the results as a 1 row table with all that nested json that we would write to GCS and then load into BQ, our dbt pipelines handle the rest.

shaunagm · 2024-12-02T21:13:23Z

Merged in #1191 so closing this.

Push code.

33ded2c

jburchard added the new connector Work type - creating a new Parsons connector for a tool label Oct 7, 2023

jburchard requested a review from shaunagm October 7, 2023 21:50

jburchard added 3 commits October 7, 2023 16:59

Some linting that snuck through...

ebf5c9d

Black v2.

082fa10

Merge branch 'main' into empower_connector

8321f0b

shaunagm assigned KasiaHinkson Oct 26, 2023

shaunagm requested review from KasiaHinkson and removed request for shaunagm October 26, 2023 19:58

shaunagm requested review from sharinetmc and removed request for KasiaHinkson November 9, 2023 20:38

shaunagm assigned sharinetmc and unassigned KasiaHinkson Nov 9, 2023

sharinetmc reviewed Dec 1, 2023

View reviewed changes

Merge branch 'main' into empower_connector

fbe7e48

shaunagm reviewed Dec 4, 2023

View reviewed changes

shaunagm requested changes Dec 19, 2023

View reviewed changes

shaunagm closed this Dec 2, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add Empower Connector #902

Add Empower Connector #902

jburchard commented Oct 7, 2023

sharinetmc left a comment

sharinetmc Dec 1, 2023

jburchard Dec 4, 2023

shaunagm Dec 4, 2023

jburchard Dec 4, 2023

shaunagm Dec 4, 2023

shaunagm Dec 4, 2023

shaunagm Dec 4, 2023

jburchard Dec 4, 2023

shaunagm Dec 4, 2023

jburchard Dec 4, 2023

shaunagm commented Dec 4, 2023

jburchard commented Dec 4, 2023

shaunagm left a comment

shaunagm commented Apr 11, 2024

KasiaHinkson commented Nov 12, 2024

shaunagm commented Dec 2, 2024

Add Empower Connector #902

Add Empower Connector #902

Conversation

jburchard commented Oct 7, 2023

sharinetmc left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

shaunagm commented Dec 4, 2023

jburchard commented Dec 4, 2023

shaunagm left a comment

Choose a reason for hiding this comment

shaunagm commented Apr 11, 2024

KasiaHinkson commented Nov 12, 2024

shaunagm commented Dec 2, 2024