"Equal" attribution #267
-
A clarification regarding attributing flows equally across multiple industries (used as a fallback method when other attribution sources are not available). If a flow of 6 is to be equally attributed to the NAICS codes 111001, 111002, 111003, and 211000, what is the desired final attribution?
I think the first attribution is preferable, but I'm not sure. Thoughts?
-
My preference is also option 1. It makes sense to equally attribute a parent NAICS to its child NAICS, so the flow should first be split equally between sectors 11 and 21 (as if they shared a hypothetical parent) before being attributed further down to NAICS6. I initially tried to implement equal attribution the way you've presented it in option 1. However, some of the data we import into flowsa come in odd combinations of NAICS levels, and I couldn't get option 1 to work for them: my function returned a dataset where the parent value and the sum of its child NAICS values weren't equal, so I defaulted to option 2. Take a look at EIA_MECS_Energy; that is the dataset that caused problems for me.
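To make the two readings concrete, here is a minimal sketch in plain Python. It assumes option 1 means splitting equally at each NAICS level on the way down (as described above) and option 2 means a flat split across the listed codes. The function names `attribute_flat` and `attribute_hierarchical` are hypothetical and are not part of flowsa.

```python
from collections import defaultdict


def attribute_flat(flow, codes):
    """Assumed 'option 2': every listed code gets the same share."""
    share = flow / len(codes)
    return {code: share for code in codes}


def attribute_hierarchical(flow, codes, digits=2):
    """Assumed 'option 1': split equally among the distinct prefixes at
    each NAICS level, then recurse one digit deeper within each prefix.

    Simplification: sectors are identified by their first two digits, so
    multi-code sectors such as 31-33 are treated as separate sectors here.
    """
    if len(set(codes)) != len(codes):
        raise ValueError("codes must be unique")
    if len(codes) == 1:
        return {codes[0]: flow}
    # Group the codes by their prefix at the current level.
    groups = defaultdict(list)
    for code in codes:
        groups[code[:digits]].append(code)
    share = flow / len(groups)
    result = {}
    for members in groups.values():
        result.update(attribute_hierarchical(share, members, digits + 1))
    return result


codes = ["111001", "111002", "111003", "211000"]
print(attribute_flat(6, codes))
# {'111001': 1.5, '111002': 1.5, '111003': 1.5, '211000': 1.5}
print(attribute_hierarchical(6, codes))
# {'111001': 1.0, '111002': 1.0, '111003': 1.0, '211000': 3.0}
```

Both versions conserve the total of 6; they differ only in whether sector 21 receives a quarter or a half of the flow. The sketch does not address the mixed-level inputs mentioned above (the EIA_MECS_Energy case), where a parent and some of its children both carry reported values.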
-
@matthewlchambers I'm looping back to this discussion because we now need to equally attribute Census of Agriculture data from parent to child NAICS where the child data aren't published (the data contain NAICS4/5 and we want NAICS6). We don't have a general equal allocation function in the recursive branch yet, do we? I only see the function that estimates suppressed data for QCEW and the functions for the unique situation that is MECS.
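As a starting point for discussion, a general parent-to-child equal allocation could look roughly like the sketch below. It is not an existing flowsa function: the name `allocate_parent_to_children`, the dict-based inputs, and the list of NAICS6 codes standing in for a crosswalk are all assumptions made for illustration. It also assumes the reported parents do not nest (no NAICS4 row alongside one of its own NAICS5 children), since that case would double count.

```python
def allocate_parent_to_children(parent_values, naics6_codes):
    """Split each parent's value equally across the NAICS6 codes that
    start with the parent code.

    parent_values: dict mapping a parent NAICS code (e.g. NAICS4 or
        NAICS5, as a string) to its flow amount.
    naics6_codes: iterable of 6-digit NAICS codes (hypothetical stand-in
        for a real crosswalk).
    """
    result = {}
    for parent, value in parent_values.items():
        children = [c for c in naics6_codes if c.startswith(parent)]
        if not children:
            # No known children: leave the parent out rather than guess.
            continue
        share = value / len(children)
        for child in children:
            result[child] = result.get(child, 0.0) + share
    return result


# Illustrative inputs only; the values are made up.
naics6 = ["111110", "111120", "111211", "111219", "112111"]
parents = {"1111": 10.0, "11121": 4.0}
print(allocate_parent_to_children(parents, naics6))
# {'111110': 5.0, '111120': 5.0, '111211': 2.0, '111219': 2.0}
```

Each parent's total is preserved across its children, which is the parent-equals-summed-children property that the earlier attempt could not guarantee for mixed-level data.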