-
Notifications
You must be signed in to change notification settings - Fork 0
/
about.html
159 lines (132 loc) · 9.5 KB
/
about.html
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
<!DOCTYPE html>
<html lang="en">
<head>
<meta charset="UTF-8">
<meta name="viewport" content="width=device-width, initial-scale=1.0">
<meta name="description" content="CI Sentinel maps America's critical infrastructure for analysis and exploration.">
<meta name="keywords" content="Critical Infrastructure, Map, CI Sentinel, United States">
<meta name="author" content="Your Name">
<title>About</title>
<link rel="stylesheet" href="./resources/styles/index.css">
</head>
<body>
<nav>
<ul class="navbar">
<li><a href="index.html">Home</a></li>
<li><a href="about.html">About</a></li>
<li><a href="mixed_datasets.html">Datasets</a></li>
<!-- <li><a href="donate.html">Donate</a></li> -->
</ul>
</nav>
<div class="centered-content">
<h2>Introducing CI Sentinel!<h2></h2>
<h4>A comprehensive map of America's critical infrastructure.</h4>
<h3>Methods</h3>
<p>My methods of creating this map came from collecting data from several datasets, and applying distributed weights to each facility, and weighing each of those facilities depending on their own details.</p>
<h3>Classification of Importance by Industry</h3>
<p>One of the most challenging aspects of this projects is trying to classify and rank each of specific facility by importance to national security, and generally trying to rank how "critical" a facility is to the security of the United States. For example, a facility with the NAICS code of 336414 is for "Guided Missile and Space Vehicle Manufacturing;" So facilities with this classification are significantly more important and should be weighed much more than say a facility with the NAICS classification of 111150 which is "Corn Farming." However, as of now, only 3 facility types are being weighed, this will be fixed with time.</p>
<p>With the classification of industries past, it was much easier to rank the criticality of non-commercial and governmental facilities. This was done mainly by measuring the raw metrics of each of these facilities and giving more weight to the facilities with higher metrics. </p>
<h2>Notebooks</h2>
<p>Here is a list of my notebooks and the purposes each one served.</p>
<h3>cleanGeoJSONs.ipynb</h3>
<p>This notebook removed unnecessary fields from the GeoJSONs. In each of the datasets, there are fields that did not need to be measured for the importance of critical infrastructure, such as phone numbers and addresses. These fields took up a ton of storage space and memory use, so it was very necessary to remove a lot of the fields.</p>
<h3>standardizeGeoJSONs.ipynb</h3>
<p>This notebook changes any GeoJSON file that is not in EPSG:4326 to this format. GeoJSONs come with many different Coordinate Reference Systems, such as EPSG:3857, EPSG:7789, or EPSG:31983. These formats are not compatible with each other, so they need to be standardized into one single format. For this project, EPSG:4326 is what I chose because it is easy to read.</p>
<h3>geoJSONtoGPKG.ipynb</h3>
<p>This was created to convert all of the GeoJSONs to binary in order to process them faster. They were not able to speed up the process and I could not figure out why, but I would love some contribution on this.</p>
<h3>weights.ipynb</h3>
<p>This is the one where the magic happens. This notebook takes in the cleaned datasets, calculates the weight of each facility (and give a weight to each object) and then deletes the rest of the fields that aren't the "weight" or "geometry." Deletign the rest of the fields proved to be critical for the use of this application, because if not, it would take an absurd amount of time to process these large files and plot everything onto the map.</p>
<h3>map_creator_best.ipynb</h3>
<p>This notebook creates the map at county, state, and national level. With this, all of the weighted datasets are ingested into memory, and the notebook plots each county and state alphabetically. In cell 6, all of the counties in America are processed alphabetically. In cell 7, all of the states are processed. In cell 5, the whole nation is procesed and the legend is created. For each county amd state, it creates a folder in which there are 4 objects stored, The first one is a map of all of the critical infrastructure for each. Next, it creates a text file which includes the datasets used, the ones not used, and ones that ran into any errors. Next, there is a map of the top 20 most important facilities in each county by weight. Lastly, there is a map of the top 5 aggragate points for each county. This process is repeated for the states in cell 7. Note that I used Google Colab Pro with High-RAM in order to speed up the process.</p>
<h2>List of sectors and datasets:</h2>
<h3>Communications</h3>
<h4>- Cell Towers</h4>
<h4>- Microwave Service Towers</h4>
<h3>Education</h3>
<h4>- Colleges and Universities</h4>
<h4>- Private Schools</h4>
<h3>Emergency Services</h3>
<h4>- Local Law Enforcement</h4>
<h4>- Fire and Emergency Services</h4>
<h4>- State Emergency Services</h4>
<h3>Energy</h3>
<h4>Electric</h4>
<h5>- Electric Power Grid and Transformers</h5>
<h4>Gas</h4>
<h5>- Above Ground LNG Storage</h5>
<h5>- Biodiesel Plants</h5>
<h5>- DOE Petroleum Reserves</h5>
<h5>- Ethanol Plants</h5>
<h5>- Hydrocarbon Gas Liquid Pipelines</h5>
<h5>- LNG Import and Export Terminals</h5>
<h5>- Natural Gas Compressor Stations</h5>
<h5>- Natural Gas Pipelines</h5>
<h5>- Natural Gas Processing Plants</h5>
<h5>- Peak Shaving Facilities</h5>
<h5>- Petroleum Terminals</h5>
<h4>Oil</h4>
<h5>- Oil and Natural Gas Wells</h5>
<h5>- Oil Refineries</h5>
<h3>Economy and Finance</h3>
<h4>- County Business Patterns</h4>
<h4>- Federal Reserve Banks</h4>
<h4>- FDIC Insured Banks</h4>
<h4>- Gold Bullion Depositories</h4>
<h3>Geographic</h3>
<h4>- Counties and State Lines GeoJSON</h4>
<p>This dataset contains no weight, it is simply a GeoJSON outline of each of the counties and states.</p>
<h3>Food</h3>
<h4>- Public Refrigerated Warehouses</h4>
<h3>Government</h3>
<h4>- Courthouses</h4>
<h4>- Major State Government Buildings</h4>
<h4>- State Capital Buildings</h4>
<h4>- US Army Corp of Engineers Offices</h4>
<h3>Healthcare</h3>
<h4>- Health Facilities</h4>
<h3>Industry</h3>
<h4>- Fortune 500 Headquarters</h4>
<h4>- Manufacturing Facilities</h4>
<!-- <h4>- Semiconductors</h4> -->
<h3>Military</h3>
<h4>- Military Installations</h4>
<h3>Mines</h3>
<h4>- Agricultural Mineral Operations</h4>
<h4>- Construction Mineral Operations</h4>
<h4>- Ferrous Metal Mines</h4>
<h4>- Ferrous Metal Processing Plants</h4>
<h4>- Mines and Mineral Resources</h4>
<h4>- Non-ferrous Metal Mines</h4>
<h4>- Non-ferrous Metal Processing Plants</h4>
<h4>- Refractory Abrasive and Other Industrial Mineral Operations</h4>
<h4>- Sand and Gravel Operations</h4>
<h4>- Uranium Deposits</h4>
<h3>Population</h3>
<h4>- Population by Tract</h4>
<h3>Transportation</h3>
<h4>- Airports</h4>
<h4>- Bridges</h4>
<h4>- Ports</h4>
<h4>- Railroads</h4>
<h4>- Roads</h4>
<h4>- Spaceports</h4>
<h3>Waste</h3>
<h4>- Solid Waste Landfill Facilities</h4>
<h3>Water</h3>
<h4>- Aquifers</h4>
<h4>- Dams</h4>
<h4>- USACE Owned and Operated Reservoirs</h4>
<h3>Contribution and Improvements</h3>
<p>I am making this repository open source because an issue as important as critical infrastructure must be mapped out for the average person to see and measure. Contributions are more than welcome to anyone who would like to contribute. Below is a current list of some items that I was not able to find/measure and would welcome any solution to these issues.</p>
<h5>Other HIFLD Datasets:</h5>
<p>There are many other datasets from HIFLD that I have not processed, over 400 to be exact. I would love help in expanding for all of these datasets.</p>
<h5>All Manufacturing Facilities</h5>
<p>The General Manufacturing Facilities dataset has a lot of variability, and all of the NAICS codes need to be graded manually one by one. There are over 65,000 of them as well.</p>
</div>
<!-- Footer -->
<footer>
<p>© 2024 CI Sentinel</p>
<p>Created by Thomas Numnum</p>
</footer>
</body>
</html>