generated from DS4200-S23-Class/project
-
Notifications
You must be signed in to change notification settings - Fork 0
/
Copy pathindex.html
121 lines (81 loc) · 7.98 KB
/
index.html
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
<!DOCTYPE html>
<html>
<head>
<meta http-equiv="content-type" content="text/html; charset=UTF-8"/>
<!--include style sheet-->
<link rel="stylesheet" type="text/css" href="style.css">
<!--include d3 library -->
<script src="js/d3.v6.1.1/d3.min.js"></script>
<script src="js/topojson.min.js"></script>
<title>DS4200 Final Project: Ritu Shah & Angeline Teo</title>
</head>
<body>
<h1>Health Data Breaches in the United States</h1>
<div>
<h2>Motivation</h2>
<p>Healthcare organizations face significant challenges in safeguarding patient data due to the constantly evolving threat of cyberattacks. A data breach could not only result in reputational damage and legal consequences, but also cause harm to patients. Healthcare organizations need to conduct regular risk assessments to identify potential vulnerabilities and threats to patient data security and develop data security strategies to mitigate those risks. The use of a visualization tool may be effective for healthcare organizations in states across the United States to enhance their security systems and safeguard patient data. This visualization tool will showcase health data breaches reported by the U.S. Department of Health and Human Services that are currently being investigated by the Office for Civil Rights. Attributes that will be visible are the name of the state, the state the breaches occurred, the number of breaches by breach type, the type of breach, the location of breach information, and the year of breach. This can be used to identify connections with states that have experienced data breaches, allowing healthcare organizations to conduct more targeted risk assessments and develop more effective data security strategies.
This visualization will support two domain tasks. The first is conducting a risk assessment. Healthcare organizations need to conduct regular risk assessments to identify potential vulnerabilities and threats to patient data security. By using the visualization tool to identify connections with states that have experienced data breaches, the healthcare organization can conduct a more targeted risk assessment that focuses on specific areas of concern based on the type of data breach. This task requires specialized knowledge of healthcare data security and risk management principles, as well as the ability to analyze data and identify potential risks. Another domain task is developing a data security strategy. Once potential risks have been identified through the risk assessment, the healthcare organization needs to develop a data security strategy to mitigate those risks. This task requires a deep understanding of healthcare data security regulations and best practices, as well as the ability to develop and implement effective security measures. By using the visualization tool to identify connections with states that have experienced data breaches, the healthcare organization can develop a more targeted data security strategy that addresses specific areas of concern.
</p>
</div>
<div>
<h2>Background</h2>
<h4>Data</h4>
<p>Link to original data:
<a href="https://ocrportal.hhs.gov/ocr/breach/breach_report.jsf">https://ocrportal.hhs.gov/ocr/breach/breach_report.jsf</a>
</p>
<p>The data was collected by the U.S. Department of Health and Human Services that are currently being investigated by the Office for Civil Rights. The original dataset can be found in this link: https://ocrportal.hhs.gov/ocr/breach/breach_report.jsf. The U.S. Department of Health and Human Services is a sector of the United States government that promotes the well-being and health of all Americans by supporting services that emphasize the science behind medicine, public health, and social services. The data demonstrates a list of data breaches reported within the last 24 months that have affected 500 or more individuals with unsecured health information. It shows the name of the covered entity, also known as the organization or company that was affected by a data breach, the type of covered entity, and the type of breach. Additionally, the state, the date of the submitted breach, and the number of individuals affected are shown.
Since this data is directly submitted to the U.S. government, there should be no bias as it is information that should be neutralized without any preference. The data is covered from all places around the United States and does not only show one part of the country. The only aspect that is disadvantageous is the option of "Other" which is included in the "Type of Breach" column. This option does not fully specify what type of breach was committed. This information should be fully open about every type of data breach that exists to create more awareness among users of healthcare and allow readers of the data to see if a specific type was more common compared to other breaches or if it would have an impact on the statistics of the data.
The data doesn’t have any missing or unexpected values. There are no outliers, and the data contains all the attributes we want to visualize. However, the values under "Name of Covered Entity" in the data are messy, containing characters that do not make logical sense, extraneous information, and inconsistent capitalization and formatting (ex. "doing business as" is written as "d/b/a", "dba", "doing business as"). In terms of data cleaning, there was an additional column included called "Web Description" that had no data, so this column was removed since it was not essential to our visualization. There were also white spaces towards the end of certain words which were eventually removed. All the data had consistent wording and spelling and did not require any change. There were no new, derived attributes added to the data.
</p>
<h4>Demo Video</h4>
<div>
<video controls>
<source src="final_demo_video.mp4" type="video/mp4">
<track src="captions.vtt" kind="subtitles" srclang="eng" label="English" default>
</div>
<h4>Report</h4>
<p>
<a href="finalreport.pdf"> Click here</a> to read the report.
</p>
</div>
<div>
<h2>Visualization</h2>
<p>The bar plot below shows the number of breaches across the nation by breach type. Select a state on the US map and click a year to update the bar plot and see the number of data breaches for the state and year you selected!</p>
<br>
<h3>U.S. Map: Select a State</h3>
<div id="map"></div>
<br>
<br>
<h3>Number of HIPPA Breaches by Type</h3>
<button onclick="update('2021')">2021</button>
<button onclick="update('2022')">2022</button>
<button onclick="update('2023')">2023</button>
<div id="vis2">
</div>
</div>
<div>
<h2 id= "acknowledgements">Acknowledgements</h2>
<div>
<ul>
<li>
<a href="https://www.geeksforgeeks.org/html-lists/?ref=lbp">https://www.geeksforgeeks.org/html-lists/?ref=lbp</a>
</li>
<li>
<a href="http://using-d3js.com/04_04_working_with_dates.html">http://using-d3js.com/04_04_working_with_dates.html</a>
</li>
<li>
<a href="https://simplemaps.com/">https://simplemaps.com/</a>
</li>
<li>
<a href="https://d3-graph-gallery.com/graph/barplot_button_data_csv.html">https://d3-graph-gallery.com/graph/barplot_button_data_csv.html</a>
</li>
<li>
<a href="https://github.com/topojson/us-atlas">https://github.com/topojson/us-atlas</a>
</li>
</ul>
</div>
</div>
</body>
<!--include my js -->
<script src="js/main.js"></script>
</html>