-
Notifications
You must be signed in to change notification settings - Fork 5
Home
Genealogy work is complicated. Fortunately, we have computers to help us manage complex work. But that puts a significant responsibility on software developers to create the right tools to get the job done well.
In the past, genealogical software was primarily used to manage somebody's conclusions about their genealogy. In order to share those conclusions with their family, or to transport their conclusions to a new computer, the conclusions had to be saved to a disk. GEDCOM was the name of the standard way to save those files to disk.
The world has shifted. Computers are being used much more broadly across all aspects of the genealogical research process. With the arrival of the Internet and the World Wide Web, genealogists are using computers to:
- Make records available online as digital artifacts
- Index and annotate online artifacts so as to make them searchable
- Search for records and other genealogical information
- Make conclusions based on sound evidence found in records
- Support conclusions by accurately citing the sources of the evidence
- Identify contradictory evidence and alternate theories
- Share and collaborate on genealogy work
GEDCOM X is the industry standard for facilitating these activities.
Consider the depiction above, illustrating how GEDCOM X can be used to facilitate genealogy work.
Information on an ancestor can be gathered from multiple sources. The depiction above shows three (of the many) possibilities: a book, a photo, and a census. Note the important distinction between the real, physical manifestation of the sources and their digital representation. The GEDCOM X domain refers to the former as physical artifacts and the latter as digital artifacts.
Both physical artifacts and digital artifacts can be cited as sources for genealogical data. And sources can, in turn, cite other sources as the "source of the source". A JPEG image of a birth certificate, for example, can cite the certificate itself (the physical artifact) as its source. The concept of the "source of the source" is important to measuring the validity and accuracy of genealogical data.
GEDCOMX specifies a model for describing a source in terms of its metadata and for transcribing the contents of the source.
The metadata of the source is data "about" the source. Metadata includes things like the title, publisher, publication date, author, and (especially important to genealogical research) the bibliographic citation for the source. GEDCOM X uses the terms defined by the Dublin Core Metadata Initiative to define standard source metadata. The blue elements in the illustration above represent the source metadata.
The process of making the contents of the source available is called indexing. For example, if the source is a census that lists a John Smith born January 1, 1880 then the result of indexing the source will be a piece of structured digital data called a record that specifies a persona with a name "John Smith" and a birth event with "January 1, 1880" as the text of the date. Technically, the record is also data "about" the source (and hence can be considered source "metadata") but it's useful to distinguish it as a special case because it is separate from the Dublin Core terms and has particular significance to genealogical applications.
The green elements in the illustration above represent the record data.
Genealogical conclusions should be based on sound evidence supported by properly-cited sources. The source metadata can can be used to properly cite the evidence for conclusion data and to support the genealogical proof standard. The record data can be used to supply conclusion data and to measure its validity. The conclusion data is represented in red above.
Not everybody is interested in the entire scope of GEDCOM X. So the GEDCOM X standard is divided into distinct components called "profiles". Each profile is designed to meet the needs of a well-defined set of related requirements. The depiction above illustrates two distinct GEDCOM X profiles: the Record Profile in green and the Conclusion Profile in red. The definition of the blue pieces is defined by the Metadata Profile using common standards such as Dublin Core.
There are other GEDCOM X profiles not depicted above. For example, the standard way to search genealogical data is specified by the Search Profile and the standard way to create and edit conclusion via the World Wide Web is specified by the Conclusion WWW Profile. To read about the different GEDCOM X profiles, see the profile documentation.
To learn how to produce and consume GEDCOM X from your application, take a look at the Developers Guide.
To read the documentation on the GEDCOM X domain and its different profiles and namespaces, try starting at Profiles.
To get involved with the development and enhancement of the GEDCOM X standard, read about the Community.