Skip to content

Genealogical Relationships Research

Genealogical systems have decades of experience modeling:

  • Billions of person records (FamilySearch has 1.5+ billion)
  • Complex relationships: biological, adoptive, step, foster, guardianship
  • Temporal relationships: marriages, divorces, remarriages
  • Evidence and certainty: confidence levels, source citations, proof standards
  • Non-traditional families: same-sex couples, blended families

They’ve solved hard problems in relationship modeling that apply to any entity graph.


StandardEraStatusPurpose
GEDCOM 5.51995Legacy (widely used)Text-based file format for family tree exchange
GEDCOM 5.5.11999Industry standardMinor update, still dominant
GEDCOM X2012ActiveModern data model for evidence-based genealogy
FamilySearch GEDCOM 7.02021CurrentUpdated file format building on GEDCOM X concepts

FamilySearch’s Family Tree API supports exactly two relationship types:

Relationship TypeDescriptionStructure
Couple RelationshipA relationship between two people (spouses/partners)Links two Person records
Child-and-Parents RelationshipA relationship between a child and their parentsLinks child to up to two parents

This is remarkably simple. All family structures emerge from just these two primitives.

Traditional GEDCOM uses a FAM (Family) record as the linking structure:

0 @F1@ FAM
1 HUSB @I1@ (husband/partner 1)
1 WIFE @I2@ (wife/partner 2)
1 CHIL @I3@ (child)
1 CHIL @I4@ (another child)
1 MARR
2 DATE 15 JUN 1985
2 PLAC Chicago, Illinois

The FAM record is the only source of links between individuals. This is similar to a join table in relational databases.

Key Insight: Sibling Relationships Are Implied

Section titled “Key Insight: Sibling Relationships Are Implied”

Neither GEDCOM nor FamilySearch explicitly models sibling relationships. Instead:

  • Siblings are inferred from shared parents
  • Half-siblings share exactly one parent
  • Step-siblings share no biological parents but have parents in a couple relationship

This avoids redundancy and keeps the model simple.


Type URIDescription
http://gedcomx.org/BiologicalParentBirth parent
http://gedcomx.org/AdoptiveParentLegally adopted the child
http://gedcomx.org/StepParentMarried to biological parent
http://gedcomx.org/FosterParentTemporary guardian
http://gedcomx.org/GuardianParentLegal guardian

GEDCOM 5.5.1 uses the PEDI tag to specify parent-child relationship type:

0 @I1@ INDI
1 NAME Child /Name/
1 FAMC @F1@
2 PEDI birth (biological)
1 FAMC @F2@
2 PEDI adopted (adoptive parents)

PEDI values:

  • birth — biological relationship
  • adopted — legal adoption
  • foster — foster care
  • sealing — LDS temple sealing (FamilySearch-specific)

A person can have multiple sets of parents:

  • Biological parents
  • Adoptive parents
  • Step-parents
  • Foster parents

Each is a separate Child-and-Parents relationship with its own type qualifier.


Type URIDescriptionDatePlaceMultiple
http://gedcomx.org/MarriageLegal marriageYesYesYes
http://gedcomx.org/DivorceLegal divorceYesYesYes
http://gedcomx.org/AnnulmentMarriage annulmentYesYesYes
http://gedcomx.org/CommonLawMarriageCommon law unionYesYesYes
http://familysearch.org/v1/LivedTogetherCohabitationNoNoNo
http://familysearch.org/v1/CoupleNeverHadChildrenFlag for childless coupleNoNoNo

Marriage as an Event: Marriage in GEDCOM is an event with date and place, not a boolean status. The MARR record documents when/where the marriage occurred.

Multiple Marriages: When couples marry, divorce, and remarry (even to the same person), best practice is:

  • Create a new FAM record for each marriage
  • Don’t combine multiple MARR/DIV events in one record
  • This maintains chronological clarity and allows other relationships in between

Divorce Without Marriage Record: GEDCOM allows a DIV (divorce) record without a corresponding MARR if the marriage date is unknown but divorce is documented.


GEDCOM X models genealogical data as conclusions that can have:

PropertyTypePurpose
confidenceConfidenceLevelHow certain is this data?
sourceSourceReference[]What sources support this?
attributionAttributionWho contributed this? When?
analysisDocument referenceWritten reasoning/proof
noteNote[]Additional context

Every fact, relationship, and person in GEDCOM X is a “conclusion” that can carry this metadata.

The GPS is the established framework for genealogical certainty:

  1. Reasonably exhaustive research in reliable sources
  2. Complete and accurate source citations for all facts
  3. Analysis and correlation of collected information
  4. Resolution of conflicting evidence
  5. Soundly reasoned, coherently written conclusion

Meeting GPS doesn’t mean “beyond doubt” — it’s closer to legal “clear and convincing” standard. Conclusions can be revisited when new evidence emerges.

TypeDefinitionExample
Direct EvidenceDirectly answers the questionBirth certificate listing parents
Indirect EvidenceRequires interpretation/correlationCensus showing household composition
Negative EvidenceAbsence suggests somethingPerson not in death records = likely alive
Source TypeDefinitionReliability
OriginalCreated at/near time of event by firsthand observerHigher
DerivativeCopied, transcribed, or abstracted from originalLower
AuthoredCreated after the fact (family histories, biographies)Varies

AspectGEDCOM 5.5GEDCOM X
FocusConclusions only (the family tree)Evidence + conclusions (the research process)
SourcesOptional attachmentsFirst-class citizens
ConfidenceNot supportedBuilt into conclusion model
FormatText-based, single formatJSON/XML, multiple serializations
LicenseProprietary (LDS)Open source (Apache 2.0)
MediaExternal links onlyBundled in file format
  1. Source Descriptions — Rich metadata about sources
  2. Source References — Link conclusions to supporting sources
  3. Evidence References — Explicitly model evidence chains
  4. Contributor Attribution — Track who added what and when
  5. Analysis Documents — Written reasoning and proof arguments
  6. Confidence Levels — Express certainty about conclusions
1984: GEDCOM created by LDS Church
1995: GEDCOM 5.5 released
1999: GEDCOM 5.5.1 (minor update)
2012: GEDCOM X introduced (evidence-based model)
2021: FamilySearch GEDCOM 7.0 (modernized file format)

GEDCOM approach:

0 @I1@ INDI
1 NAME Adopted /Child/
1 FAMC @F1@ (biological family)
2 PEDI birth
1 FAMC @F2@ (adoptive family)
2 PEDI adopted
1 ADOP (adoption event)
2 DATE 11 JAN 1990
2 FAMC @F2@ (which family adopted)
3 ADOP BOTH (both parents adopted)

FamilySearch approach:

  • Create Child-and-Parents relationship to biological parents with BiologicalParent fact
  • Create separate Child-and-Parents relationship to adoptive parents with AdoptiveParent fact
  • Both relationships coexist

When a biological parent remarries:

  1. Child has BiologicalParent relationship to birth parents
  2. Child can have StepParent relationship to parent’s new spouse
  3. No need to modify biological relationships

Blended Families (“Patchwork Families”)

Section titled “Blended Families (“Patchwork Families”)”

Multiple FAMC (family child) links handle complex blended families:

  • Each child links to their biological parents
  • Additional links for step/adoptive relationships
  • Couple relationships tie adults together

FamilySearch (as of recent update):

  • Allows documenting same-sex marriages and adoptions
  • Spouse selection no longer restricted by sex
  • Photos, stories, documents accepted for same-sex relationships
  • Required significant technical redesign of tree search systems

Limitations:

  • FamilySearch is operated by LDS Church
  • Same-sex couples cannot be sealed in temple
  • Children cannot be sealed to same-sex parents
  • This is genealogical documentation, not religious endorsement

GEDCOM 7.0:

  • Updated to allow same-sex partnerships
  • HUSB/WIFE terminology being reconsidered for neutrality

IssueDescription
Nuclear family assumptionOriginal design assumed husband/wife/children model
No embedded mediaOnly links to external files
Inconsistent implementationsSoftware interprets spec differently
Proprietary extensionsVendors add non-standard tags
Slow evolution26 years between 5.5.1 and 7.0
Contradictions in specSome requirements are internally inconsistent

Developers face a dilemma:

  • Comply strictly with GEDCOM spec → incompatible with common software
  • Match common practice → deviate from official standard

Many genealogy programs don’t fully implement the standard, creating an ecosystem of partial compatibility.

The FAM record assumes:

  • Exactly 0-2 adults (HUSB/WIFE)
  • Any number of children
  • Adults are in a “couple” relationship

This struggles with:

  • Polyamorous relationships
  • Communal child-rearing
  • Cultures with different family structures
  • Situations where “family” has different meaning

While GEDCOM X has excellent evidence modeling, many users:

  • Don’t use confidence levels
  • Skip source citations
  • Treat it like GEDCOM 5.5 (conclusions only)

The tools support sophisticated research; user behavior lags.


0 @I1@ INDI
1 NAME John /Smith/
1 SEX M
1 BIRT
2 DATE 1 JAN 1900
2 PLAC New York, New York
1 DEAT
2 DATE 15 MAR 1975
2 PLAC Los Angeles, California
1 FAMC @F1@ (family as child)
2 PEDI birth
1 FAMS @F2@ (family as spouse)
0 @F1@ FAM
1 HUSB @I2@
1 WIFE @I3@
1 CHIL @I1@
1 MARR
2 DATE 15 JUN 1985
2 PLAC Chicago, Illinois
1 DIV
2 DATE 3 MAR 1995
{
"id": "R1",
"type": "http://gedcomx.org/Couple",
"person1": { "resource": "#P1" },
"person2": { "resource": "#P2" },
"facts": [
{
"type": "http://gedcomx.org/Marriage",
"date": { "original": "15 June 1985" },
"place": { "original": "Chicago, Illinois" },
"confidence": "http://gedcomx.org/High"
}
],
"sources": [
{ "description": "#S1" }
]
}
{
"id": "R2",
"type": "http://gedcomx.org/ParentChild",
"person1": { "resource": "#P1" }, // parent
"person2": { "resource": "#P3" }, // child
"facts": [
{
"type": "http://gedcomx.org/BiologicalParent"
}
]
}

FamilySearch models all family structures with just:

  • Couple relationships
  • Parent-child relationships

Sibling, grandparent, cousin, etc. are computed, not stored.

Relationships in GEDCOM X have:

  • Their own IDs
  • Facts/events attached
  • Source citations
  • Confidence levels
  • Temporal data

They’re full entities, not just foreign keys.

Sophisticated systems separate:

  • Information — raw data from sources
  • Evidence — information that answers a question
  • Conclusion — accepted answer with supporting evidence

This three-layer model enables proper reasoning and revision.

Don’t model “married: true/false”. Model:

  • Marriage event (date, place)
  • Divorce event (date, place)
  • Remarriage event (date, place)

The current status is computed from event history.

A person can have multiple:

  • Parent relationships (biological + adoptive + step)
  • Spouse relationships (sequential marriages)
  • The same two people can have multiple relationships (married, divorced, remarried)

Both GEDCOM and GEDCOM X support:

  • Standard relationship types
  • Custom types via data URIs
  • Extension mechanisms for domain-specific needs

  • Genealogical Proof Standard
  • Evidence Explained by Elizabeth Shown Mills
  • Mastering Genealogical Proof by Thomas W. Jones
  • Genealogy Standards from Board for Certification of Genealogists

  1. How do other cultures model family? — GEDCOM assumes Western nuclear family. What about extended family structures, clan systems, or cultures with different kinship terminology?

  2. What’s the right abstraction level? — FamilySearch’s two-relationship model is elegant but requires computation for common queries (siblings, grandparents). Is this the right trade-off?

  3. How should confidence propagate? — If a parent-child relationship has low confidence, how does that affect inferred relationships (siblings, grandparents)?

  4. Relationship vs. Role? — Is “biological parent” a relationship type or a role within a parent-child relationship? GEDCOM X uses “facts” on relationships, which is role-like.

  5. Time modeling — Relationships have beginnings (marriage) and endings (divorce, death). How granular should temporal modeling be?