gedcom2sem

connects GedCom files with the semantic web

View project on GitHub

Modeling your data

When preparing your data for the semantic web, the first step is choose or create an ontology. However, that is not the last choice to make. The set of rules provided for the transform command is just one possible outcome of the proces.

Ontologies or RDF schemas

Some ontologies that surfaced from a quick survey:

Indirect relations

A resource describing a person in a family tree will usually contain children, spouses and parents, but maybe also siblings, grandparents and grandchildren. However, these indirect relations produce a mess when shown in a graph, even with just three generations.

The original text page of the screenshots below, the full size graph and the interactive graph

Inverse relations

When presenting your data with a SPARQL endpoint for querying, you could decide to store the spouse relation with just one of both involved persons. But when crawling the web (as for example with the LodLive graph), it is usefull to have all indirect relations at hand.

Ancestors and/or Descendants

The person to person relation will produce a proper pedigree graph, but a graph that shows descendant shows crossing lines unless you omit spouses. Replacing the person to person relations by family nodes might even solve the crossing lines in a mixed graph, but makes the data less recognizable for foaf tools emerging from social media.